You are here: Home Tutorials Tutorial 1
Tutorial 1 PDF Print E-mail


Tutorial 1 (Confirmed)

IEEE Computer Society Italy Chapter sponsored lecturer:

Prof. Kishor S. Trivedi
Hudson Professor of Electrical and Computer Engineering
Duke University, Department of Electrical and Computer Engineering
Durham, USA.

Tutorial 1. September 22, 2011,  h. 14:00-18:00, room B,  ground floor

Prof. Kishor S. Trivedi

Title: Software Fault Tolerance in the Twenty First Century

Abstract: Safety critical and other technical systems contain significant amount of software.
Several recent studies have established that most system outages are due to software faults.
Traditional methods of fault avoidance, extensive testing and design/data diversity-based software fault tolerance are found wanting. In this tutorial a new view of fault tolerance of software-based systems is discussed. We begin by classifying software faults into Bohrbugs and Mandelbugs, and identify aging-related bugs as a subtype of the latter. Traditional methods have been designed to deal with Bohrbugs. We examine mitigation methods for Mandelbugs in general and aging-related bugs in particular. These include, restart of application software, failover to an identical copy of the software application, reboot of the operating system and proactive method of software maintenance known as rejuvenation. Analysis of measurement data will be shared. Finally, techniques for the quantitative reliability and availability assurance for such systems will be addressed.

Instructor Bio: Kishor S. Trivedi holds the Hudson Chair in the Department of Electrical and Computer Engineering at Duke University, Durham, NC. He has been on the Duke faculty since 1975. He is the author of a well known text entitled, Probability and Statistics with Reliability, Queuing and Computer Science Applications, published by Prentice-Hall; a thoroughly revised second edition (including its Indian edition) of this book has been published by John Wiley. He has also published two other books entitled, Performance and Reliability Analysis of Computer Systems, published by Springer and Queueing Networks and Markov Chains, John Wiley. He is a Fellow of the Institute of Electrical and Electronics Engineers. He is a Golden Core Member of IEEE Computer Society. He has published over 450 articles and has supervised 42 Ph.D. dissertations. He is the recipient of IEEE Computer Society Technical Achievement Award for his research on Software Aging and Rejuvenation. He works closely with industry in carrying our reliability/availability analysis, providing short courses on reliability, availability, performability modeling and in the development and dissemination of software packages  such as SHARPE and SPNP.

Expected Duration: 4 hours