High Integrity System Design

The dependability is a main concern in all real time critical function system design such as patient monitoring, avionics, space, and automatic teller machines (ATMs). High Integrity Systems Design (HISD) research program addresses issues related to design and analysis of such systems.

We have developed new techniques for fault detection, diagnosis, reconfiguration, redundancy management, and have developed new system architectures incorporating these techniques. Currently, our focus is on developing techniques and protocols for efficient error detection and recovery in cache memory while maintaining synchronization among high-performance redundant processor systems employing cache memories for use in real-time applications. The effective design of such systems relies heavily on the ability to recover from transient faults which have been estimated to occur at a rate of 5 to 100 times that of the permanent ones. We are also addressing the issue of using idle capacity for fault diagnosis.

We are developing reliability analysis methods and tools for the analysis of large and potentially complex fault tolerant systems.

Principal Investigator: Somani

webmaster@cs.washington.edu