Fault tolerance has been an active research area for many years. This volume presents papers from a workshop held in 1993 where a small number of key researchers and practitioners in the area met to discuss the experiences of industrial practitioners, to provide a perspective on the state of the art of fault tolerance research, to determine whether the subject is becoming mature, and to learn from the experiences so far in order to identify what might be important research topics for the coming years. The workshop provided a more intimate environment for discussions and presentations than usual at conferences. The papers in the volume were presented at the workshop, then updated and revised to reflect what was learned at the workshop.
Top five challenges facing the practice of fault-tolerance.- Fault tolerance cost effectiveness.- Fault tolerant applications systems; A requirements perspective.- Scalable shared memory multiprocessors: Some ideas to make them reliable.- Application of compiler-assisted rollback recovery to speculative execution repair.- Fault tolerance: Why should I pay for it?.- Stable disk - A fault-tolerant cached RAID subsystem.- Simple design makes reliable computers.- Roll-forward checkpointing schemes.- Fault-tolerant architectures - Past, present and (?) future.- A highly available application in the transis environment.- Reliable enterprise computing systems.- Fault tolerance for clusters of workstations.- Two techniques for transient software error recovery.- Software-faults: The remaining problem in fault tolerant systems?.- Fault tolerance enablers in the CHORUS microkernel.- A reliable client-server model on top of a micro-kernel.- Distributed fault tolerance - Lessons learnt from Delta-4.- Arjuna and Voltan: Case studies in building fault-tolerant distributed systems using standard components.- Fault tolerant platforms for emerging telecommunications markets.- Fault-tolerance in embedded real-time systems.- The systematic design of large real-time systems or interface simplicity.- Fault tolerance in embedded real-time systems: Importance and treatment of common mode failures.- Highly-available data services for UNIX client-server networks: Why fault-tolerant hardware isn't the answer.- The management of replicated data.
Series: Lecture Notes in Computer Science
Number Of Pages: 314
Published: 28th February 1994
Publisher: Springer-Verlag Berlin and Heidelberg Gmbh & Co. Kg
Country of Publication: DE
Dimensions (cm): 23.39 x 15.6
Weight (kg): 0.47