RTOS fault tolerance, error detection, and correction
Important developments are taking place in the areas of fault tolerance and high availability. Fault tolerant and high availability systems require thoughtful design of both the hardware and the software, with the goal of providing error detection, error correction, fault tolerance, and ease of repair. Compact PCI’s hot swap specification has defined the hardware requirements for fault tolerant, high availability systems, and now significant progress is being made in defining the software requirements. In this article, Curt lists the operating system features that are important when building fault tolerant, high availability systems, and discusses why each is important.