Abstract
The authors present the process-replication protocol of Manetho, a system whose goal is to provide efficient, application-transparent fault tolerance to long-running distributed computations. Manetho uses a novel negative-acknowledgment multicast protocol to enforce the same receipt order of application messages among all replicas of a process. The protocol depends on a combination of antecedence graph maintenance, a form of sender-based message logging, and the fact that the receivers of each multicast execute the same deterministic program. This combination allows the protocol to void the delay in application message delivery that is common in existing negative-acknowledgment multicast protocols, without giving up the advantage of requiring only a small number of control messages.
Original language | English (US) |
---|---|
Title of host publication | FTCS 1992 - 22nd Annual International Symposium on Fault-Tolerant Computing |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
Pages | 18-27 |
Number of pages | 10 |
ISBN (Electronic) | 0818628758, 9780818628757 |
DOIs | |
State | Published - 1992 |
Event | 22nd Annual International Symposium on Fault-Tolerant Computing, FTCS 1992 - Boston, United States Duration: Jul 8 1992 → Jul 10 1992 |
Publication series
Name | FTCS 1992 - 22nd Annual International Symposium on Fault-Tolerant Computing |
---|
Conference
Conference | 22nd Annual International Symposium on Fault-Tolerant Computing, FTCS 1992 |
---|---|
Country/Territory | United States |
City | Boston |
Period | 07/8/92 → 07/10/92 |
Bibliographical note
Publisher Copyright:© 1992 IEEE.
ASJC Scopus subject areas
- Software
- Safety, Risk, Reliability and Quality
- Hardware and Architecture