Replicated distributed processes in Manetho

E. N. Elnozahy, W. Zwaenepoel

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review

20 Scopus citations

Abstract

The authors present the process-replication protocol of Manetho, a system whose goal is to provide efficient, application-transparent fault tolerance to long-running distributed computations. Manetho uses a novel negative-acknowledgment multicast protocol to enforce the same receipt order of application messages among all replicas of a process. The protocol depends on a combination of antecedence graph maintenance, a form of sender-based message logging, and the fact that the receivers of each multicast execute the same deterministic program. This combination allows the protocol to avoid the delay in application message delivery that is common in existing negative-acknowledgment multicast protocols, without giving up the advantage of requiring only a small number of control messages.
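As a rough illustration of the general idea of negative acknowledgments mentioned in the abstract, the sketch below shows a receiver that stays silent while messages arrive in sequence and sends a control message only when it detects a gap. This is not Manetho's protocol: the abstract gives no implementation details, and the antecedence graph, sender-based logging, and replica determinism it relies on are not modeled here. The class `NackReceiver`, its fields, and the use of per-sender sequence numbers are all assumptions made for this example.

```python
from dataclasses import dataclass, field


@dataclass
class NackReceiver:
    """Hypothetical receiver that delivers messages in sequence-number order."""
    next_seq: int = 0                              # next sequence number to deliver
    pending: dict = field(default_factory=dict)    # out-of-order messages held back
    delivered: list = field(default_factory=list)  # payloads in delivery order

    def receive(self, seq, payload):
        """Accept one message; return the sequence numbers to NACK (may be empty)."""
        if seq < self.next_seq or seq in self.pending:
            return []  # duplicate: nothing to deliver, nothing to request
        self.pending[seq] = payload
        # Deliver every message that is now contiguous with what was delivered.
        while self.next_seq in self.pending:
            self.delivered.append(self.pending.pop(self.next_seq))
            self.next_seq += 1
        if not self.pending:
            return []  # no gap, so no control message is sent
        # Messages are still held back, so request only the missing numbers.
        highest = max(self.pending)
        return [s for s in range(self.next_seq, highest) if s not in self.pending]


replica_a, replica_b = NackReceiver(), NackReceiver()

# Replica A receives the messages in order and never sends a control message.
for seq, payload in [(0, "m0"), (1, "m1"), (2, "m2")]:
    replica_a.receive(seq, payload)

# Replica B receives message 1 late: it holds back message 2 and NACKs 1.
nacks = [
    replica_b.receive(0, "m0"),
    replica_b.receive(2, "m2"),
    replica_b.receive(1, "m1"),
]
```

In this toy run both replicas end with the same delivery order, and the only control traffic is the single negative acknowledgment from replica B. The hold-back of message 2 is a property of this simplified scheme only.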

Original language: English (US)
Title of host publication: FTCS 1992 - 22nd Annual International Symposium on Fault-Tolerant Computing
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 18-27
Number of pages: 10
ISBN (Electronic): 0818628758, 9780818628757
State: Published - 1992
Event: 22nd Annual International Symposium on Fault-Tolerant Computing, FTCS 1992 - Boston, United States
Duration: Jul 8 1992 → Jul 10 1992

Publication series

Name: FTCS 1992 - 22nd Annual International Symposium on Fault-Tolerant Computing

Conference

Conference: 22nd Annual International Symposium on Fault-Tolerant Computing, FTCS 1992
Country/Territory: United States
City: Boston
Period: 07/8/92 → 07/10/92

Bibliographical note

Publisher Copyright:
© 1992 IEEE.

ASJC Scopus subject areas

  • Software
  • Safety, Risk, Reliability and Quality
  • Hardware and Architecture
