| Probabilistic failure detection protocol based on "A Gossip-Style Failure Detection Service"
by van Renesse, Minsky, and Hayden.
Each member maintains a list of all other members. For each member P, two values are kept: a heartbeat
counter and the time at which that counter was last incremented. Each member periodically increments its
own counter and periodically sends its heartbeat counter list to a randomly chosen member Q. Q updates its
own heartbeat counter list and, for each counter that was incremented, the associated time. If, when sending
its heartbeat counter list, a member P detects that another member Q's heartbeat counter has not been
incremented for timeout seconds, Q is suspected.
This protocol can be used with both PBCAST *and* regular stacks.
author: Bela Ban 1999 version: $Revision: 1.10.6.1 $ |
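
The bookkeeping described above can be sketched in a few lines of Java. This is a minimal illustration under stated assumptions, not the protocol's actual implementation: the class GossipMember, its nested Entry, and the methods tick, gossip, merge, and suspects are hypothetical names, the current time is passed in explicitly (in seconds) so that the behaviour is deterministic, and picking a random peer and sending the list over the network are left out.

```java
import java.util.HashMap;
import java.util.HashSet;
import java.util.Map;
import java.util.Set;

/** Hypothetical sketch of one member's heartbeat table (illustrative names only). */
final class GossipMember {

    /** Heartbeat counter plus the local time at which it last increased. */
    static final class Entry {
        long counter;
        long lastIncrementSeconds;

        Entry(long counter, long lastIncrementSeconds) {
            this.counter = counter;
            this.lastIncrementSeconds = lastIncrementSeconds;
        }
    }

    private final String name;
    private final long timeoutSeconds;
    private final Map<String, Entry> table = new HashMap<>();

    GossipMember(String name, long timeoutSeconds, long nowSeconds) {
        this.name = name;
        this.timeoutSeconds = timeoutSeconds;
        table.put(name, new Entry(0, nowSeconds));
    }

    /** Periodic step: increment this member's own heartbeat counter. */
    void tick(long nowSeconds) {
        Entry self = table.get(name);
        self.counter++;
        self.lastIncrementSeconds = nowSeconds;
    }

    /** Snapshot of all known counters, as it would be sent to a randomly chosen member. */
    Map<String, Long> gossip() {
        Map<String, Long> out = new HashMap<>();
        for (Map.Entry<String, Entry> e : table.entrySet()) {
            out.put(e.getKey(), e.getValue().counter);
        }
        return out;
    }

    /** Merge a received list: adopt larger counters and restamp the time only on an increase. */
    void merge(Map<String, Long> received, long nowSeconds) {
        for (Map.Entry<String, Long> e : received.entrySet()) {
            Entry known = table.get(e.getKey());
            if (known == null) {
                table.put(e.getKey(), new Entry(e.getValue(), nowSeconds));
            } else if (e.getValue() > known.counter) {
                known.counter = e.getValue();
                known.lastIncrementSeconds = nowSeconds;
            }
        }
    }

    /** Other members whose counter has not increased for at least the timeout. */
    Set<String> suspects(long nowSeconds) {
        Set<String> out = new HashSet<>();
        for (Map.Entry<String, Entry> e : table.entrySet()) {
            boolean isSelf = e.getKey().equals(name);
            if (!isSelf && nowSeconds - e.getValue().lastIncrementSeconds >= timeoutSeconds) {
                out.add(e.getKey());
            }
        }
        return out;
    }
}
```

In this sketch a member P would call tick on its own timer, pass the result of gossip to merge on the chosen peer, and call suspects just before sending. Stamping with the receiver's local time means members compare only their own clock readings, so the sketch does not assume synchronized clocks.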