Browse > Article
http://dx.doi.org/10.3745/KIPSTA.2005.12A.6.507

Partially Decentralized Passive Replication Algorithm  

Ahn, Jin-Ho (경기대하교 정보과학부 전자계산학과)
Abstract
This paper presents a partially decentralized passive replication algorithm for deterministic servers in message-passing distributed systems. The algorithm allows any backup server, not necessarily the primary server, to take responsibility for processing its received client request and coordinating with the other replica servers after obtaining the delivery sequence number of the request from the primary. Thanks to thus desirable feature, the algorithm with conventional load balancing techniques can efficiently avoid extreme load conditions on the primary. Therefore, it can provide better scalability of deterministic and replicated sewer systems than traditional passive replication algorithms. Simulation results indicate that the proposed algorithm can reduce $16.5\%{\~}52.3\%$ of the average response time of a client request compared with the traditional ones.
Keywords
Message-passing Distributed System; Fault-tolerance; passive Replication; Deterministic Servers; Scalability;
Citations & Related Records
연도 인용수 순위
  • Reference
1 M. F. Wiesmann, A. Schiper, B. Kemme and G. Alonso, 'Understanding Replication in Databases and Distributed Systems,' In Proc, of the 21st International Conference on Distributed Computing Systems, pp.464-474, 2000   DOI
2 R. D. Schlichting and F. B. Schneider, 'Fail-stop processors: an approach to designing fault-tolerant distributed computing systems,' ACM Transactions on Computer Systems, Vol.1, pp.222-238, 1985   DOI
3 F. Schneider, 'Implementing fault-tolerant services using the state machine approach: A tutorial,' ACM Computing Surveys, Vol.22, pp.299-319, 1990   DOI
4 A. Spector, 'Performing remote operations efficiently on local computer network,' Communications of the ACM, Vol.25, No.4, pp.246-260, 1982   DOI   ScienceOn
5 R. B. Strom and S. Yemeni, 'Optimistic recovery in distributed systems,' ACM Transactions on Computer Systems, Vol.3, pp.204-226, 1985   DOI   ScienceOn
6 X. Defago and A. Schiper, 'Semi-passive Replication and Lazy Consensus,' Journal of Parallel and Distributed Computing Systems, Vol.64, No.12, pp.1380-1398, 2004   DOI   ScienceOn
7 M. J. Fischer, N. A. Lynch and M. S. Paterson, 'Impossibility of distributed consensus with one faulty process,' Journal of ACM, Vol.32, pp.374-382, 1985   DOI   ScienceOn
8 M. Malcom and W. Zhao, 'Hard real time communication in multiple-access networks,' Real-Time Systems, Vol.8, pp.35-77, 1995   DOI
9 R. Guerraoui and A. Schiper, 'Software-Based Replication for Fault Tolerance,' IEEE Computer, Vol.30, pp.68-74, 1997   DOI   ScienceOn
10 M. Herlihy and J. Wing, 'Linearizability: a correctness condition for concurrent objects,' ACM Transactions on Progr. Languages and Syst., Vol.12, No.3, pp.463-492, 1990   DOI
11 D. Powell, M. Chereque and D. Drackley, 'Fault-tolerance in Delta-4.' ACM Operating Systems Review, Vol.25, pp.122-125, 1991   DOI
12 K. P. Birman, T. A. Joseph, T. Raeuchle and A. E. Abbadi, 'Implementing fault-tolerant distributed objects,' IEEE Transactions on Software Engineering, Vol.11, No.6, pp.502-508, 1985   DOI   ScienceOn
13 V. Cardellini, M. Colajanni and P. Yu, 'Dynamic load balancing on Web- server systems,' IEEE Internet Computing, Vol.3, pp.28-39, 1999   DOI   ScienceOn
14 R. Bagrodia, R. Meyer, M. Takai, Y. Chen, X. Zeng, J. Martin and H. Y. Song, 'Parsec: A Parallel Simulation Environments for Complex Systems,' IEEE Computer, pp.77-85, 1998   DOI   ScienceOn
15 H. Bryhni, E. Klovning and O. Kure, 'A Comparison of Load Balancing Techniques for Scalable Web Servers,' IEEE Network, Vol.14, pp.58-64, 2000   DOI   ScienceOn
16 N. Budhiraja, K. Marzullo, F. Schneider and S. Toueg, 'The primary-backup approach, Distributed Systems,' ch.8, pp.199-216, 2nd Ed., Addison-Wesley, 1993
17 T. D. Chandra and S. Toueg, 'Unreliable failure detectors for reliable distributed systems,' Journal of ACM, Vol.43, No.2, pp.225-267, 1996   DOI   ScienceOn
18 K. P. Birman and T. A. Joseph, 'Reliable communication in the presence of failures,' ACM Transactions on Computer Systems, Vol.5, No.1, pp.47-76, 1987   DOI   ScienceOn