Browse > Article

Improving Fault Tolerance for High-capacity Shared Distributed File Systems using the Rotational Lease Under Network Partitioning  

Tak, Byung-Chul (한국전자통신연구원 Embeded S/W 연구단)
Chung, Yon-Dohn (동국대학교 컴퓨터공학과)
Kim, Myoung-Ho (한국과학기술원 전산학과)
Abstract
In the shared storage file system, systems can directly access the shared storage device through specialized data-only subnetwork unlike in the network attached file server system. In this shared-storage architecture, data consistency is maintained by some designated set of lock servers which use control network to send and receive the lock information. Furthermore, lease mechanism is introduced to cope with the control network failure. But when the control network is partitioned, participating systems can no longer make progress after the lease term expires until the network recovers. This paper addresses this limitation and proposes a method that allows partitioned systems to make progress under the partition of control network. The proposed method works in a manner that each participating system is rotationally given a predefined lease term periodically. It is also shown that the proposed mechanism always preserves data consistency.
Keywords
shared distributed file system; network partitioning; lease; data consistency; fault tolerance;
Citations & Related Records
연도 인용수 순위
  • Reference
1 George Coulouris, Jean Dollimore, Tim Kindberg, Distributed Systems Concepts and Design Third edition, Addison-Wesley 2001
2 Michael R. Garey and David S. Johnson. Computers and Intractability: A Guide to the Theory of NP-Comleteness. W.H.Freeman, 1979
3 J. Yin, L. Alvisi, M. Dahlin, and C. Lin. Using leases to support server-driven consistency in large scale systems. In Proc. of the 18th IntI. Conf. on Distributed Computing Systems, May 1998   DOI
4 C.A. Thekkath, T. Mann, and E.K. Lee. 'Frangipani: A Scalable Distributed File System,' Proceedings of the ACM Symposium on Operating Systems Principles, pp. 224-237, Dec. 1997
5 C. Gray and D. Cheriton, 'Lease: An efficient fault-tolerant mechanism for distributed file cache consistency,' Twelfth ACM Symposium on Operating Systems Principles, pp. 202 210, 1989   DOI
6 CXFS: A high-performance, multi-OS SAN file system from SGI. SGI White Paper. URL http://www.sgi.com/products/storage/tech/file_systems.html
7 R. C. Burns, R. M. Rees, and D. D. E. Long. Semi-Preemptible Locks for a Distributed File System. In Proceedings of the 2000 International Performance Computing and Communication Conference (IPCCC), IEEE, 2000   DOI
8 Frank Schmuck and Roger Haskin. GPFS: A Shared-Disk File System for Large Computing Clusters. Proceedings of the Conference on File and Storage Technologies (FAST'02), pp. 231-244, 2002
9 R. C. Burns, R. M. Rees, and D. D. E. Long. An analytical study of opportunistic lease renewal. In Proceedings of the 16th International Conference on Distributed Computing Systems, 2001   DOI
10 T. E. Anderson, M. D. Dahlin, J. M. Neefe, D. A. Patterson, D. S. Roselli and R. Y. Wang, 'Serverless network file systems,' In Proceedings of the 15th Symposium on Operating Systems Principles, pages 109-126, December 1995   DOI
11 T. E. Anderson, M. D. Dahlin, J. M. Neefe, D. A. Patterson, D. S. Roselli and R. Y. Wang, 'Serverless network file systems,' In Proceedings of the 15th Symposium on Operating Systems Principles, pages 109-126, December 1995   DOI
12 Matthew T. O'Keefe, Shared File Systems and Fibre Channel, Sixth NASA Goddard Space Flight Center Conference on Mass Storage and Technologies in cooperation with the Fifteenth IEEE Symposium on Mass Storage Systems March 23-26, 1998
13 J. H. Howard, M. L. Kazar, S. G. Menees, D. A. Nichols, M. Satyanarayanan, R. N. Sidebotham, and Michael J. West, 'Scale and Performance in a Distributed File System,' ACM Transactions on Computer Systems, 6(1), pages 51-81, February 1988   DOI
14 R. C. Burns, R. M. Rees, and D. D. E. Long, Safe Caching in a Distributed File System for Network Attached Storage, In Proceedings of the International Parallel and Distributed Processing Symposium (IPDPS), IEEE, 2000   DOI
15 Chang-Soo Kim, gyoung-Bae Kim, Bum-Joo Shin, 'Volume Management for SAN environment,' In Proceedings of the International Conference on Parallel and Distributed Systems, 2001   DOI
16 R. Sandberg, D. Goldberg, S. Kleiman, D. Walsh, and B. Lyon, 'Design and Implementation of the Sun Network Filesystem,' Proceedings of the Summer 1985 USENIX Conference, pages 119-130 June 1985
17 Steven R. Soltis, Thomas M. Ruwart, Matthew T.O'Keefe, The Global File System, Proceedings of the Fifth NASA Goddard Space Flight Center Conference on Mass Storage Systems and Technologies, Sept 17-19, 1996