GLORY-FS: 대규모 인터넷 서비스를 위한 분산 파일 시스템

  • Published : 2013.03.30

Abstract

본고에서는 분산 파일 시스템 기술의 현황 및 최근 이슈를 다룬다. 먼저 클라우드 컴퓨팅 및 빅데이터 분석 분야에서 산업체 표준으로 간주되고 있는 Hadoop의 분산 파일 시스템을 위주로 현황과 한계에 대해 다루고, 국내에서 개발된 유사한 구조의 분산 파일시스템인 GLORY-FS를 Hadoop 파일 시스템과 대비하여 국내 활용 사례를 기반으로 유사성 및 차이점을 비교한다.

Keywords

References

  1. Apache Hadoop. http://hadoop.apache.org/.
  2. Konstantin V. S."The Hadoop Distributed File System," Proc. of MSST2010, May 2010.
  3. Konstantin V. S."Apache Hadoop: the Scalability Update," ;login:, pp.7-13, June 2011.
  4. MapR's Direct Access NFS vs. Hadoop FUSE, http://www.mapr.com
  5. P. H. Carns, W. B. Ligon III, R. B. Ross, and R. Thakur. "PVFS: A parallel file system for Linux clusters," in Proc. of 4th Annual Linux Showcase and Conference, 2000, pp. 317-327.
  6. W. Tantisiriroj, S. Patil, G. Gibson. "Data-intensive file systems for Internet services: A rose by any other name ..." Technical Report CMUPDL-08-114, Parallel Data Laboratory, Carnegie Mellon University,Pittsburgh, PA, October 2008.
  7. Lustre File System. http://www.lustre.org
  8. S. Ghemawat, H. Gobioff, S. Leung. "The Google file system," In Proc. of ACM Symposium on Operating Systems Principles, Lake George, NY, Oct 2003, pp 29-43.
  9. M. K. McKusick, S. Quinlan. "GFS: Evolution on Fast-forward," ACM Queue, vol. 7, no. 7, New York, NY. August 2009.
  10. Konstantin V. Shvachko. HDFS Scalability: The limits to growth, ;login:, pp.6-16, April 2010.
  11. The MapR Distribution for Apache Hadoop, http:// www.mapr.com
  12. $Intel^{\circledR}$ Distribution for Apache Hadoop, http://www. intel.com
  13. EMC delivers on Isilon-Hadoop bundle, http:// gigaom.com/2012/01/31/emc-delivers-on-isilonhadoop- bundle/
  14. NetApp Open Solution for Hadoop, http://www. netapp.com/us/solutions/big-data/hadoop.aspx
  15. Because Hadoop isn't perfect: 8 ways to replace HDFS, http://gigaom.com/2012/07/11/becausehadoop- isnt-perfect-8-ways-to-replace-hdfs/
  16. Thinking about the HDFS v.s. Other Storage Technologies, http://hortonworks.com/blog/ thinking-about-the-hdfs-vs-other-storagetechnologies/
  17. S. Weil, S. Brandt, E. Miller, D. Long, and C. Maltzahn, "Ceph: A Scalable, High-Performance Distributed File System," Proceedings of the 7th Symposium on Operating Systems Design and Implementation, November 2006.
  18. S. Radia, S. Srinivas, "Scaling HDFS Cluster Using Namenode Federation," HDFS-1052, August 2010: https://issues.apache.org/jira/secure/ attachment/12453067/high-level-design.pdf.
  19. J. Dean, "Large-Scale Distributed Systems at Google: Current Systems and Future Directions," Keynote at Large-Scale Distributed Systems and Middleware, October 2009.
  20. J. Darcy, VoldFS and CassFS: https://github.com/ jdarcy/VoldFS, https://github.com/jdarcy/CassFS.
  21. K. T. Park, H. Y. Kim, Y.C. Kim, S.M. Lee, Y.K. Kim, M.J. Kim, "Lake: Towards highly manageable cluster storage for extremely scalable services," ICCSA 2008
  22. 민영수, 진기성, 김홍연, 김영균, "클라우드 컴퓨팅을 위한 분산파일 시스템 기술 동향," pp.55-68, 전자통신동향분석 제24권 제4호, 2009년 8월