Browse > Article

Optimization and Performance Analysis of Cloud Computing Platform for Distributed Processing of Big Data  

Hong, Seung-Tae (전북대학교 전자정보공학부)
Shin, Young-Sung (전북대학교 전자정보공학부)
Chang, Jae-Woo (전북대학교 전자정보공학부)
Publication Information
Abstract
Recently, interest in cloud computing which provides IT resources as service form in IT field is increasing. As a result, much research has been done on the distributed data processing that store and manage a large amount of data in many servers. Meanwhile, in order to effectively utilize the spatial data which is rapidly increasing day by day with the growth of GIS technology, distributed processing of spatial data using cloud computing is essential. Therefore, in this paper, we review the representative distributed data processing techniques and we analyze the optimization requirements for performance improvement of the distributed processing techniques for a large amount of data. In addition, we uses the Hadoop and we evaluate the performance of the distributed data processing techniques for their optimization requirements.
Keywords
Cloud Computing; Big Data; Distributed Processing;
Citations & Related Records
Times Cited By KSCI : 3  (Citation Analysis)
연도 인용수 순위
1 T. Condie, N. Conway, P. Alvaro, J. M. Hellerstein, K. Elmeleegy and R. Sears, 2010, "MapReduce Online", Networked Systems Design and Implementation, pp. 21-21.
2 B. F. Cooper, A. Silberstein, E. Tam, R. Ramakrishnan and R. Sears, 2010, "Benchmarking cloud serving systems with YCSB", ACM Symposium on Cloud Computing, pp. 143-154.
3 J. Dean, S. Ghemawat, 2004, "MapReduce : Simplified Data Processing on Large Clusters", Operating System Design and Implementation, pp. 10-10.
4 S. Ghemawat, H. Gobioff and Shun-Tak Leung, 2003, "The Google File System", Symposium on Operating Systems Principles, pp.29-43.
5 D. Jiang, B. C. Ooi, L. Shi and S. Wu, 2010, "The performance of MapReduce: an in-depth study", VLDB, vol. 3, Issue 1-2, pp. 472-483.
6 C. Olston, B. Reed, U. Srivastava, R. Kumer and A. Tomkins, 2008, "Pig latin: a not-so-foreign language for data processing", ACM SIGMOD, pp. 1099-1110.
7 민영수, 김흥연, 김영균, 2009, "클라우드 컴퓨팅을 위한 분산 파일 시스템 기술", 한국정보과학회지, 제27권, 제5호, pp. 86-94.
8 민옥기, 김학영, 남궁한, 2009, "클라우드 컴퓨팅 기술 동향", ETRI 전자통신동향분석, 제24권, 제4호, pp 1-13.
9 성병용, 2009, "국내 기업의 클라우드 컴퓨팅 동향 및 전략", 한국소프트웨어진흥원 정책리포트, 2009년, 7월호, pp. 6-25.
10 이기영, 김동오, 신중수, 한기준, 2008, "대용량 공간 데이터의 효율적인 검색을 위한 공간미들웨어의 개발", 한국공간정보시스템학회 논문지, 제10권, 제1호, pp. 1-14.
11 이동규, 이경민, 정석호, 이성호, 류근호, 2010, "대용량 공간 데이터로부터 빈발 패턴 마이닝", 한국공간정보시스템학회 논문지, 제12권, 제1호, pp. 49-56.
12 정재호, 2008, "클라우드. 컴퓨팅의 현재와 미래, 그리고 시장전략", 한국소프트웨어진흥원 정책리포트, 2008년, 10월호, pp. 56-85.
13 Apache Software Foundation, Apache Hadoop: http://hadoop.apache.org/.
14 Apache Software Foundation, Apache Lucene: http://lucene.apache.org/.
15 Apache Software Foundation, Hadoop Map- Redce: http://hadoop.apache.org/mapreduce.
16 Shivnath Babu, 2010, "Towards Automatic Optimization of MapReduce Programs", ACM Symposium on Cloud Computing, pp. 137-142
17 D. Borthakur, 2009, HDFS Architecture: http://hadoop.apache.org/common/docs/r0 -.20.0/hdfs_design.pdf.
18 T. Condie, N. Conway, Hadoop Online Prototype: http://code.google.com/p/hop.