Browse > Article
http://dx.doi.org/10.3745/KIPSTA.2011.18A.4.123

Scalable Data Provisioning Scheme on Large-Scale Distributed Computing Environment  

Kim, Byungs-Sang (한국과학기술원 정보통신공학과)
Youn, Chan-Hyun (한국과학기술원 전자및전기공학과)
Abstract
As the global grid has grown in size, large-scale distributed data analysis schemes have gained momentum. Over the last few years, a number of methods have been introduced for allocating data intensive tasks across distributed and heterogeneous computing platforms. However, these approaches have a limited potential for scaling up computing nodes so that they can serve more tasks simultaneously. This paper tackles the scalability and communication delay for computing nodes. We propose a distributed data node for storing and allocating the data. This paper also provides data provisioning method based on the steady states for minimizing the communication delay between the data source and the computing nodes. The experimental results show that scalability and communication delay can be achieved in our system.
Keywords
Distributed Data Analysis; Data Provisioning; Steady State Scheduling; Divisible Load Theory;
Citations & Related Records
연도 인용수 순위
  • Reference
1 David R. Cox, et.al, "The theory of stochastic processes" Chapman & Hall/CRC, 2001.
2 The SimJava Tutorial, http://www.dcs.ed.ac.uk/home/hase/simjava/
3 C. Banino, O. Beaumont, L. Carter, J. Ferrante, A. Legrand, and Y. Robert, ``Scheduling Strategies for Master-Slave Tasking on Heterogeneous Processor Platforms,'' IEEE Trans. Parallel Distributed Systems, Vol.15, No.4, pp.319-330, 2004.   DOI   ScienceOn
4 Moscicki and T. Jakub, ``DIANE - Distributed analysis environment for GRID-enabled simulation and analysis of physics data,'' Proc. IEEE Nuclear Science Symposium Conference Record, Vol.3, pp.1617-1620, 2003.   DOI
5 https://twiki.cern.ch/twiki/bin/view/Atlas/PanDA
6 Pacific Rim Applications and Grid Middleware Assemply, http://www.pragma-grid.net/
7 J. Andreeva, S. Campana, F. Fanzago, and J. Herrala, ``High-Energy Physics on the Grid: the ATLAS and CMS Experience,'' Journal of Grid Computing, Vol.6, No.1, pp.3-13, Mar., 2008.   DOI
8 Y. Asim and J. J. Dongarra, ``Biological sequence alignment on the computational grid using the GrADS framework,'' Future Generation Computer Systems, Vol.21, No.6, pp.980-986, June, 2005.   DOI   ScienceOn
9 P. Luo, K. Lu , Z. Shi , and Q. He, ``Distributed Data Mining in Grid Computing Environments,'' Future Generation Computer Systems, Vol.23, No.1, pp.84-91, Jan., 2007.   DOI   ScienceOn