Browse > Article
http://dx.doi.org/10.9709/JKSS.2010.19.2.081

Grid-based Biological Data Mining using Dynamic Load Balancing  

Ma, Yong-Beom (인하대학교 정보공학과)
Kim, Tae-Young (인하대학교 정보공학과)
Lee, Jong-Sik (인하대학교 정보공학과)
Abstract
Biological data mining has been noticed as an issue as the volume of biological data is increasing extremely. Grid technology can share and utilize computing data and resources. In this paper, we propose a hybrid system that combines biological data mining with grid technology. Especially, we propose a decision range adjustment algorithm for processing efficiency of biological data mining. We obtain a reliable data mining recognition rate automatically and rapidly through this algorithm. And communication loads and resource allocation are key issues in grid environment because the resources are geographically distributed and interacted with themselves. Therefore, we propose a dynamic load balancing algorithm and apply it to the grid-based biological data mining method. For performance evaluation, we measure average processing time, average communication time, and average resource utilization. Experimental results show that this method provides many advantages in aspects of processing time and cost.
Keywords
Dynamic load balancing; Grid computing; Biological data mining;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Rajagopalan, A., S. Hariri, "An agent based dynamic load balancing system", Proceedings of the International Workshop on Autonomous Decentralized Systems, pp. 164-171, 2000.
2 Sureswaran, R., M. Samaka and J. Knaggs, "LOADIST:a distributed processing environment based on load sharing", IEEE SICON/ICIE '95 International Conference on Networks and Information Engineering, pp. 518-522, 1995.
3 Tzanis, G., C. Berberidis and I. Vlahavas, "Data Mining in Biological Data", Encyclopedia of Database Technologies and Applications, IDEA Group Publishing, 2005.
4 Wang, J. T. L., M. J. Zaki, H. T. T. Toivonen and D. Shasha, Data Mining in Bioinformatics, Springer, Berlin, 2005.
5 Watson, S. and S. Arkinstall, The G-protein Linked Receptor Facts Book. Academic Press, Burlington, 1994.
6 Wikinson and Allen, Parallel Programming Techniques & Applications using Networked Workstations and Parallel Computers, Prentice Hall, New Jersey, 1999.
7 Zaki, M. J., W. Li and S. Parthasarathy, "Customized Dynamic Load Balancing for a Network of Workstations", 5th IEEE International Symposium on High Performance Distributed Computing (HPDC '96), 1996.
8 Foster, I. and C. Kesselman, The Grid: Blueprint for a New Computing Infrastructure, Morgan Kaufmann, San Francisco, 1998.
9 Berman, F., G. Fox and T. Hey, Grid Computing: Making the Global Infrastructure a Reality, J. Wiley, New Jersey, 2003.
10 Department of Defense, High Level Architecture Run-Time Infrastructure Programmer's Guide 1.3 Version 5, 1998.
11 Kim, C. and H. Kameda, "An Algorithm for Optimal Load Balancing in Distributed Computer Systems", IEEE Transactions on Computers, Vol. 41, No. 3, pp. 381-384, 1990.
12 Kumar V., A. Grama, A. Gupta and G. Karypis, Introduction to Parallel Computing: Design and Analysis of Algorithms, The Benjamin/Cummings Publishing Company, San Francisco, 1994.
13 Kumar, V., A. Grama and V. N. Rao, "Scalable Load Balancing Techniques for Parallel Computers", Journal of Distributed Computing, Vol. 7, 1994.