
Parallel Distributed Implementation of GHT on Ethernet Multicluster  

Kim, Yeong-Soo (Div. of Computer Information, Yeungnam College of Science & Technology)
Kim, Myung-Ho (Div. of Computer Information, Yeungnam College of Science & Technology)
Choi, Heung-Moon (School of Electrical Engineering & Computer Science, Kyungpook National University)
Abstract
The scale of distributed processing in a single Ethernet cluster is physically limited by the maximum number of ports per switch. This paper presents an implementation of an MPI-based multicluster consisting of multiple Ethernet switches, which extends the scale of distributed processing, together with an asymptotic analysis of the communication overhead based on an execution-time analysis model. To determine the optimum task partitioning, we analyzed the processing time of various partitioning schemes and chose the accumulator array partitioning (AAP) scheme because it minimizes the overall communication overhead. The range of data partitioned under AAP was adjusted to accommodate the increased number of nodes, and a suitable load-balancing algorithm was implemented. We reduced the communication overhead by using pipelined broadcast and flat-tree-based result gathering, and by overlapping communication with computation. For intercluster communication, where the clusters are interconnected by a single link, we used linear pipeline broadcast to reduce the overhead. Experimental results show nearly linear speedup for the proposed parallel distributed generalized Hough transform (GHT) implemented on an MPI-based Ethernet multicluster with four 100 Mbps Ethernet switches and up to 128 Pentium PC nodes.
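The idea behind accumulator array partitioning can be illustrated with a small single-process sketch. It is not the authors' implementation: it assumes a translation-only GHT with one R-table bin, a row-band split of the accumulator, and hypothetical helper names (build_r_table, vote_rows, row_bands, ght_partitioned). Each simulated node scans every edge point but keeps only the votes that land in its own band, and the bands are then stacked, which stands in for the result-gathering step. No MPI calls are used.

```python
import numpy as np

def build_r_table(template_points, reference):
    """Displacements (dy, dx) from each template edge point to the reference point."""
    ry, rx = reference
    return [(int(ry - y), int(rx - x)) for (y, x) in template_points]

def vote_rows(edge_points, r_table, shape, row_lo, row_hi):
    """Accumulate only the votes that fall in accumulator rows [row_lo, row_hi)."""
    acc = np.zeros((row_hi - row_lo, shape[1]), dtype=np.int32)
    for (y, x) in edge_points:
        for (dy, dx) in r_table:
            vy, vx = y + dy, x + dx
            if row_lo <= vy < row_hi and 0 <= vx < shape[1]:
                acc[vy - row_lo, vx] += 1
    return acc

def row_bands(n_rows, n_nodes):
    """Split n_rows into n_nodes contiguous bands of nearly equal size."""
    bounds = np.linspace(0, n_rows, n_nodes + 1).astype(int)
    return [(int(lo), int(hi)) for lo, hi in zip(bounds[:-1], bounds[1:])]

def ght_partitioned(edge_points, r_table, shape, n_nodes):
    """Simulate AAP: one accumulator band per node, then gather by stacking."""
    bands = row_bands(shape[0], n_nodes)
    parts = [vote_rows(edge_points, r_table, shape, lo, hi) for lo, hi in bands]
    return np.vstack(parts)

# Example: a 4-point square template, shifted by (5, 7) in the image.
template = [(0, 0), (0, 2), (2, 0), (2, 2)]
r_table = build_r_table(template, reference=(1, 1))
image_edges = [(y + 5, x + 7) for (y, x) in template]
accumulator = ght_partitioned(image_edges, r_table, shape=(16, 16), n_nodes=4)
peak = np.unravel_index(np.argmax(accumulator), accumulator.shape)
```

Because each band is written by exactly one node, the only data exchanged in this sketch are the broadcast inputs and the gathered bands, which is consistent with the abstract's emphasis on broadcast and gathering costs.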
Keywords
GHT; MPI
Citations & Related Records
Times Cited By KSCI: 1