Browse > Article
http://dx.doi.org/10.6109/jkiice.2022.26.4.637

Efficient Message Scattering and Gathering Based on Processing Node Status  

Park, Jongsu (Department of Electrical and Electronic Engineering, Mokwon University)
Abstract
To maximize performance in a high-performance multicore processor system. it is essential to enable effective data communication between processing cores. Data communication between processor nodes can be broadly classified into collective and point-to-point communications. Collective communication comprises scattering and gathering. This paper presents a efficient message scattering and gathering based on processing node status. In the proposed algorithms, the transmission order is changed according to the data size of the pre-existing communication, to reduce the waiting time required until the collective communications begin. From the simulation, the performances of the proposed message scattering and gathering algorithms were improved by approximately 71.41% and 69.84%.
Keywords
Scatter; Gather; Collective Communication; Message Passing;
Citations & Related Records
Times Cited By KSCI : 1  (Citation Analysis)
연도 인용수 순위
1 A. Vijay Bhaskar and T. G. Venkatesh, "Performance analysis of network-on-chip in many-core processors," Journal of Parallel and Distributed Computing, vol. 147, pp. 196-208, Jan. 2021.   DOI
2 S. H. Gade and S. Deb, "A Novel Hybrid Cache Coherence with Global Snooping for Many-core Architectures," ACM Transactions on Design Automation of Electronic Systems, vol. 27, no. 1, pp. 1-31, 2021.
3 S. Kim, M. Fayazi, A. Daftardar, K. -Y. Chen, J. Tan, S. Pal, T. Ajayi, Y. Xiong, T. Mudge, C. Chakrabarti, D. Blaauw, R. Dreslinski, and H. -S. Kim, "Versa: A 36-Core Systolic Multiprocessor With Dynamically Reconfigurable Interconnect and Memory," IEEE Journal of Solid-State Circuits, vol. 57, no. 4, pp. 986-998, Apr. 2022. DOI: 10.1109/JSSC.2022.3140241   DOI
4 K. Fernandes, "GPU Development and Computing Experiences," University of Cambridge, Research Computing Services, 2015.
5 J. L. Traff, A. Ripke, C. Siebert, P. Balaji, R. Thakur, and W. Gropp, "A Simple, Pipelined Algorithm for Large, Irregular All-gather Problems," in Proceedings of the 15th European PVM/MPI Users' Group Meeting on Recent Advances in Parallel Virtual Machine and Message Passing Interface, vol. 5205, pp. 84-93, 2008.
6 J. Park, H. Yun, and S. Moon, "Enhancing Performance Using Atomic Pipelined Message Broadcast in a Distributed Memory MPSoC," IEICE Electronics Express, vol. 11, pp. 1-7, Nov. 2014.
7 J. Park, "Efficient Pipelined Broadcast with Monitoring Processing Node Status on a Multi-Core Processor," Mathematics, vol. 7, no. 12, Dec. 2019. DOI:10.3390/math7121159.   DOI