Browse > Article
http://dx.doi.org/10.7471/ikeee.2017.21.1.92

Thread Distribution Method of GP-GPU for Accelerating Parallel Algorithms  

Lee, Kwan-Ho (NEXT CHIP Inc.)
Kim, Chi-Yong (Dept. of Computer Science, Seokyeong University)
Publication Information
Journal of IKEEE / v.21, no.1, 2017 , pp. 92-95 More about this Journal
Abstract
In this paper, we proposed a way to improve function of small scale GP-GPU. Instead of using superscalar which increase scheduling-complexity, we suggested the application of simple core to maximize GP-GPU performance. Our studies also demonstrated that simplified Stream Processor is one of the way to achieve functional improvement in GP-GPU. In addition, we found that developing of optimal thread-assigning method in Warp Scheduler for specific application improves functional performance of GP-GPU. For examination of GP-GPU functional performance, we suggested the thread-assigning way which coordinated with Deep-Learning system; a part of Neural Network. As a result, we found that functional index in algorithm of Neural Network was increased to 90%, 98% compared with Intel CPU and ARM cortex-A15 4 core respectively.
Keywords
Stream Processor; Superscalar; Thread distribution; GP-GPU; Warp scheduler;
Citations & Related Records
Times Cited By KSCI : 2  (Citation Analysis)
연도 인용수 순위
1 Tariq Rashid, "Make Your Own Neural Network," Hanbit media, 2017
2 Gyutaek Kyung, "A design of a SIMT architecture based GP-GPU using multi-banked cache memory structure," Master thesis, Seokyeong University, 2015.
3 Yun-Seop Hwang, Hee-Kyeong Jeon, Kwan-ho Lee, Kwang-yeob Lee, "Implementation of the SIMT based image signal processor for the image processing," j.inst.Korean.electr.electron.eng, vol 20, no.1, pp89-93, Apr, 2016
4 Odroid, "Odroid-XU," http://www.hardkernel.com
5 Raspberrypi, "raspberrypi," http://www.raspberrypi.org
6 Seonghyeon Han, Sukwon Yoo, "The parallelization of binarization using a GP-GPU," The International Journal of Advanced Culture Technology, vol. 4, no. 4,, 2016
7 Shuai , Tao Li, Qiankun Dong, Xuechen Liu, Yule Yang, "CPU-assisted GPU thread pool model for dynamic task parallelism," Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on, 2015 DOI: 10.1109/NAS.2015.7255234   DOI