[KSCI] Korea Science Citation Index Service

http://dx.doi.org/10.7471/ikeee.2017.21.1.92

Thread Distribution Method of GP-GPU for Accelerating Parallel Algorithms

Lee, Kwan-Ho (NEXT CHIP Inc.)
Kim, Chi-Yong (Dept. of Computer Science, Seokyeong University)

Publication Information

Journal of IKEEE / v.21, no.1, 2017 , pp. 92-95 More about this Journal

Abstract

In this paper, we proposed a way to improve function of small scale GP-GPU. Instead of using superscalar which increase scheduling-complexity, we suggested the application of simple core to maximize GP-GPU performance. Our studies also demonstrated that simplified Stream Processor is one of the way to achieve functional improvement in GP-GPU. In addition, we found that developing of optimal thread-assigning method in Warp Scheduler for specific application improves functional performance of GP-GPU. For examination of GP-GPU functional performance, we suggested the thread-assigning way which coordinated with Deep-Learning system; a part of Neural Network. As a result, we found that functional index in algorithm of Neural Network was increased to 90%, 98% compared with Intel CPU and ARM cortex-A15 4 core respectively.

Keywords

Stream Processor; Superscalar; Thread distribution; GP-GPU; Warp scheduler;

Citations & Related Records

Times Cited By KSCI : 2 (Citation Analysis)

Reference
Cited By KSCI

1	Tariq Rashid, "Make Your Own Neural Network," Hanbit media, 2017
2	Gyutaek Kyung, "A design of a SIMT architecture based GP-GPU using multi-banked cache memory structure," Master thesis, Seokyeong University, 2015.
3	Yun-Seop Hwang, Hee-Kyeong Jeon, Kwan-ho Lee, Kwang-yeob Lee, "Implementation of the SIMT based image signal processor for the image processing," j.inst.Korean.electr.electron.eng, vol 20, no.1, pp89-93, Apr, 2016
4	Odroid, "Odroid-XU," http://www.hardkernel.com
5	Raspberrypi, "raspberrypi," http://www.raspberrypi.org
6	Seonghyeon Han, Sukwon Yoo, "The parallelization of binarization using a GP-GPU," The International Journal of Advanced Culture Technology, vol. 4, no. 4,, 2016
7	Shuai , Tao Li, Qiankun Dong, Xuechen Liu, Yule Yang, "CPU-assisted GPU thread pool model for dynamic task parallelism," Networking, Architecture and Storage (NAS), 2015 IEEE International Conference on, 2015 DOI: 10.1109/NAS.2015.7255234 DOI

KSCI

Thread Distribution Method of GP-GPU for Accelerating Parallel Algorithms 병렬 알고리즘의 가속화를 위한 GP-GPU의 Thread할당 기법

Thread Distribution Method of GP-GPU for Accelerating Parallel Algorithms