CGRA Compilation Boost up for Acceleration of Graphics

Kim, Wonsub;Choi, Yoonseo;Kim, Jaehyun;

Proceedings of the Korean Society of Broadcast Engineers Conference (한국방송∙미디어공학회:학술대회논문집)

2014.06a
/
Pages.166-168
/
2014

The Korean Institute of Broadcast and Media Engineers (한국방송∙미디어공학회)

CGRA Compilation Boost up for Acceleration of Graphics

영상처리 가속을 위한 CGRA compilation 속도 향상

Kim, Wonsub (Samsung Electronics DMC R&D Center) ;
Choi, Yoonseo (Samsung Electronics SAIT) ;
Kim, Jaehyun (Samsung Electronics DMC R&D Center)

김원섭 (삼성전자 DMC R&D Center) ;
최윤서 (삼성전자 SAIT) ;
김재현 (삼성전자 DMC R&D Center)

Published : 2014.06.30

PDF

Download PDF

⟨ Previous Next ⟩

Abstract

Coarse-grained reconfigurable architectures (CGRAs) present a potential of high compute throughput with energy efficiency. A CGRA consists of an array of functional units (FU), which communicate with each other through an interconnect network containing transmission nodes and register files. To achieve high performance from the software solutions mapped onto CGRAs, modulo scheduling of loops is generally employed. One of the key challenges in modulo scheduling for CGRAs is to explicitly handle routings of operands from a source to a destination operations through various routing resources. Existing modulo schedulers for CGRAs are slow because finding a valid routing is generally a searching problem over a large space, even with the guidance of well-defined cost metrics. Applications in traditional embedded multimedia domains are regarded relatively tolerant to a slow compile time in exchange of a high quality solution. However, many rapidly growing domains of applications, such as 3D graphics, require a fast compilation. Entrances of CGRAs to these domains have been blocked mainly due to its long compile time. We attack this problem by utilizing patternized routes, for which resources and time slots for a success can be estimated in advance when a source operation is placed. By conservatively reserving predefined resources at predefined time slots, future routings originated from the source operation are guaranteed. Experiments on a real-world 3D graphics benchmark suite show that our scheduler improves the compile time up to 6000 times while achieving average 70% throughputs of the state-of-art CGRA modulo scheduler, edge-centric modulo scheduler (EMS).

Proceedings of the Korean Society of Broadcast Engineers Conference (한국방송∙미디어공학회:학술대회논문집)

CGRA Compilation Boost up for Acceleration of Graphics

영상처리 가속을 위한 CGRA compilation 속도 향상

Abstract

Keywords

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)