Browse > Article

Performance Analysis and Enhancing Techniques of Kd-Tree Traversal Methods on GPU  

Chang, Byung-Joon (서강대학교 컴퓨터공학과)
Ihm, In-Sung (서강대학교 컴퓨터공학과)
Abstract
Ray-object intersection is an important element in ray tracing that takes up a substantial amount of computing time. In general, such spatial data structure as kd-tree has been frequently used for static scenes to accelerate the intersection computation. Recently, a few variants of kd-tree traversal have been proposed suitable for the GPU that has a relatively restricted computing architecture compared to the CPU. In this article, we propose yet another two implementation techniques that can improve those previous ones. First, we present a cached stack method that is aimed to reduce the costly global memory access time needed when the stack is allocated to global memory. Secondly, we present a rope-with-short-stack method that eases the substantial memory requirement, often necessary for the previous rope method. In order to show the effectiveness of our techniques, we compare their performances with those of the previous GPU traversal methods. The experimental results will provide prospective GPU ray tracer developers with valuable information, helping them choose a proper kd-tree traversal method.
Keywords
real-time ray tracing; kd-tree traversal; stack structure; ray-polygon intersection; GPU implementation;
Citations & Related Records
연도 인용수 순위
  • Reference
1 T. Foley and J. Sugerman, "KD-tree acceleration structures for a GPU raytracer," Proc. of the ACM SIGGRAPH/EUROGRAPHICS Conference on Graphics Hardware, pp.15-22, 2005.
2 J. Gunther, S. Popov, H. -P. Seidel, P. Slusallek, "Realtime Ray Tracing on GPU with BVH-based Packet Traversal," In IEEE Symposium on Interactive Ray Tracing, pp.113-118, 2007.
3 R. Overbeck, R. Ramamoorthi and W. Mark, "Large ray packets for real-time Whitted ray tracing," In IEEE/EG Symposium on Interactive Ray Tracing, pp.41-48, 2008.
4 NVIDIA. NVIDIA CUDA Compute Unified Device Architecture: Programming Guide (Version 2.3), 2009.
5 오상락, GPU를 사용한 효과적인 Kd-Tree 탐색 알고리즘, 서강대학교 석사학위 논문, 2009년 1월.
6 S. Popov, J. Günther, H. Seidel, and P. Slusallek, "Stackless KD-Tree traversal for high performance GPU ray tracing," Computer Graphics Forum, vol.26, no.3, pp.415-424, 2007.   DOI   ScienceOn
7 I. Wald, W. Mark, J. Gunther, S. Boulos, T. Ize, W. Hunt, S. Parker and P. Shirley, "State of the Art in Ray Tracing Animated Scenes," In Eurographics 2007 State of the Art Reports, 2007.
8 L. Seiler, D. Carmean, E. Sprangle, T. Forsyth, M. Abrash, P. Dubey, S. Junkins, A. Lake, J. Sugerman, R. Cavin, R. Espasa, E. Grochowski, T. Juan and P. Hanrahan, "Larrabee: a many-core x86 architecture for visual computing," ACM Transactions on Graphics, vol.27, no.3, pp.1-15, 2008.
9 I. Wald, S. Boulos and P. Shirley, "Ray tracing deformable scenes using dynamic bounding volume hierarchies," ACM Transactions on Graphics, vol. 26, no.1, pp.1-18, 2007.   DOI   ScienceOn
10 NVIDIA. NVIDIA CUDA Compute Unified Device Architecture: Programming Guide (Version 2.3), 2009.
11 D. R. Horn and J. Sugerman and M. Houston and P. Hanrahan, "Interactive k-d tree GPU ray tracing," Proc. of the 2007 Symposium on Interactive 3D Graphics and Games, pp.167-174, 2007.
12 장병준, 임인성, "GPU상에서의 광선 추적을 위한 Kd-Tree 탐색 기법 비교 분석", 2009 한국컴퓨터그래픽스학회 하계학술대회 논문집, pp.71-73, 2009년 10월
13 T. J. Purcell, I. Buck, W. R. Mark, and P. Hanrahan, "Ray tracing on programmable graphics hardware," ACM Transactions on Graphics, vol.21, no.3, pp.703-712, 2002.
14 M. Shevtsov, A. Soupikov and A. Kapustin, "Highly parallel fast KD-tree construction for interactive ray tracing of dynamic scenes," Computer Graphics Forum, vol.26, no.3, pp.395-404, 2007.   DOI   ScienceOn
15 I. Wald, C. Benthin and S. Boulos, "Getting rid of packets-efficient SIMD single-ray traversal using multibranching BVHs," In IEEE/EG Symposium on Interactive Ray Tracing, pp.49-57, 2008.
16 K. Zhou, Q. Hou, R. Wang, and B. Guo, "Real-Time KD-Tree Construction on Graphics Hardware," ACM Transactions on Graphics, vol.27, no.5, pp.126:1-126:11, Dec 2008.
17 I. Wald, "Realtime Ray Tracing and Interactive Global Illumination," PhD thesis, Computer Graphics Group, Saarland University, 2004.