Search | Korea Science

Real-time Ray-tracing Chip Architecture

Yoon, Hyung-Min;Lee, Byoung-Ok;Cheong, Cheol-Ho;Hur, Jin-Suk;Kim, Sang-Gon;Chung, Woo-Nam;Lee, Yong-Ho;Park, Woo-Chan
- IEIE Transactions on Smart Processing and Computing
- /
- v.4 no.2
- /
- pp.65-70
- /
- 2015
In this paper, we describe the world's first real-time ray-tracing chip architecture. Ray-tracing technology generates high-quality 3D graphics images better than current rasterization technology by providing four essential light effects: shadow, reflection, refraction and transmission. The real-time ray-tracing chip named RayChip includes a real-time ray-tracing graphics processing unit and an accelerating tree-building unit. An ARM Ltd. central processing unit (CPU) and other peripherals are also included to support all processes of 3D graphics applications. Using the accelerating tree-building unit named RayTree to minimize the CPU load, the chip uses a low-end CPU and decreases both silicon area and power consumption. The evaluation results with RayChip show appropriate performance to support real-time ray tracing in high-definition (HD) resolution, while the rendered images are scaled to full HD resolution. The chip also integrates the Linux operating system and the familiar OpenGL for Embedded Systems application programming interface for easy application development.
https://doi.org/10.5573/IEIESPC.2015.4.2.065 인용 PDF KSCI

Adaptive Foveated Ray Tracing Based on Time-Constrained Rendering for Head-Mounted Display (헤드 마운티드 디스플레이를 위한 시간 제약 렌더링을 이용한 적응적 포비티드 광선 추적법)

Kim, Youngwook;Ihm, Insung
- Journal of the Korea Computer Graphics Society
- /
- v.28 no.3
- /
- pp.113-123
- /
- 2022
Ray tracing-based rendering creates by far more realistic images than the traditional rasterization-based rendering. However, it is still burdensome when implemented for a Head-Mounted Display (HMD) system that demands a wide field of view and a high display refresh rate. Furthermore, for presenting high-quality images on the HMD screen, a sufficient number of ray sampling should be carried out per pixel to alleviate visually annoying spatial and temporal aliases. In this paper, we extend the recent selective foveated ray tracing technique by Kim et al. [1], and propose an improved real-time rendering technique that realizes the rendering effect of the classic Whitted-style ray tracing on the HMD system. In particular, by combining the ray tracing hardware-based acceleration technique and time-constrained rendering scheme, we show that fast HMD ray tracing is possible that is well suited to human visual systems.
https://doi.org/10.15701/kcgs.2022.28.3.113 인용 PDF KSCI

Implementation of Real-time Interactive Ray Tracing on GPU (GPU 기반의 실시간 인터렉티브 광선추적법 구현)

Bae, Sung-Min;Hong, Hyun-Ki
- Journal of Korea Game Society
- /
- v.7 no.3
- /
- pp.59-66
- /
- 2007
Ray tracing is one of the classical global illumination methods to generate a photo-realistic rendering image with various lighting effects such as reflection and refraction. However, there are some restrictions on real-time applications because of its computation load. In order to overcome these limitations, many researches of the ray tracing based on GPU (Graphics Processing Unit) have been presented up to now. In this paper, we implement the ray tracing algorithm by J. Purcell and combine it with two methods in order to improve the rendering performance for interactive applications. First, intersection points of the primary ray are determined efficiently using rasterization on graphics hardware. We then construct the acceleration structure of 3D objects to improve the rendering performance. There are few researches on a detail analysis of improved performance by these considerations in ray tracing rendering. We compare the rendering system with environment mapping based on GPU and implement the wireless remote rendering system. This system is useful for interactive applications such as the realtime composition, augmented reality and virtual reality.
PDF

Implementation of Ray Tracing Processor for the Parallel Processing (병렬처리를 위한 고속 Ray Tracing 프로세서의 설계)

Choe, Gyu-Yeol;Jeong, Deok-Jin
- The Transactions of the Korean Institute of Electrical Engineers A
- /
- v.48 no.5
- /
- pp.636-642
- /
- 1999
The synthesis of the 3D images is the most important part of the virtual reality. The ray tracing is the best method for reality in the 3D graphics. But the ray tracing requires long computation time for the synthesis of the 3D images. So, we implement the ray tracing with software and hardware. Specially we design the hit-test unit with FPGA tool for the ray tracing. Hit-test unit is a very important part of ray tracing to improve the speed. In this paper, we proposed a new hit-test algorithm and apply the parallel architecture for hit-test unit to improve the speed. We optimized the arithmetic unit because the critical path of hit-test unit is in the multiplication part. We used the booth algorithm and the baugh-wooley algorithm to reduce the partial product and adapted the CSA and CLA to improve the efficiency of the partial product addition. Our new Ray tracing processor can produce the image about 512ms/F and can be adapted to real-time application with only 10 parallel processors.
PDF

A Soft Shadow Technique for a Real-time Mobile Ray Tracing Hardware (실시간 모바일 레이트레이싱 하드웨어를 위한 소프트 쉐도우 생성 기법)

Kwon, Hyuck-Joo;Hong, Dukki;Park, Woo-Chan;Lee, Sanghoon
- Journal of the Korea Computer Graphics Society
- /
- v.23 no.3
- /
- pp.55-64
- /
- 2017
In this paper, a novel soft shadow method is suggested to support realistic shadows in mobile ray tracing. In ray tracing, soft shadow is generally generated by sampling a shadow ray. As this sampling method increases the number of rays to be processed, it has undermined the performance. We designed the proposed soft shadow processing method and hardware architecture to overcome this problem through selective shadow generation and triangle address caching for minimizing the performance degradation caused by sampling. The proposed hardware architecture can be integrated into a mobile ray-tracing hardware and was evaluated in terms of its performance on the FPGA. Based on the results, the rendering performance about 4, 8, and 16 samples were improved, respectively, by 40%, 50%, and 56% on average compared to the previous method, and it was found that the real-time soft shadow processing is feasible with the proposed hardware architecture.
https://doi.org/10.15701/kcgs.2017.23.3.55 인용 PDF KSCI

Efficient Process Network Implementation of Ray-Tracing Application on Heterogeneous Multi-Core Systems

Jung, Hyeonseok;Yang, Hoeseok
- IEIE Transactions on Smart Processing and Computing
- /
- v.5 no.4
- /
- pp.289-293
- /
- 2016
As more mobile devices are equipped with multi-core CPUs and are required to execute many compute-intensive multimedia applications, it is important to optimize the systems, considering the underlying parallel hardware architecture. In this paper, we implement and optimize ray-tracing application tailored to a given mobile computing platform with multiple heterogeneous processing elements. In this paper, a lightweight ray-tracing application is specified and implemented in Kahn process network (KPN) model-of-computation, which is known to be suitable for the description of real-time applications. We take an open-source C/C++ implementation of ray-tracing and adapt it to KPN description in the Distributed Application Layer framework. Then, several possible configurations are evaluated in the target mobile computing platform (Exynos 5422), where eight heterogeneous ARM cores are integrated. We derive the optimal degree of parallelism and a suitable distribution of the replicated tasks tailored to the target architecture.
https://doi.org/10.5573/IEIESPC.2016.5.4.289 인용 PDF KSCI

GPU-accelerated Global Illumination for Point Set Rendering (GPU 가속을 이용한 점집합 렌더링을 위한 전역 조명기법)

Min, Heajung;Kim, Young J.
- Journal of the Korea Computer Graphics Society
- /
- v.26 no.1
- /
- pp.7-15
- /
- 2020
In the process of visualizing a point set representing a smooth manifold surface, global illumination techniques can be used to render a realistic scene with various effects of lighting. Thanks to the continuous demand for ray tracing and the development of graphics hardware, dedicated GPUs and programmable pipeline for ray tracing have been introduced in recent years. In this paper, real-time global illumination rendering is studied for a point-set model using ray-tracing GPUs. We apply the moving least-squares (MLS) method to approximate the point set to a smooth implicit surface and render it using global illumination by performing massive ray-intersection tests with the surface and generating shading effects at the intersection point. As a result, a complicated point-set scene consisting of more than 0.5M points can be generated in real-time.
https://doi.org/10.15701/kcgs.2020.26.1.7 인용 PDF KSCI

Fast Ray Reordering and Approximate Sibson Interpolation for Foveated Rendering on GPU

Kwon, Oh-Seok;park, Keon-kuk;Yoon, Joseph;Kim, Young-Bong
- Journal of Korea Multimedia Society
- /
- v.22 no.2
- /
- pp.311-321
- /
- 2019
Virtual reality applications in Head-Mounted Displays require high frame rates and low latency rendering techniques. Ray tracing offers many benefits, such as high-quality image generation, but has not been utilized due to lower performance than rasterization. But that can obtain good result combined with gaze-tracking technology and human visual system's operation principle. In this paper, we propose a method to optimize the foveated sampling map and to maintain the visual quality through the fast voronoi nearest interpolation. The proposed method further reduces the computational cost that has been improved by the previous foveated sampling. It also smoothes the voronoi boundary using adaptive sibson interpolation, which was not possible in real-time. As a result, the proposed method can render real-time high-quality images with low visual difference.
https://doi.org/10.9717/kmms.2019.22.2.311 인용 PDF KSCI HTML

In-situ Determination of Absorption Coefficients in a Room

Suh, Jin-Sung
- The Journal of the Acoustical Society of Korea
- /
- v.20 no.3E
- /
- pp.10-17
- /
- 2001
The possibility is investigated of determining the diffuse absorption coefficients of the wall surfaces in a real room by minimizing the errors between the measured energy impulse response of a real room and the predicted energy impulse responses obtained from the ray tracing simulation of the room. In other words, this can possibly serve as a basis for "acoustical system identification" in attempting to determine the "best fit" of modelled absorption coefficients to measured energy response data. Algorithms for attempting this were investigated. The algorithms developed for this purpose proved to be rigorous and efficient. Instead of using the ray tracing model to determine the absorption coefficients, the phase image model was used in order to determine the acoustic impedances of wall surfaces. However, the numerical algorithms could not find the correct impedance values, primarily due to the wide range of the acoustic impedance values of any single acoustic material and very long computation time.
PDF

An efficient acceleration algorithm of GPU ray tracing using CUDA (CUDA를 이용한 효과적인 GPU 광선추적 가속 알고리즘)

Ji, Joong-Hyun;Yun, Dong-Ho;Ko, Kwang-Hee
- 한국HCI학회:학술대회논문집
- /
- 2009.02a
- /
- pp.469-474
- /
- 2009
This paper proposes an real time ray tracing system using optimized kd-tree traversal environment and ray/triangle intersection algorithm. The previous kd-tree traversal algorithms search for the upper nodes in a bottom-up manner. In a such way we need to revisit the already visited parent node or use redundant memory after failing to find the intersected primitives in the leaf node. Thus ray tracing for relatively complex scenes become more difficult. The new algorithm contains stacks implemented on GPU's local memory on CUDA framework, thus elegantly eliminate the problems of previous algorithms. After traversing the node we perform the latest CPU-based ray/triangle intersection algorithm 'Plucker coordinate test', which is further accelerated in massively parallel thanks to CUDA. Plucker test can drastically reduce the computational costs since it does not use barycentric coordinates but only simple test using the relations between a ray and the triangle edges. The entire system is consist of a single ray kernel simply and implemented without introduction of complicated synchronization or ray packets. Consequently our experiment shows the new algorithm can is roughly twice as faster as the previous.
PDF

Search Result 30, Processing Time 0.029 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)