Search | Korea Science

Performance Analysis on Parallel Processing of a Hybrid of a CPU and a GPU (CPU와 GPU의 혼합 병렬 계산에 대한 성능 분석)

Hwang, Keunchang;Kim, Youngtae
- Proceedings of the Korea Information Processing Society Conference
- /
- 2016.04a
- /
- pp.59-60
- /
- 2016
본 논문에서는 고성능 병렬 계산 장치로 주목받고 있는 GPU를 CPU와 동시에 병렬로 사용한 계산 성능을 분석하였다. 성능 분석을 위하여 원주율(${\pi}$)을 적분으로 계산하는 CUDA 프로그램을 사용하였으며, 전체 계산을 GPU 대비 CPU 계산 부분으로 할당하여 성능을 분석하였다.
https://doi.org/10.3745/PKIPS.y2016m04a.59 인용 PDF

GPU based Shrapnel Drop Computational Model for Specific Area (GPU 기반의 특정 영역에 대한 파편 낙하 계산 모델)

Kim, Tae-Gwon;Cho, Kyu-Tae;Lee, Seung-Young
- Proceedings of the Korea Information Processing Society Conference
- /
- 2016.10a
- /
- pp.41-42
- /
- 2016
특정 영역에 낙하하는 파편에 대한 계산은 파편의 개수가 증가할수록 계산량이 급격히 늘어나기 때문에 많은 자원이 소비된다. 이러한 파편의 낙하 대한 계산은 각각의 파편이 서로 영향을 받지 않기 때문에 일반적으로 CPU나 GPU를 활용하여 병렬로 연산을 수행할 수 있다. 이 논문에서는 특정 영역에 낙하하는 파편을 효율적으로 계산하기 위한 GPU 기반의 파편 낙하 계산 설계 모델을 제안한다. 이 모델은 공중의 특정점에서 폭파한 물체의 파편 방향을 계산한 후, 해당 방향으로 이동한 각각의 파편들이 떨어지는 방향에 대해 트리형식으로 계산을 반복적으로 수행해 최종 낙하 위치를 도출한다. 제안하는 방법은 GPU를 활용하여 파편의 낙하 영역을 사진트리를 통해 하향식(top-down)으로 계산하므로 넓은 영역에 대해 효율적으로 낙하점을 계산할 수 있다.
https://doi.org/10.3745/PKIPS.y2016m10a.41 인용 PDF

Precise Sweep Volume Computation Accelerated by GPU (GPU 가속을 이용한 정밀밀한 스웹 볼륨 경계 계산)

Lee, Hyunho;Kyung, Minho
- Journal of the Korea Computer Graphics Society
- /
- v.21 no.1
- /
- pp.13-21
- /
- 2015
We present a robust GPU algorithm constructing a sweep volume boundary for a triangular mesh model. Sweeping geometric entities of a triangular mesh object is first approximated to a set of triangles, the envelope of which becomes the outer boundary of the sweep volume. We find the envelope by computing the arrangement of the triangle set and extracting its outmost boundary. To ensure robustness of the algorithm, we adopt random perturbation of sweep vertices and the interval arithmetic using multi-level precisions. The algorithm is implemented to perform most computation on GPU, and as a result it runs two orders of magnitude faster than other algorithms.
https://doi.org/10.15701/kcgs.2015.21.1.13 인용 PDF KSCI

Normal Calculation Using GPU in Point Rendering (점 렌더링에서 GPU를 이용한 법선 벡터 계산)

조광현;권구주;신병석
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.10b
- /
- pp.634-636
- /
- 2004
3차원 공간상에 분포되어 있는 정들로부터 기하정보를 재구성하여 렌더링 할 때 법선 벡터가 필요하다. 이 점들은 서로간의 연결성 정보가 없고 법선 벡터가 없기 때문에 음영 효과를 표현 할 수 없다. 본 논문에서는 점들의 연결성 정보를 추정하여 법선 벡터를 구하는 과정에서 GPU를 사용하는 방법을 제안한다 GPU로 법선 벡터를 계산할 경우 CPU의 부하가 줄게 되고 계산 시간도 감소된다 이 방법을 점 렌더링에 적용하여 법선 벡터를 신속하게 계산하고 가시성 검사와 음영처리를 효과적으로 할 수 있도록 한다.
PDF

A Study on comparison of calculation between CPU-intensive and GPU-intensive and finding proper model for specific program (GPU기반의 계산속도와 CPU기반의 계산속도 비교 및 특정 프로그램에 따른 적합한 모델 찾기에 대한 연구)

Shin, Hyun-Soo
- Proceedings of the Korea Information Processing Society Conference
- /
- 2019.05a
- /
- pp.48-51
- /
- 2019
최근 기술이 발달함으로 인해 더 짧은시간에 더 많은 계산량이 필요해진 시대가 왔다. 본 연구에서는 CPU와 GPU의 구조를 파악하고 계산속도를 비교한다. 직렬 방식의 알고리즘에서의 병렬 방식의 알고리즘 및 현재 GPU 병렬처리 적용 사례 및 추후 적합한 모델 찾기에 대해 연구한다.
https://doi.org/10.3745/PKIPS.y2019m05a.48 인용 PDF

A Study on the Performance of Multiple GPU's (다중 GPU의 성능에 대한 연구)

Kim, Yerim;Kim, Youngtae
- Proceedings of the Korea Information Processing Society Conference
- /
- 2016.04a
- /
- pp.49-50
- /
- 2016
본 논문에서는 다중 GPU의 효율성을 알아보기 위하여 정적분 계산을 이용하여 원주율(${\pi}$)를 계산하는 CUDA 프로그램을 구현하였으며, 다중 GPU를 사용하기 위해서는 병렬처리 라이브러리인 MPI를 사용하였다. 실험 결과 GPU의 수에 비례하여 성능이 선형으로 증가함을 보였다.
https://doi.org/10.3745/PKIPS.y2016m04a.49 인용 PDF

GPU-Based Dynamic Remeshing to Simulate Cloth Tearing (옷감 찢기 시뮬레이션을 표현하는 GPU기반 동적 재메쉬)

Seong-Hyeok Moon;Jong-Hyun Kim
- Proceedings of the Korean Society of Computer Information Conference
- /
- 2023.01a
- /
- pp.353-356
- /
- 2023
본 논문에서는 GPU 기반으로 옷감을 찢는 데 필요한 동적 재메쉬 기법에 대해서 제안한다. 일반적으로 메쉬를 파괴(Fracture)하거나 찢는 시뮬레이션에서는 안정적인 동역학 계산하는데 있어서 동적 재 메쉬과정에 매우 중요하며 이 과정이 계산양이 가장 크다. 본 논문에서는 GPU 친화적인 동적 메쉬 알고리즘을 새롭게 제안함으로써 옷감 찢기 시뮬레이션을 실시간으로 보여준다.
PDF

Acceleration of Mesh Denoising Using GPU Parallel Processing (GPU의 병렬 처리 기능을 이용한 메쉬 평탄화 가속 방법)

Lee, Sang-Gil;Shin, Byeong-Seok
- Journal of Korea Game Society
- /
- v.9 no.2
- /
- pp.135-142
- /
- 2009
Mesh denoising is a method to remove noise applying various filters. However, those methods usually spend much time since filtering is performed on CPU. Because GPU is specialized for floating point operations and faster than CPU, real-time processing for complex operations is possible. Especially mesh denoising is adequate for GPU parallel processing since it repeats the same operations for vertices or triangles. In this paper, we propose mesh denoising algorithm based on bilateral filtering using GPU parallel processing to reduce processing time. It finds neighbor triangles of each vertex for applying bilateral filter, and computes its normal vector. Then it performs bilateral filtering to estimate new vertex position and to update its normal vector.
PDF

Efficient Parallel CUDA Random Number Generator on NVIDIA GPUs (NVIDIA GPU 상에서의 난수 생성을 위한 CUDA 병렬프로그램)

Kim, Youngtae;Hwang, Gyuhyeon
- Journal of KIISE
- /
- v.42 no.12
- /
- pp.1467-1473
- /
- 2015
In this paper, we implemented a parallel random number generation program on GPU's, which are known for high performance computing, using LCG (Linear Congruential Generator). Random numbers are important in all fields requiring the use of randomness, and LCG is one of the most widely used methods for the generation of pseudo-random numbers. We explained the parallel program using the NVIDIA CUDA model and MPI(Message Passing Interface) and showed uniform distribution and performance results. We also used a Monte Carlo algorithm to calculate pi(${\pi}$) comparing the parallel random number generator with cuRAND, which is a CUDA library function, and showed that our program is much more efficient. Finally we compared performance results using multi-GPU's with those of ideal speedups.
https://doi.org/10.5626/JOK.2015.42.12.1467 인용 KSCI

Implementation of parallel blocked LU decomposition program for utilizing cache memory on GP-GPUs (GP-GPU의 캐시메모리를 활용하기 위한 병렬 블록 LU 분해 프로그램의 구현)

Kim, Youngtae;Kim, Doo-Han;Yu, Myoung-Han
- Journal of Internet Computing and Services
- /
- v.14 no.6
- /
- pp.41-47
- /
- 2013
GP-GPUs are general purposed GPUs for numerical computation based on multiple threads which are originally for graphic processing. GP-GPUs provide cache memory in a form of shared memory which user programs can access directly, unlikely typical cache memory. In this research, we implemented the parallel block LU decomposition program to utilize cache memory in GP-GPUs. The parallel blocked LU decomposition program designed with Nvidia CUDA C run 7~8 times faster than nun-blocked LU decomposition program in the same GP-GPU computation environment.
https://doi.org/10.7472/jksii.2013.14.6.41 인용 PDF KSCI

Search Result 196, Processing Time 0.032 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)