• Title/Summary/Keyword: NVIDIA

Novel Kernel Design for Implementing Volume Rendering in the PyCUDA Framework (PyCUDA 프레임워크에서 볼륨 렌더링을 구현하기 위한 새로운 커널 디자인)

  • Lee, SooHo;Kim, Jong-Hyun
    • Proceedings of the Korean Society of Computer Information Conference / 2022.01a / pp.349-351 / 2022
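  • This paper introduces a Python-based CUDA (Compute Unified Device Architecture) kernel design for implementing computationally heavy volume rendering. Python is now used not only in artificial intelligence but also in servers, security, GUIs, data visualization, big data processing, and many other fields, so it has long since shed its image as a language meant only for interfaces. In this paper, we design kernels in a Python environment using NVIDIA's CUDA, a massively parallel processing technique, and show that computationally heavy volume rendering can be computed quickly. Through volume rendering, we show that GPU (Graphics Processing Unit) based applications can be developed not only with C-based CUDA but also in the Python environment, where development is comparatively efficient.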
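
The abstract above does not give the kernel itself. As a rough illustration of the per-pixel work a volume rendering kernel typically performs, here is a minimal CPU reference sketch in plain Python. It assumes front-to-back alpha compositing along the +z axis with nearest-neighbour sampling; the function names, the `opacity_scale` parameter, and the transfer function (density used directly as emitted intensity) are hypothetical and are not taken from the paper. In a PyCUDA version, the body of `march_ray` would roughly correspond to one GPU thread.

```python
def sample_volume(volume, x, y, z):
    """Nearest-neighbour lookup in volume[z][y][x]; returns 0.0 outside the grid."""
    nz, ny, nx = len(volume), len(volume[0]), len(volume[0][0])
    i, j, k = int(round(z)), int(round(y)), int(round(x))
    if 0 <= i < nz and 0 <= j < ny and 0 <= k < nx:
        return volume[i][j][k]
    return 0.0


def march_ray(volume, x, y, step=1.0, opacity_scale=0.1):
    """Front-to-back compositing along +z for one pixel (one thread's work)."""
    color, alpha = 0.0, 0.0
    depth = len(volume)
    z = 0.0
    while z < depth and alpha < 0.99:  # early ray termination
        density = sample_volume(volume, x, y, z)
        a = min(1.0, density * opacity_scale)  # assumed opacity mapping
        color += (1.0 - alpha) * a * density   # accumulate weighted intensity
        alpha += (1.0 - alpha) * a             # accumulate opacity
        z += step
    return color, alpha


def render(volume):
    """Render an orthographic image with one ray per (x, y) pixel."""
    ny, nx = len(volume[0]), len(volume[0][0])
    return [[march_ray(volume, x, y)[0] for x in range(nx)] for y in range(ny)]
```

Because every pixel is computed independently, the double loop in `render` is the part that maps naturally onto a CUDA thread grid.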

Fast Access Method of Neighboring Particles Using Bitonic Sort Based GPU Hashing, and Its Applications (바이토닉 정렬 기반의 GPU 해싱을 이용한 인접 입자의 빠른 접근 기법과 그 응용 사례)

  • Lee, SuBin;Kim, Jong-Hyun
    • Proceedings of the Korean Society of Computer Information Conference / 2022.01a / pp.357-360 / 2022
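  • This paper examines how to design, on the GPU, a bitonic sort based hash table that can quickly solve the nearest neighbor search (NNS) problem, that is, a data structure for quickly accessing neighboring data in large datasets, and how this can accelerate particle-based physics simulations. We designed the hash table using the CUDA architecture and, by optimizing the data sorting stage, which has the highest computational cost, obtained faster results than the CUDA hash table provided by NVIDIA. Integrating this data structure into particle-based simulations makes it easy to build high-performance simulations.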
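
The general idea of sort-based spatial hashing can be sketched on the CPU as follows. This is not the authors' implementation: the hash function (three multiplicative primes that are commonly used for spatial hashing), the table size, and all function names are assumptions for illustration. Each `(cell hash, particle index)` pair is sorted with a bitonic sorting network, whose inner loop over `i` is what a GPU would run as one thread per element, and a `cell_start` lookup then gives the first slot of each hash bucket.

```python
import math

P1, P2, P3 = 73856093, 19349663, 83492791  # assumed spatial-hash primes


def bitonic_sort(pairs):
    """In-place bitonic sorting network; len(pairs) must be a power of two."""
    n = len(pairs)
    assert n > 0 and n & (n - 1) == 0
    k = 2
    while k <= n:
        j = k // 2
        while j > 0:
            for i in range(n):  # each i would be one GPU thread
                partner = i ^ j
                if partner > i:
                    ascending = (i & k) == 0
                    if (pairs[i] > pairs[partner]) == ascending:
                        pairs[i], pairs[partner] = pairs[partner], pairs[i]
            j //= 2
        k *= 2
    return pairs


def cell_of(p, h):
    """Integer grid cell containing point p for cell size h."""
    return tuple(math.floor(c / h) for c in p)


def cell_hash(cell, table_size):
    cx, cy, cz = cell
    return ((cx * P1) ^ (cy * P2) ^ (cz * P3)) % table_size


def build_table(positions, h, table_size=64):
    """Sort (hash, index) pairs and record where each hash bucket starts."""
    pairs = [(cell_hash(cell_of(p, h), table_size), i)
             for i, p in enumerate(positions)]
    size = 1
    while size < len(pairs):
        size *= 2
    pairs += [(table_size, -1)] * (size - len(pairs))  # sentinels sort last
    bitonic_sort(pairs)
    cell_start = {}
    for slot, (key, _) in enumerate(pairs):
        if key < table_size and key not in cell_start:
            cell_start[key] = slot
    return pairs, cell_start


def neighbors(query, positions, pairs, cell_start, h, table_size=64):
    """Indices of particles within distance h of query (27-cell search)."""
    cx, cy, cz = cell_of(query, h)
    seen, found = set(), []
    for dx in (-1, 0, 1):
        for dy in (-1, 0, 1):
            for dz in (-1, 0, 1):
                key = cell_hash((cx + dx, cy + dy, cz + dz), table_size)
                if key in seen or key not in cell_start:
                    continue
                seen.add(key)  # avoid scanning a colliding bucket twice
                slot = cell_start[key]
                while slot < len(pairs) and pairs[slot][0] == key:
                    idx = pairs[slot][1]
                    if math.dist(query, positions[idx]) <= h:
                        found.append(idx)
                    slot += 1
    return sorted(found)
```

The explicit distance check in `neighbors` is what keeps hash collisions harmless: a colliding bucket may contribute extra candidates, but never wrong results.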

Development of hipSYCL Runtime Backend Using HSA Runtime API (HSA 런타임 API를 이용한 hipSYCL 런타임 백엔드 개발)

  • Myung, Hunjoo;Gu, Gibeom
    • Proceedings of the Korea Information Processing Society Conference / 2022.11a / pp.60-62 / 2022
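  • SYCL is a C++ programming model that abstracts OpenCL devices. Compared with OpenCL, SYCL retains the strengths of C++, such as high productivity, and it has attracted much attention since Intel released the SYCL-based DPC++ as a development language for heterogeneous computing. Among the various SYCL implementations, we adopted hipSYCL, which supports a range of GPUs including NVIDIA and AMD and whose code is easy to modify and extend, and we are conducting several studies with it. In this paper, we propose a new backend plugin that can replace the HIP backend plugin for AMD GPUs within the hipSYCL architecture. The plugin uses the HSA runtime API, which reduces the number of layers and makes it more lightweight than the existing plugin.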

Development Trends in Advanced Packaging Technology of Global Foundry Big Three (글로벌 파운드리 Big3의 첨단 패키징 기술개발 동향)

  • H.S. Chun;S.S. Choi;D.H. Min
    • Electronics and Telecommunications Trends / v.39 no.3 / pp.98-106 / 2024
  • Advanced packaging is emerging as a core technology owing to the increasing demand for multifunctional, highly integrated semiconductors that deliver low power consumption and high performance in the wake of digital transformation. It may help overcome the current limits of semiconductor process miniaturization and enables individual devices to be combined in a single package. Advanced packaging facilitates the integration of various chips into one device and is emerging as a high-value-added competitive edge in the industry, possibly replacing traditional packaging, which focuses on electrical connections and the protection of semiconductor devices.

Evaluating the groundwater prediction using LSTM model (LSTM 모형을 이용한 지하수위 예측 평가)

  • Park, Changhui;Chung, Il-Moon
    • Journal of Korea Water Resources Association / v.53 no.4 / pp.273-283 / 2020
  • Quantitative forecasting of groundwater levels is very important for assessing groundwater variation and vulnerability. To this end, various time series analysis and machine learning techniques have been used. In this study, we developed a prediction model based on LSTM (long short-term memory), an artificial neural network (ANN) algorithm, to predict the daily groundwater level of 11 groundwater wells in Hankyung-myeon, Jeju Island. In general, the groundwater level in Jeju Island is highly autocorrelated with tides and reflects the effects of precipitation. To construct input and output variables that match the characteristics of the data, precipitation data for the corresponding period were added to the groundwater level data. The LSTM neural network was trained on the initial 365 days of data, which cover all four seasons, and the remaining data were used for verification to evaluate the fit of the predictive model. The model was developed using Keras, a Python-based deep learning framework, and the NVIDIA CUDA architecture was used to speed up training. In training and verifying the groundwater level variation with the LSTM neural network, the coefficient of determination (R²) averaged 0.98, indicating that the predictive model was very accurate.
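
The data preparation and scoring described above can be sketched in plain Python. This is a minimal sketch under stated assumptions, not the authors' pipeline: the `lookback` window length, the function names, and the use of the standard definition R² = 1 - SS_res / SS_tot are all assumptions, since the abstract does not specify them. Only the 365-day training split and the pairing of groundwater level with precipitation come from the abstract.

```python
def make_windows(levels, rainfall, lookback):
    """Pair each run of `lookback` days of (level, rainfall) with the next day's level."""
    inputs, targets = [], []
    for t in range(lookback, len(levels)):
        window = [(levels[s], rainfall[s]) for s in range(t - lookback, t)]
        inputs.append(window)
        targets.append(levels[t])
    return inputs, targets


def split_train_test(inputs, targets, train_days=365):
    """Use the first `train_days` samples for training and the rest for verification."""
    train = (inputs[:train_days], targets[:train_days])
    test = (inputs[train_days:], targets[train_days:])
    return train, test


def r_squared(observed, predicted):
    """Coefficient of determination, 1 - SS_res / SS_tot (assumed definition)."""
    mean = sum(observed) / len(observed)
    ss_res = sum((o - p) ** 2 for o, p in zip(observed, predicted))
    ss_tot = sum((o - mean) ** 2 for o in observed)
    return 1.0 - ss_res / ss_tot
```

Each window has shape (lookback, 2), which matches the (timesteps, features) layout that recurrent layers in Keras expect once the lists are converted to arrays.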

Explication and Rational Conceptualization of Metaverse (메타버스 해석과 합리적 개념화)

  • Song, Stephen W.;Chung, Dong-Hun
    • Informatization Policy / v.28 no.3 / pp.3-22 / 2021
  • This article reviews previous literature on the metaverse and attempts to provide a refined definition of the phenomenon. The metaverse has recently been in the spotlight in industry and media discussions, yet no consensus on its exact definition has been reached. Neal Stephenson coined the term metaverse in his 1992 novel "Snow Crash", and in 2007 the Acceleration Studies Foundation (ASF) became the first to analyze the concept. While ASF's effort did not receive the attention it may have deserved, the metaverse gained much attention in the fall of 2020, when NVIDIA announced "Omniverse", its real-time simulation and collaboration platform for 3D production, as a next-generation alternative to the Internet, and Roblox defined its service as a metaverse during its IPO. Since then, the metaverse has been commonly recognized as a world in which we can cross between reality and virtuality. Based on the two axes and four scenarios proposed by the ASF, we review the literature across four realms: virtual reality, mirror world, augmented reality, and lifelogging. We then examine the issues with the existing definition of the metaverse and propose an alternative explanation that focuses on human behavior and user experience. Finally, we reassess the concept of the metaverse and incorporate human communication, reality-based and virtual-based activities, and eXtended reality as elements needed to define it properly.

A Study on the Performance Improvement of Software Digital Filter using GPU (GPU를 이용한 소프트웨어 디지털 필터의 성능개선에 관한 연구)

  • Yeom, Jae-Hwan;Oh, Se-Jin;Roh, Duk-Gyoo;Jung, Dong-Kyu;Hwang, Ju-Yeon;Oh, Chungsik;Kim, Hyo-Ryoung
    • Journal of the Institute of Convergence Signal Processing / v.19 no.4 / pp.153-161 / 2018
  • This paper describes the performance improvement of a software (SW) digital filter using a GPU (Graphics Processing Unit). The previously developed SW digital filter runs on a CPU (Central Processing Unit) and is therefore slow. The GPU was introduced to filter EAVN (East Asian VLBI Network) observation data, both to improve the operation speed and to allow the data to be processed together with data from other stations. To increase the computational speed of the SW digital filter, an NVIDIA Titan V GPU board with built-in Tensor Cores is used. Filtering 95 seconds of 2 Gbps (512 MHz BW, 1-IF) observation data achieved processing speeds of about 0.78 times (1 Gbps, 16 MHz BW, 16-IF) and 1.1 times (2 Gbps, 32 MHz BW, 16-IF) the observing time, respectively. In addition, for data observed simultaneously at 1 and 2 Gbps with the KVN (Korean VLBI Network), the 2 Gbps data were digitally filtered and compared with the 1 Gbps data, yielding similar values for the cross power spectrum, phase, and SNR (signal-to-noise ratio). These results confirm that the GPU-based SW digital filter developed in this research is effective for data processing and analysis. In the future, observation data are expected to be filterable in real time once the source code is optimized for distributed processing across multiple GPU boards.

Implementation of 3D Road Surface Monitoring System for Vehicle based on Line Laser (선레이저 기반 이동체용 3차원 노면 모니터링 시스템 구현)

  • Choi, Seungho;Kim, Seoyeon;Kim, Taesik;Min, Hong;Jung, Young-Hoon;Jung, Jinman
    • The Journal of the Institute of Internet, Broadcasting and Communication / v.20 no.6 / pp.101-107 / 2020
  • Road surface measurement is an essential process for quantifying the degree and displacement of roughness in road surface management. For safer road surface management and quick maintenance, it is important to measure the road surface accurately from a sensor mounted on a vehicle. In this paper, we propose a road surface measurement system that can take measurements from a moving vehicle. The proposed system supports more accurate measurement by using a high-performance line laser sensor. It can also measure the transverse and longitudinal profiles by matching the position information acquired from RTK, and a velocity-adaptive update algorithm allows a manager to monitor the surface in real time. To evaluate the proposed system, the Gocator laser sensor, MRP module, and NVIDIA Xavier processor were mounted on a mobile test platform and tested on a road surface. Our evaluation results, based on the MSE, demonstrate that the system measures the profile accurately. The proposed system can be used not only to evaluate road conditions but also to evaluate the impact of adjacent excavation.

Understanding of Metaverse Platform Ecosystem: Focusing on the Theory of Double Lines and Five Elements (메타버스 플랫폼 생태계의 이해: 양선오요소(兩線五要素) 이론을 중심으로)

  • Lee, Seoyoun;Chang, Younghoon
    • Knowledge Management Research / v.23 no.2 / pp.15-35 / 2022
  • With the development of virtual and augmented reality technologies, the metaverse, a digital world that provides an immersive feeling like the real world, is growing rapidly. Many IT companies such as Naver, Facebook (Meta), and NVIDIA are developing innovative technologies and launching metaverse platforms and related products. However, even though it is a new business in which many global big tech companies are investing aggressively, the results are not yet clear relative to market expectations, and growth in the number of users is gradually slowing. This can be attributed to a lack of consideration and understanding, from the time metaverse platforms were designed, of how to grow the metaverse ecosystem and how to operate and harmonize its various users and components. To propose a better solution to these problems, this study adopts the yin-yang and five elements theory, which has been used for thousands of years to understand the operating logic of the human world. This research proposes a theory of double lines and five elements by defining two essential spaces of the metaverse platform, online and offline, and five essential elements that constitute it. Through the double lines and five elements theory and the concept of coexistence and polarity among the five elements, the study intends to provide a theoretical lens on how to design and operate a platform.

An Accelerated Approach to Dose Distribution Calculation in Inverse Treatment Planning for Brachytherapy (근접 치료에서 역방향 치료 계획의 선량분포 계산 가속화 방법)

  • Byungdu Jo
    • Journal of the Korean Society of Radiology / v.17 no.5 / pp.633-640 / 2023
  • With the recent development of static and dynamic modulated brachytherapy methods, which use radiation shielding to modulate the dose distribution, the number of parameters and the amount of data required for dose calculation in inverse treatment planning and in treatment plan optimization algorithms suited to the new directional-beam intensity-modulated brachytherapy are increasing. Although intensity-modulated brachytherapy enables accurate dose delivery, the additional parameters and data increase the time required for dose calculation. In this study, a GPU-based, CUDA-accelerated dose calculation algorithm was constructed to limit this increase. The calculation was accelerated by parallelizing the computation of the system matrix of the volume of interest and the dose calculation. All algorithms were run in the same computing environment, with an Intel (3.7 GHz, 6-core) CPU and a single NVIDIA GTX 1080 Ti graphics card, and only the dose calculation time was measured, excluding the time required for loading data from disk and for preprocessing. The results showed that the accelerated algorithm reduced the dose calculation time by a factor of about 30 compared with the CPU-only calculation. The accelerated dose calculation algorithm can be expected to speed up treatment planning when new treatment plans must be created to account for daily variations in applicator movement, as in adaptive radiotherapy, or when dose calculation must account for changing parameters, as in dynamically modulated brachytherapy.
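
To illustrate why the system matrix and dose calculation parallelize well, here is a minimal CPU sketch in plain Python. It assumes a simplified unshielded point-source model in which the contribution of dwell position j to voxel i falls off as 1/r²; the real dose kernel used in the paper (including shielding and any tabulated dosimetry factors) is not given in the abstract, and the function names and the `strength` and `min_dist` parameters are hypothetical. The dose is then the matrix-vector product of the system matrix with the dwell times.

```python
def system_matrix(voxels, dwell_positions, strength=1.0, min_dist=1e-3):
    """A[i][j]: dose rate at voxel i per unit dwell time at position j (assumed 1/r^2 model)."""
    matrix = []
    for v in voxels:  # each row is independent, so rows can be computed in parallel
        row = []
        for d in dwell_positions:
            r2 = sum((a - b) ** 2 for a, b in zip(v, d))
            row.append(strength / max(r2, min_dist ** 2))  # clamp to avoid division by zero
        matrix.append(row)
    return matrix


def dose(matrix, dwell_times):
    """Dose per voxel: dose[i] = sum_j A[i][j] * t[j]."""
    return [sum(a * t for a, t in zip(row, dwell_times)) for row in matrix]
```

In an inverse planning loop, `system_matrix` would be evaluated once per geometry while `dose` is re-evaluated for every candidate set of dwell times, which is why accelerating both steps matters.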