• Title/Summary/Keyword: General purpose computing

Search Result 160, Processing Time 0.031 seconds

Development of a 3-D CFD Program for Computing Two-Phase Flows with a Level Set Method (Level Set 상경계면 추적법을 적용한 3차원 CFD 프로그램의 개발)

  • Son G.;Hur N.
    • Journal of computational fluids engineering
    • /
    • v.9 no.3
    • /
    • pp.73-80
    • /
    • 2004
  • A LS(Level Set) formulation is developed for computing two-phase flows on non- orthogonal meshes. Compared with the VOF(Volume-of-Fluid) method based on a non-smooth volume-fraction function, the LS method can calculate an interfacial curvature more accurately by using a smooth distance function. Also, it is quite straightforward to implement for 3-D irregular meshes compared with the VOF method requiring much more complicated geometric calculations. The LS formulation is implemented into a general purpose program for 3-D flows and verified through several test problems.

Technical Trend and Challenging Issues for Quantum Computing Control System (양자컴퓨터 제어 기술)

  • Jeong, Y.H.;Choi, B.S.
    • Electronics and Telecommunications Trends
    • /
    • v.36 no.3
    • /
    • pp.87-96
    • /
    • 2021
  • Quantum computers will be a game-changer in various fields, such as cryptography and new materials. Quantum computer is quite different from the classical computer by using quantum-mechanical phenomena, such as superposition, entanglement, and interference. The main components of a quantum computer can be divided into quantum-algorithm, quantum-classical control interface, and quantum processor. Universal quantum computing, which can be applied in various industries, is expected to have more than millions of qubits with high enough gate accuracy. Currently, It uses general-purpose electronic equipment, which is placed in a rack, at room temperature to make electronic signals that control qubits. However, implementing a universal quantum computer with a low error rate requires a lot of qubits demands the change of the current control system to be an integrated and miniaturized system that can be operated at low temperatures. In this study, we explore the fundamental units of the control system, describe the problems and alternatives of the current control system, and discuss a future quantum control system.

Accelerating 2D DCT in Multi-core and Many-core Environments (멀티코어와 매니코어 환경에서의 2 차원 DCT 가속)

  • Hong, Jin-Gun;Jung, Sung-Wook;Kim, Cheong-Ghil;Burgstaller, Bernd
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2011.04a
    • /
    • pp.250-253
    • /
    • 2011
  • Chip manufacture nowadays turned their attention from accelerating uniprocessors to integrating multiple cores on a chip. Moreover desktop graphic hardware is now starting to support general purpose computation. Desktop users are able to use multi-core CPU and GPU as a high performance computing resources these days. However exploiting parallel computing resources are still challenging because of lack of higher programming abstraction for parallel programming. The 2-dimensional discrete cosine transform (2D-DCT) algorithms are most computational intensive part of JPEG encoding. There are many fast 2D-DCT algorithms already studied. We implemented several algorithms and estimated its runtime on multi-core CPU and GPU environments. Experiments show that data parallelism can be fully exploited on CPU and GPU architecture. We expect parallelized DCT bring performance benefit towards its applications such as JPEG and MPEG.

General Purpose Optical Fuzzy Computing Modules

  • Mamano, Kazuho
    • Proceedings of the Korean Institute of Intelligent Systems Conference
    • /
    • 1993.06a
    • /
    • pp.777-780
    • /
    • 1993
  • Three optical fuzzy calculating modules, MAX/MIN, NOT/THROUGH, and SUP/THROUGH operating modules, are proposed. The MAX/MIN operating on inputted 2 membership functions. The NOT/THROUGH operating module calculates the complement of the membership function. The SUP/THROUGH operating module outputs an image representing the supremum (least upper bound) of the membership function. The THROUGH operation passes the image of the inputted membership function from the entrance to the exit. This paper demonstrates that these modules can output the image into which the modules transform inputted images on the basis of operation on fuzzy logic.

  • PDF

Analytical fragility curves of a structure subject to tsunami waves using smooth particle hydrodynamics

  • Sihombing, Fritz;Torbol, Marco
    • Smart Structures and Systems
    • /
    • v.18 no.6
    • /
    • pp.1145-1167
    • /
    • 2016
  • This study presents a new method to computes analytical fragility curves of a structure subject to tsunami waves. The method uses dynamic analysis at each stage of the computation. First, the smooth particle hydrodynamics (SPH) model simulates the propagation of the tsunami waves from shallow water to their impact on the target structure. The advantage of SPH over mesh based methods is its capability to model wave surface interaction when large deformations are involved, such as the impact of water on a structure. Although SPH is computationally more expensive than mesh based method, nowadays the advent of parallel computing on general purpose graphic processing unit overcome this limitation. Then, the impact force is applied to a finite element model of the structure and its dynamic non-linear response is computed. When a data-set of tsunami waves is used analytical fragility curves can be computed. This study proves it is possible to obtain the response of a structure to a tsunami wave using state of the art dynamic models in every stage of the computation at an affordable cost.

Analysis of the GPGPU Performance for Various Combinations of Workloads Executed Concurrently (동시에 실행되는 워크로드 조합에 따른 GPGPU 성능 분석)

  • Kim, Dongwhan;Eom, Hyeonsang
    • KIISE Transactions on Computing Practices
    • /
    • v.23 no.3
    • /
    • pp.165-170
    • /
    • 2017
  • Many studies have utilized GPGPU (General-Purpose Graphic Processing Unit) and its high computing power to compute complex tasks. The characteristics of GPGPU programs necessitate the operations of memory copy between the host and device. A high latency period can affect the performance of the program. Thus, it is required to significantly improve the performance of GPGPU programs by optimizations. By executing multiple GPGPU programs simultaneously, the latency hiding effect of memory copy is achieved by overlapping the memory copy and computing operations in GPGPU. This paper presents the results of analyzing the latency hiding effect for memory copy operations. Furthermore, we propose a performance anticipation model and an algorithm for the limitations of using pinned memory, and show that the use of the proposed algorithm results in a 41% performance increase.

Factors Affecting on Reusing Intention of Cloud Computing based Smartphone Environment (스마트폰 환경에서 클라우드 컴퓨팅 재사용의도에 미치는 요인)

  • Nam, Soo-Tai;Jin, Chan-Yong
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2016.05a
    • /
    • pp.140-142
    • /
    • 2016
  • Recently, rapid innovation of Smartphone is changing the paradigm of our daily life. Smartphone users, opinion experts more than 99 percent of the economically active population is using, it has reached the saturation past the early stages of formation. Smartphone is equipped with a general purpose OS possible the implementation of high performance environment similar level as a personal computer. Thus, it is a mobile communication terminal scalable which can be removed or installed various applications. Such extensibility, it is possible to use different applications through the Apps store. Also, it is also possible various services which are location based service. In this research, we aim to analyze factors influencing of cloud computing service on reusing intention of smartphone. In addition, we aim to analyze the causal relationship from perceived value to reusing intention. This study suggests practical and theoretical implications based on the results.

  • PDF

Sub-Frame Analysis-based Object Detection for Real-Time Video Surveillance

  • Jang, Bum-Suk;Lee, Sang-Hyun
    • International Journal of Internet, Broadcasting and Communication
    • /
    • v.11 no.4
    • /
    • pp.76-85
    • /
    • 2019
  • We introduce a vision-based object detection method for real-time video surveillance system in low-end edge computing environments. Recently, the accuracy of object detection has been improved due to the performance of approaches based on deep learning algorithm such as Region Convolutional Neural Network(R-CNN) which has two stage for inferencing. On the other hand, one stage detection algorithms such as single-shot detection (SSD) and you only look once (YOLO) have been developed at the expense of some accuracy and can be used for real-time systems. However, high-performance hardware such as General-Purpose computing on Graphics Processing Unit(GPGPU) is required to still achieve excellent object detection performance and speed. To address hardware requirement that is burdensome to low-end edge computing environments, We propose sub-frame analysis method for the object detection. In specific, We divide a whole image frame into smaller ones then inference them on Convolutional Neural Network (CNN) based image detection network, which is much faster than conventional network designed forfull frame image. We reduced its computationalrequirementsignificantly without losing throughput and object detection accuracy with the proposed method.

An Edge AI Device based Intelligent Transportation System

  • Jeong, Youngwoo;Oh, Hyun Woo;Kim, Soohee;Lee, Seung Eun
    • Journal of information and communication convergence engineering
    • /
    • v.20 no.3
    • /
    • pp.166-173
    • /
    • 2022
  • Recently, studies have been conducted on intelligent transportation systems (ITS) that provide safety and convenience to humans. Systems that compose the ITS adopt architectures that applied the cloud computing which consists of a high-performance general-purpose processor or graphics processing unit. However, an architecture that only used the cloud computing requires a high network bandwidth and consumes much power. Therefore, applying edge computing to ITS is essential for solving these problems. In this paper, we propose an edge artificial intelligence (AI) device based ITS. Edge AI which is applicable to various systems in ITS has been applied to license plate recognition. We implemented edge AI on a field-programmable gate array (FPGA). The accuracy of the edge AI for license plate recognition was 0.94. Finally, we synthesized the edge AI logic with Magnachip/Hynix 180nm CMOS technology and the power consumption measured using the Synopsys's design compiler tool was 482.583mW.

Implementation of handwritten digit recognition CNN structure using GPGPU and Combined Layer (GPGPU와 Combined Layer를 이용한 필기체 숫자인식 CNN구조 구현)

  • Lee, Sangil;Nam, Kihun;Jung, Jun Mo
    • The Journal of the Convergence on Culture Technology
    • /
    • v.3 no.4
    • /
    • pp.165-169
    • /
    • 2017
  • CNN(Convolutional Nerual Network) is one of the algorithms that show superior performance in image recognition and classification among machine learning algorithms. CNN is simple, but it has a large amount of computation and it takes a lot of time. Consequently, in this paper we performed an parallel processing unit for the convolution layer, pooling layer and the fully connected layer, which consumes a lot of handling time in the process of CNN, through the SIMT(Single Instruction Multiple Thread)'s structure of GPGPU(General-Purpose computing on Graphics Processing Units).And we also expect to improve performance by reducing the number of memory accesses and directly using the output of convolution layer not storing it in pooling layer. In this paper, we use MNIST dataset to verify this experiment and confirm that the proposed CNN structure is 12.38% better than existing structure.