Search | Korea Science

Implementation and Performance Analysis of High Performance Computing Library for Parallel Processing (병렬처리를 위한 고성능 라이브러리의 구현과 성능 평가)

김영태;이용권
- Journal of KIISE:Computer Systems and Theory
- /
- v.31 no.7
- /
- pp.379-386
- /
- 2004
We designed a portable parallel library, HPCL (High Performance Computing Library), with the following objectives: (1) to keep a close relationship between the parallel code and the original sequential code, which will help with future versions of the sequential code, and (2) to enhance the performance of the parallel code. The library is an interface, written in C and Fortran, between MPI (Message Passing Interface) and parallel programs in Fortran. Performance results were measured on clusters of PCs and on an IBM SP4.

Parallel FFT and Quick-Merge Sort on the Reflective Memory Networked Computers and a Cluster of Work-stations

Lee, Changhun;Kwon, Wook-Hyun
- Institute of Control, Robotics and Systems (제어로봇시스템학회): Conference Proceedings
- /
- 2002.10a
- /
- pp.94.1-94
- /
- 2002
This paper is concerned with parallel FFT and Quick-Merge Sort. Both are implemented on computers interconnected by VMIC 5579 reflective memory and on a cluster of workstations (PCs) interconnected via Fast Ethernet. The Message Passing Interface (MPI) parallel library is used for communication in the cluster of workstations. An improved parallel FFT is also presented that decreases the execution time when the number of hosts is small. Distributed shared memory (DSM), VMIC 5579 reflective memory (RM), clusters of workstations (COW), and the MPI parallel library are described.
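The abstract does not spell out how Quick-Merge Sort works. The following is a minimal sketch under the assumption that each host quicksorts its own slice of the data and the sorted slices are then merged. The function names are hypothetical, and a Python thread pool stands in for the hosts, so no MPI or reflective-memory communication is modeled.

```python
import heapq
from concurrent.futures import ThreadPoolExecutor


def quicksort(items):
    """Plain (not in-place) quicksort used for each host's local slice."""
    if len(items) <= 1:
        return list(items)
    pivot = items[len(items) // 2]
    less = [x for x in items if x < pivot]
    equal = [x for x in items if x == pivot]
    greater = [x for x in items if x > pivot]
    return quicksort(less) + equal + quicksort(greater)


def quick_merge_sort(data, num_hosts=4):
    """Split data into at most num_hosts slices, sort each, then merge."""
    if not data:
        return []
    chunk = -(-len(data) // num_hosts)  # ceiling division
    parts = [data[i:i + chunk] for i in range(0, len(data), chunk)]
    # Assumption: each slice is sorted independently, one slice per host.
    with ThreadPoolExecutor(max_workers=num_hosts) as pool:
        sorted_parts = list(pool.map(quicksort, parts))
    # k-way merge of the already sorted slices.
    return list(heapq.merge(*sorted_parts))
```

In a real cluster the merge step is where communication cost appears, which is presumably why the interconnect matters for the comparison the paper reports.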

A STUDY OF THE APPLICATION OF DELAUNAY GRID GENERATION ON GPU USING CUDA LIBRARY (GPU Library CUDA를 이용한 효율적인 Delaunay 격자 생성에 관한 연구)

Song, J.H.;Kang, S.H.;Kim, G.M.;Kim, B.S.
- Korean Society for Computational Fluids Engineering (한국전산유체공학회): Conference Proceedings
- /
- 2011.05a
- /
- pp.194-198
- /
- 2011
This study examines an efficient algorithm for Delaunay triangulation of a set of points that can be used in GPU-based parallel computation. The developed algorithm is programmed using the CUDA library, and the program takes full advantage of parallel computations that are performed concurrently on each of the GPU threads. The results of the partitioned triangulation collected from the GPU computation require proper stitching between neighboring partitions and calculation of connectivities among triangular cells on the CPU. The effect of the number of threads on the efficiency and total duration of Delaunay grid generation is studied. It is also shown that GPU computing using CUDA is feasible for Delaunay grid generation and that it reduces the total time required to triangulate a large number of points compared with sequential CPU-based triangulation programs.

Solid-phase Parallel Synthesis of a Novel N-[Alkylsulfonamido-spiro(2H-1-benzopyran-2,4-piperidine)-6-yl] substituted Amide and Amine Drug-like Libraries

Kim, Ji-Hye;Gong, Young-Dae;Lee, Gee-Hyung;Seo, Jin-Soo
- Bulletin of the Korean Chemical Society
- /
- v.33 no.1
- /
- pp.128-136
- /
- 2012
We report the solid-phase construction of a 222-member library of novel N-[alkylsulfonamido-spiro(2H-1-benzopyran-2,4-piperidine)-6-yl] substituted amide 1A and amine 1B derivatives. The polymer-bound N-[alkylsulfonamido-spiro(2H-1-benzopyran-2,4-piperidine)-6-yl] substituted amide 9 and amine 10 derivatives were obtained by a first diversity-generation step with various acid chlorides and alkyl halides. Further reactions of resins 9 and 10 with substituted sulfonyl chlorides produced the desired N-[alkylsulfonamido-spiro(2H-1-benzopyran-2,4-piperidine)-6-yl] substituted amide 1A and amine 1B analogues.
https://doi.org/10.5012/bkcs.2012.33.1.128

The Mixed Finite Element Analysis for Nearly Incompressible and Impermeable Porous Media Using Parallel Algorithm (병렬알고리즘 이용한 비압축, 비투과성 포화 다공질매체의 혼합유한요소해석)

Tak, Moon-Ho;Kang, Yoon-Sik;Park, Tae-Hyo
- Journal of the Computational Structural Engineering Institute of Korea
- /
- v.23 no.4
- /
- pp.361-368
- /
- 2010
In this paper, a parallel algorithm using the MPI (Message-Passing Interface) library is introduced to improve the numerical efficiency of the staggered method for nearly incompressible and impermeable porous media, which was introduced by Park and Tak (2010). The porous media theory and the staggered method are also briefly introduced. Moreover, we describe the MPI library's blocking, non-blocking, and collective communication, and propose combining the staggered method with blocking and non-blocking MPI communication. We then present how to allocate CPUs to the staggered method and the MPI library, which affects the numerical efficiency of solving for the unknown variables in nearly incompressible and impermeable porous media. Finally, serial and parallel solutions are compared and verified on a two-dimensional saturated porous model for varying numbers of FEM meshes.

Performance Analysis of a Parallel Mesh Smoothing Algorithm using Graph Coloring and OpenMP (그래프 컬러링과 OpenMP를 이용한 병렬 메쉬 스무딩 알고리즘의 성능 분석)

Shin, Myeonggyu;Kim, Jibum
- Journal of the Institute of Electronics and Information Engineers
- /
- v.53 no.6
- /
- pp.80-87
- /
- 2016
We propose a parallel mesh smoothing algorithm using graph coloring and the OpenMP library for shared-memory many-core computer architectures. The proposed algorithm partitions a mesh into independent sets and performs parallel mesh smoothing using OpenMP. We study the effect of various graph coloring and color reordering algorithms on the efficiency of the proposed parallel mesh smoothing algorithm. We also investigate the influence of various OpenMP loop scheduling methods on parallel mesh smoothing efficiency.
https://doi.org/10.5573/ieie.2016.53.6.080
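The idea of partitioning a mesh into independent sets can be illustrated with a small sketch. It makes several assumptions that are not taken from the paper: a greedy coloring, simple Laplacian smoothing (each free vertex moves to the average of its neighbors), and a Python thread pool standing in for an OpenMP parallel loop. All function names are hypothetical.

```python
from concurrent.futures import ThreadPoolExecutor


def greedy_coloring(adjacency):
    """Give each vertex the smallest color not used by its neighbors."""
    colors = {}
    for v in sorted(adjacency):
        used = {colors[n] for n in adjacency[v] if n in colors}
        c = 0
        while c in used:
            c += 1
        colors[v] = c
    return colors


def color_classes(colors):
    """Group vertices by color; each group is an independent set."""
    classes = {}
    for v, c in colors.items():
        classes.setdefault(c, []).append(v)
    return [classes[c] for c in sorted(classes)]


def smooth(positions, adjacency, fixed, sweeps=1):
    """Laplacian smoothing, one color class at a time."""
    positions = dict(positions)
    classes = color_classes(greedy_coloring(adjacency))

    def new_position(v):
        nbrs = adjacency[v]
        x = sum(positions[n][0] for n in nbrs) / len(nbrs)
        y = sum(positions[n][1] for n in nbrs) / len(nbrs)
        return v, (x, y)

    with ThreadPoolExecutor() as pool:
        for _ in range(sweeps):
            for cls in classes:
                movable = [v for v in cls if v not in fixed]
                # No two vertices in a class are adjacent, so none of them
                # reads a position that another one in the class writes.
                for v, p in list(pool.map(new_position, movable)):
                    positions[v] = p
    return positions
```

Because every vertex in a color class depends only on vertices of other colors, the inner loop over a class is the part that a loop-level parallel construct could distribute across threads.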

Application for parallel computation for finite element analysis of welding processes (용접공정 유한요소 해석의 병렬 처리 적용)

임세영;김주완;최강혁
- Proceedings of the KWS Conference
- /
- 2004.05a
- /
- pp.273-275
- /
- 2004
A parallel multi-frontal solver is developed for finite element analysis of an arc-welding process, which entails phase evolution, heat transfer, and deformation of the structure. We verify the code by comparison with a commercial code, SYSWELD. Attention is focused on the implementation of the parallel solver using the MPI library, on the speedup obtained by parallel computation, and on the effectiveness of the solver in welding applications.

UTILIZATION OF FUZZY AND VOLTERRA ALGORITHM FOR 3D BATHYMETRY SIMULATION FROM TOPSAR POLARISED DATA

Marghany, Maged;Hussien, Mohd. Lokman
- Proceedings of the KSRS Conference
- /
- 2003.11a
- /
- pp.432-434
- /
- 2003
The main objective of this research is to use parallel fuzzy arithmetic to construct ocean bathymetry from polarized remote sensing data such as TOPSAR images. To this end, a parallel library for fuzzy arithmetic has been developed. The three-dimensional surface modeling consisted of the Volterra model, a non-linear model that constructs a global topological structure between the data points and is used to support an approximation of the real surface. The output of the parallel library was a digital terrain model of the bathymetry along the coastal waters of Kuala Terengganu, Malaysia. This paper describes the principles behind the fuzzy algorithm, indicates the types of application for which it might be useful, comments on its accuracy, and gives an example of an application.

Parallel VHDL Simulation on IBM SP2 and SGI Origin 2000 (IBM SP2와 SGI Origin 2000에서의 병렬 VHDL 시뮬레이션)

정영식
- Journal of the Korea Society for Simulation
- /
- v.7 no.1
- /
- pp.69-83
- /
- 1998
In this paper, we present the results of running parallel VHDL simulation on typical MPP (Massively Parallel Processor) systems such as IBM SP2 and SGI Origin 2000. The parallel simulation uses the synchronous protocol, and the parallel program is implemented using MPI (Message Passing Interface), which is based on the message passing model, so that it can run in any parallel programming environment that supports MPI, a standard communication library. GVT (Global Virtual Time) computation for the parallel simulation is based on global broadcasting with MPI_Bcast(), a standard MPI function, and on piggybacking. Our benchmark shows that as the size of the VHDL model grows, the parallel simulation performs better than the sequential simulation. In addition, we show an indirect comparison between IBM SP2 and SGI Origin 2000, obtained by running the same application on both.

Lock-free unique identifier allocation for parallel macro expansion

Son, Bum-Jun;Ahn, Ki Yung
- Journal of the Korea Society of Computer and Information
- /
- v.27 no.4
- /
- pp.1-8
- /
- 2022
In this paper, we propose a more effective unique identifier allocation method for macro expansion in a single-process multicore parallel computing environment that does not require locks. Our key idea is to remove sequential dependencies using the remainder operation. We confirmed that our lock-free method is suitable for improving the performance of parallel macro expansion through the following benchmark: we patched an existing library, which is based on sequential unique identifier allocation, with our proposed method, and compared the performance of the same program using the two versions of the library, before and after the patch.
https://doi.org/10.9708/jksci.2022.27.04.001
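The abstract names the remainder operation but not the exact scheme. One plausible reading, sketched here as an assumption rather than as the authors' method, is that worker k out of n only hands out identifiers whose remainder modulo n is k. Each worker then counts privately and never touches shared state. The function names are hypothetical, and Python threads stand in for multicore workers.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import count


def make_allocator(worker_index, num_workers):
    """Return a function producing ids with id % num_workers == worker_index."""
    # Private counter: worker_index, worker_index + n, worker_index + 2n, ...
    local = count(start=worker_index, step=num_workers)
    return lambda: next(local)


def expand_macros(worker_index, num_workers, how_many):
    """Stand-in for macro expansion: just request how_many fresh ids."""
    fresh_id = make_allocator(worker_index, num_workers)
    return [fresh_id() for _ in range(how_many)]


def allocate_in_parallel(num_workers=4, ids_per_worker=1000):
    """Run all workers concurrently and return one id list per worker."""
    with ThreadPoolExecutor(max_workers=num_workers) as pool:
        futures = [
            pool.submit(expand_macros, k, num_workers, ids_per_worker)
            for k in range(num_workers)
        ]
        return [f.result() for f in futures]
```

Since the residue classes modulo n are disjoint, identifiers from different workers cannot collide, so no lock or atomic counter is needed. The trade-off in this sketch is that the identifiers are no longer issued in one global sequence.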

Search Result 188, Processing Time 0.038 seconds
