• Title/Summary/Keyword: MPI system

Development of Numerical Tool for the DNS/LES of Turbulent Flow for Frictional Drag Reduction (마찰저항감소를 위한 난류유동의 DNS/LES 해석기술의 개발)

  • ;;Osama A. El-Samni
    • Journal of the Society of Naval Architects of Korea / v.41 no.1 / pp.47-54 / 2004
  • Reducing the frictional drag of a ship is of prime importance for the design and production of high-value, high-tech ships. This study therefore developed reliable numerical tools to identify the friction drag reduction mechanism in the turbulent boundary layer on the ship surface and to deduce the optimum reduction technique by numerical experiment. The developed LES and DNS tools were applied to simulate turbulent channel flow. The results matched previous results well, both qualitatively and quantitatively. The code was parallelized with MPI (Message Passing Interface) to speed up the simulation and to obtain accurate results on fine grid systems, and the computational efficiency of the parallelization was verified.

Direct Methods for Linear System on Distributed Memory Parallel Computers

  • Nishimura, S.;Shigehara, T.;Mizoguchi, H.;Mishima, T.;Kobayashi, H.
    • Proceedings of the IEEK Conference / 2000.07a / pp.333-336 / 2000
  • We discuss direct methods (Gauss-Jordan and Gaussian elimination) for solving linear systems on distributed-memory parallel computers. We show that the so-called row-cyclic storage gives the best performance among the three standard data storage schemes (row-cyclic, column-cyclic and cyclic-cyclic). We also show that Gauss-Jordan elimination, rather than Gaussian elimination, is highly efficient for the direct solution of linear systems in parallel processing, even though it requires more arithmetic operations than Gaussian elimination. Numerical experiments are performed on a HITACHI SR12201 with the standard libraries MPI and BLAS.
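
The row-cyclic idea in the abstract above can be sketched in plain Python. This is a single-process simulation, not the authors' code: the names `owner_of_row` and `gauss_jordan_row_cyclic` are hypothetical, no MPI or BLAS calls are made, and no pivoting is performed. It assumes that row i is stored on process i mod P, and that at each step the pivot row is shared with all processes, each of which then updates only the rows it owns.

```python
from fractions import Fraction


def owner_of_row(i, num_procs):
    """Row-cyclic storage (assumed mapping): row i lives on process i mod num_procs."""
    return i % num_procs


def gauss_jordan_row_cyclic(a, b, num_procs):
    """Solve a x = b by Gauss-Jordan elimination, simulating row-cyclic ownership.

    Illustrative sketch only: runs in one process, uses exact fractions,
    and raises on a zero pivot because it does no pivoting.
    """
    n = len(a)
    # Augmented matrix [a | b] with exact arithmetic.
    rows = [[Fraction(v) for v in a[i]] + [Fraction(b[i])] for i in range(n)]
    # Rows held by each simulated process.
    local_rows = {
        p: [i for i in range(n) if owner_of_row(i, num_procs) == p]
        for p in range(num_procs)
    }
    for k in range(n):
        pivot = rows[k][k]
        if pivot == 0:
            raise ZeroDivisionError("zero pivot; this sketch does no pivoting")
        # The owner of row k normalizes it; in a real code it would be broadcast.
        pivot_row = [v / pivot for v in rows[k]]
        rows[k] = pivot_row
        # Every process eliminates column k from all of its rows except row k.
        for p in range(num_procs):
            for i in local_rows[p]:
                if i != k:
                    factor = rows[i][k]
                    rows[i] = [x - factor * y for x, y in zip(rows[i], pivot_row)]
    # The matrix is now the identity, so the last column is the solution.
    return [rows[i][n] for i in range(n)]
```

Because Gauss-Jordan updates every row other than the pivot row at every step, each simulated process has work to do throughout the elimination. That is one plausible reason for the efficiency the abstract reports, though the abstract itself does not give the explanation.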

The Implementation of Parallel Bidirectional Search on Cluster System (클러스터 시스템에서의 양방향 병렬 탐색의 구현)

  • Cha, Kwang-Ho;Hong, Jeong-Woo;Kwak, Jai-Seung;Byeon, Ok-Hwan
    • Proceedings of the Korea Information Processing Society Conference / 2002.11a / pp.189-192 / 2002
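  • When a parallel algorithm is applied to a particular problem, it is sometimes possible to approach the problem in a way that differs from existing sequential algorithms. Parallel bidirectional search in the field of artificial intelligence is one such example, and implementations and performance measurements targeting specific supercomputers have been reported. In this paper, we implemented and tested a parallel bidirectional search algorithm on a cluster system, which has recently emerged as a new area of parallel systems. The implementation uses the Message Passing Interface (MPI), which is widely used on cluster systems, and by testing combinations of various conditions we analyzed the performance of parallel bidirectional search in a cluster system environment.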
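
For readers unfamiliar with bidirectional search, a minimal sequential sketch follows. It is not the paper's MPI implementation, which the abstract does not describe. The function name `bidirectional_bfs` and the adjacency-dictionary graph format are assumptions made for illustration, and the graph is assumed to be undirected and unweighted. One search grows from the start node and one from the goal node, and the procedure stops when the two meet. In a parallel setting the two directions are natural candidates for separate processes.

```python
def bidirectional_bfs(graph, start, goal):
    """Length of a shortest path between start and goal, or None if unreachable.

    graph: dict mapping each node to an iterable of neighbours (undirected).
    Sequential illustration only; both searches advance one full level at a time.
    """
    if start == goal:
        return 0
    dist_fwd = {start: 0}   # distances from start
    dist_bwd = {goal: 0}    # distances from goal
    frontier_fwd = [start]
    frontier_bwd = [goal]
    while frontier_fwd and frontier_bwd:
        # Expand the smaller frontier to keep the explored region small.
        forward = len(frontier_fwd) <= len(frontier_bwd)
        if forward:
            frontier, dist_this, dist_other = frontier_fwd, dist_fwd, dist_bwd
        else:
            frontier, dist_this, dist_other = frontier_bwd, dist_bwd, dist_fwd
        next_frontier = []
        best = None
        for node in frontier:
            for nb in graph.get(node, ()):
                if nb in dist_other:
                    # The two searches meet across the edge (node, nb).
                    cand = dist_this[node] + 1 + dist_other[nb]
                    if best is None or cand < best:
                        best = cand
                if nb not in dist_this:
                    dist_this[nb] = dist_this[node] + 1
                    next_frontier.append(nb)
        if best is not None:
            return best
        if forward:
            frontier_fwd = next_frontier
        else:
            frontier_bwd = next_frontier
    return None
```

A small usage example with a hypothetical graph: for `{"a": ["b"], "b": ["a", "c"], "c": ["b"]}`, calling `bidirectional_bfs(graph, "a", "c")` returns 2.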

Proposal of Container-Based HPC Structures and Performance Analysis

  • Yong, Chanho;Lee, Ga-Won;Huh, Eui-Nam
    • Journal of Information Processing Systems / v.14 no.6 / pp.1398-1404 / 2018
  • High-performance computing (HPC) gives researchers a powerful means of solving computationally intensive problems, such as those in the math and medical fields. When an HPC platform is provided as a service, users may run into unexpected obstacles in developing and running applications because of restricted development environments and dependencies. In this context, operating-system-level virtualization can give an HPC service lightweight virtualization and consistent DevOps environments. This paper therefore proposes three typical HPC structures for container environments built with HPC container and Docker. The three structures focus on smooth integration with the existing HPC job framework, the message passing interface (MPI). Lastly, the performance of the structures is analyzed with the High Performance Linpack benchmark in terms of the degradation of network communication performance under Docker.

Development of a flux emergence simulation using parallel computing

  • Lee, Hwanhee;Magara, Tetsuya
    • The Bulletin of The Korean Astronomical Society / v.44 no.2 / pp.71.1-71.1 / 2019
  • The solar magnetic field comes from the solar interior and is related to various phenomena on the Sun. To understand this process, many studies have reproduced its evolution using a single flux rope. In this study, we are interested in the emergence of two flux ropes and their evolution, which takes longer than the emergence of a single flux rope. To model it, we develop a flux emergence simulation that applies parallel computing to reduce the computation time in a wider domain. The original simulation code was written in Fortran 77. We convert it to Fortran 90 with the Message Passing Interface (MPI). The results of the original and new simulations are compared on the NEC SX-Aurora TSUBASA, which is a vector engine processor. The parallelized version is faster than a single-core run and shows that large amounts of calculation can be handled. Based on this model, we can construct a complex flux emergence system, such as the evolution of two magnetic flux ropes.

A Study on the Event Processing for Electronic Control (전자제어의 Event 처리방법에 관한 연구)

  • 이종승;이중순;정성식;하종률
    • Transactions of the Korean Society of Automotive Engineers / v.6 no.3 / pp.115-122 / 1998
  • In digital engine control, timings such as ignition are based on the crankshaft angle. It is therefore very important to detect the crankshaft angle accurately for optimal ignition timing. Sequential multi-point injection (MPI) systems, which have independent injection events for each cylinder, are used to inject an accurate quantity of fuel and to respond promptly to varying engine status. In this study the distributorless ignition timing. A crankshaft position sensor has been installed so that it generates a number of pulses per crankshaft revolution, permitting accurate detection of the crankshaft angle. An event detecting algorithm has been developed that detects the crankshaft pulses generated by the position sensor, and the software outputs the required control signals at given crank angle values. We found that the hardware method is the best way to increase the performance of the control system, because the event detecting duration T(1+2)max becomes zero.

Injection Molding 3D CAE Applications for Estimating Filling Imbalance Using a New Runner system meshing (새로운 3D 멧싱 기법으로 충전 불균형을 예측할 수 있는 사출성형 CAE 활용)

  • Go, Seung-Woo;Jung, Su-Jung;Seo, Sang-hun;Jeong, Yeong-Deug
    • Design & Manufacturing / v.6 no.2 / pp.31-36 / 2012
  • Nowadays CAE is used for almost all injection molding designs in order to find the best injection conditions. Almost all CAE uses a 2-D mesh, but CAE with a 2-D mesh cannot indicate phenomena such as jetting, flow marks and filling imbalance in multi-cavity molds. In this study, we suggest a new 3D meshing method that can indicate the filling imbalance in a geometrically balanced runner system with Mold Flow MPI 6.1, and we found that calculation time is saved. As a feasibility study, we verified that Melt Flipper, RC Pin, etc. showed balanced filling behavior. For the geometrically balanced runner system and Melt Flipper, filling imbalance was indicated more accurately.

An Optimal Reliability-Redundancy Allocation Problem by using Hybrid Parallel Genetic Algorithm (하이브리드 병렬 유전자 알고리즘을 이용한 최적 신뢰도-중복 할당 문제)

  • Kim, Ki-Tae;Jeon, Geon-Wook
    • IE interfaces / v.23 no.2 / pp.147-155 / 2010
  • Reliability allocation is the problem of determining the reliability of subsystems and components so as to achieve a target system reliability. Determining both the optimal component reliability and the number of redundant components, allowing mixed components, to maximize system reliability under resource constraints is called the reliability-redundancy allocation problem (RAP). The main objective of this study is to suggest a mathematical programming model and a hybrid parallel genetic algorithm (HPGA) for the reliability-redundancy allocation problem that decides both the optimal component reliability and the number of redundant components to maximize system reliability under cost and weight constraints. The global optimal solutions of each example are obtained using CPLEX 11.1. The component structure, reliability, cost, and weight were computed using HPGA, and the results were compared with those of existing metaheuristics such as Genetic Algorithm (GA), Tabu Search (TS), Ant Colony Optimization (ACO), and Immune Algorithm (IA) to evaluate the performance of HPGA. The suggested algorithm gives the same or better solutions than existing algorithms, because it evolves several sub-populations in parallel and improves solutions through swap, 2-opt, and interchange processes. To quantify the improvement in reliability of the suggested algorithm over existing studies, the maximum possible improvement (MPI) measure was applied.

Injection Molding 3D CAE Applications for Estimating Filling Imbalance Using a New Runner system meshing (새로운 3D 멧싱 기법으로 충전 불균형을 예측할수 있는 사출성형 CAE 활용)

  • Go, Seung-Woo;Jung, Su-Jung;Seo, Sang-Hun;Jeong, Yeong-Deug
    • 한국금형공학회:학술대회논문집 / 2008.06a / pp.121-127 / 2008
  • Nowadays CAE is used for almost all injection molding designs in order to find the best injection conditions. Almost all CAE uses a 2-D mesh, but CAE with a 2-D mesh cannot indicate phenomena such as jetting, flow marks and filling imbalance in multi-cavity molds. In this study, we suggest a new 3D meshing method that can indicate the filling imbalance in a geometrically balanced runner system with Mold Flow MPI 6.1, and we found that calculation time is saved. As a feasibility study, we verified that Melt Flipper, RC Pin, etc. showed balanced filling behavior. For the geometrically balanced runner system and Melt Flipper, filling imbalance was indicated more accurately.

R Based Parallelization of a Climate Suitability Model to Predict Suitable Area of Maize in Korea (국내 옥수수 재배적지 예측을 위한 R 기반의 기후적합도 모델 병렬화)

  • Hyun, Shinwoo;Kim, Kwang Soo
    • Korean Journal of Agricultural and Forest Meteorology / v.19 no.3 / pp.164-173 / 2017
  • Alternative cropping systems are one option for climate change adaptation. Suitable areas for a crop can be identified using a climate suitability model. The EcoCrop model has been used to assess the climate suitability of crops using monthly climate surfaces, e.g., the digital climate map at high spatial resolution. Still, a high-performance computing approach is needed for assessing climate suitability in the complex terrain of Korea, which requires considerably large climate data sets. The objectives of this study were to implement a script for R, an open-source statistical analysis platform, in order to run the EcoCrop model in a parallel computing environment, and to assess the climate suitability of maize using digital climate maps at high spatial resolution, e.g., 1 km. The total running time decreased as the number of CPU (Central Processing Unit) cores increased, although the speedup was not linear in the number of cores. For example, the wall clock time for assessing the climate suitability index at 1 km spatial resolution was reduced by 90% with 16 CPU cores. However, computing the climate suitability index took about 1.5 times as long as the theoretical time for the given number of CPU cores. Implementing a climate suitability assessment system based on MPI (Message Passing Interface) would allow support for the digital climate map at ultra-high spatial resolution, e.g., 30 m, which would help the site-specific design of cropping systems for climate change adaptation.
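
The timing figures in the abstract above can be related by simple arithmetic. The sketch below uses hypothetical helper names and assumes that "theoretical time" means the serial time divided by the number of cores. Under that assumption, a 90% reduction in wall clock time on 16 cores corresponds to a speedup of about 10, a parallel efficiency of about 0.63, and a run time about 1.6 times the ideal, roughly in line with the "about 1.5 times" quoted.

```python
def speedup_from_reduction(reduction):
    """Speedup implied by a fractional reduction in wall clock time (0 <= reduction < 1)."""
    return 1.0 / (1.0 - reduction)


def parallel_efficiency(speedup, cores):
    """Fraction of the ideal linear speedup actually achieved."""
    return speedup / cores


def ratio_to_ideal_time(reduction, cores):
    """Measured time divided by the ideal time (serial time / cores)."""
    return (1.0 - reduction) * cores


# Figures taken from the abstract: 90% reduction with 16 CPU cores.
speedup = speedup_from_reduction(0.90)
efficiency = parallel_efficiency(speedup, 16)
ratio = ratio_to_ideal_time(0.90, 16)
print(f"speedup ~ {speedup:.1f}, efficiency ~ {efficiency:.2f}, time vs ideal ~ {ratio:.1f}x")
```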