• Title/Summary/Keyword: message passing interface (MPI)

Numerical Analysis of Anode Sheath Structure Shift in an Anode-layer Type Hall Thruster

  • Yokota, Shigeru;Komurasaki, Kimiya;Arakawa, Yoshihiro
    • Proceedings of the Korean Society of Propulsion Engineers Conference / 2008.03a / pp.602-605 / 2008
  • The anode sheath structure in the hollow anode of an anode-layer type Hall thruster was numerically computed using a fully kinetic 2D3V Particle-in-Cell and Direct Simulation Monte Carlo (PIC-DSMC) code. By treating both ions and electrons as particles, the anode surface region, which is electrically non-neutral, was analyzed. To analyze this region in detail, the calculation code was parallelized using the Message Passing Interface (MPI). The code successfully simulated the discharge current oscillation. In the low magnetic induction case, an ion sheath appears on the anode surface because enough ionization to maintain the plasma occurs in the anode hollow. As the magnetic induction increases, the main ionization region moves outside the anode. At the same time, the anode sheath voltage decreases. In the high magnetic induction case, an electron sheath appears on the anode surface periodically because the ionization occurs mainly in the discharge channel. This shift in the anode sheath condition can be explained using a simple sheath model.

Proposal of Container-Based HPC Structures and Performance Analysis

  • Yong, Chanho;Lee, Ga-Won;Huh, Eui-Nam
    • Journal of Information Processing Systems / v.14 no.6 / pp.1398-1404 / 2018
  • High-performance computing (HPC) gives researchers a powerful means of solving computationally intensive problems, such as those in mathematics and medicine. When an HPC platform is provided as a service, users may face unexpected obstacles in developing and running applications because of restricted development environments and dependencies. In this context, operating-system-level virtualization can give an HPC service lightweight virtualization and consistency across DevOps environments. This paper therefore proposes three typical HPC structures for container environments built with an HPC container and Docker. The three structures focus on smooth integration with the existing HPC job framework, the message passing interface (MPI). Lastly, the performance of the structures is analyzed with the High Performance Linpack benchmark in terms of the performance degradation of network communications under Docker.

Development of a flux emergence simulation using parallel computing

  • Lee, Hwanhee;Magara, Tetsuya
    • The Bulletin of The Korean Astronomical Society / v.44 no.2 / pp.71.1-71.1 / 2019
  • The solar magnetic field comes from the solar interior and is related to various phenomena on the Sun. To understand this process, many studies have been conducted to produce its evolution using a single flux rope. In this study, we are interested in the emergence of two flux ropes and their evolution, which takes longer than the emergence of a single flux rope. To model it, we develop a flux emergence simulation that applies parallel computing to reduce the computation time in a wider domain. The original simulation code was written in Fortran 77. We converted it to Fortran 90 with the Message Passing Interface (MPI). The results of the original and new simulations are compared on the NEC SX-Aurora TSUBASA, which is a vector engine processor. The parallelized version is faster than running on a single core, which suggests that it can handle large amounts of calculation. Based on this model, we can construct a complex flux emergence system, such as the evolution of two magnetic flux ropes.

AN ASSESSMENT OF PARALLEL PRECONDITIONERS FOR THE INTERIOR SPARSE GENERALIZED EIGENVALUE PROBLEMS BY CG-TYPE METHODS ON AN IBM REGATTA MACHINE

  • Ma, Sang-Back;Jang, Ho-Jong
    • Journal of applied mathematics & informatics / v.25 no.1_2 / pp.435-443 / 2007
  • Computing the interior spectrum of large sparse generalized eigenvalue problems $Ax = \lambda Bx$, where $A$ and $B$ are large, sparse, and SPD (Symmetric Positive Definite), is often required in areas such as structural mechanics and quantum chemistry, to name a few. Recently, CG-type methods have been found useful and very amenable to parallel computation for very large problems. Also, as in the case of linear systems, a proper choice of preconditioning is known to accelerate the rate of convergence. After the smallest eigenpair is found, we use the orthogonal deflation technique to find the next m-1 eigenvalues, which is also suitable for parallelization. This offers advantages over Jacobi-Davidson methods with partial shifts, which require re-computation of the preconditioner matrix with new shifts. We consider as preconditioners Incomplete LU (ILU)(0) in two variants, successive over-relaxation (SOR), and Point-symmetric SOR (SSOR). We set m to be 5. We conducted our experiments on matrices from discretizations of partial differential equations by the finite difference method. The generated matrices have dimensions up to 4 million, and the total number of processors is 32. The MPI (Message Passing Interface) library was used for interprocessor communications. Our results show that in general the Multi-Color ILU(0) gives the best performance.
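
For readers unfamiliar with the approach, CG-type eigensolvers for an SPD pair are commonly formulated as minimizing the generalized Rayleigh quotient, with deflation imposed as a $B$-orthogonality constraint against the eigenvectors already found. The block below is a sketch of that standard formulation, not a statement of the authors' exact scheme; the symbols $\rho$, $x_i$, and $\lambda_k$ are notation introduced here for illustration.

$$
\rho(x) = \frac{x^{T} A x}{x^{T} B x}, \qquad
\lambda_1 = \min_{x \neq 0} \rho(x), \qquad
\lambda_k = \min_{\substack{x \neq 0 \\ x^{T} B x_i = 0,\; i < k}} \rho(x), \quad k = 2, \dots, m .
$$

Each minimization needs only matrix-vector products with $A$ and $B$, inner products, and applications of the preconditioner, which is why such methods are usually regarded as well suited to distributed-memory parallelization.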

Dynamic simulation of squeezing flow of ER fluids using parallel processing

  • Kim, Do-Hoon;Chu, Sang-Hyon;Ahn, Kyung-Hyun;Lee, Seung-Jong
    • Korea-Australia Rheology Journal / v.11 no.3 / pp.233-240 / 1999
  • To understand the flow behavior of Electrorheological (ER) fluid, dynamic simulation has been performed intensively over the last decade. When shear flow is applied, it is easy to carry out the simulation with a relatively small number of particles because of the periodic boundary condition. For squeezing flow, however, it is not easy to apply the periodic boundary condition, and the number of particles needs to be increased to simulate the ER system more realistically. For this reason, the simulation of ER fluid under squeezing flow has mostly been performed with some representative chains or with an approximation that severely restricts the flow geometry to reduce the computational load. In this study, the Message Passing Interface (MPI), which is one of the most widely used parallel processing techniques, has been employed in a dynamic simulation of ER fluid under squeezing flow. As the number of particles used in the simulation could be increased significantly, the full domain between the electrodes has been covered. The numerical treatment or approximation used to reduce the computational load has been evaluated for its validity and was found to be quite effective. As the number of particles is increased, the fluctuation of the normal stress diminishes, and the prediction in general was found to be qualitatively in good agreement with the experimental results.

Developing a Simulator of the Capture Process in Towed Fishing Gears by Chaotic Fish Behavior Model and Parallel Computing

  • Kim Yong-Hae;Ha Seok-Wun;Jun Yong-Kee
    • Fisheries and Aquatic Sciences / v.7 no.3 / pp.163-170 / 2004
  • A fishing simulator for towed fishing gear was investigated in order to mimic fish behavior in the capture process and to investigate fishing selectivity. A fish behavior model using a psycho-hydraulic wheel activated by stimuli is established to introduce Lorenz chaos equations and a neural network system and to generate the components of realistic fish capture processes. The fish positions within the specified gear geometry are calculated from normalized intensities of the stimuli of the fishing gear components or neighboring fish, and these are then related to the sensitivities and the abilities of the fish. The model is applied to four different towed gears (a bottom trawl, a midwater trawl, a two-boat seine, and an anchovy boat seine) and to the 17 fish species that are mainly caught. The Alpha cluster computer system and Fortran MPI (Message-Passing Interface) parallel programming were used for rapid calculation and mass data processing in this chaotic behavior model. The results of the simulation can be represented as animations of fish movements in relation to the fishing gear, using OpenGL and C graphics programming, and as catch data and selectivity analysis. The results of this simulator closely mimicked field studies of the same gears and can therefore be used in further study of fishing gear design, selectivity prediction, and indoor training systems.

NUMERICAL ANALYSIS OF TWO- AND THREE-DIMENSIONAL SUBSONIC TURBULENT CAVITY FLOWS

  • Choi, Hong-Il;Kim, Jae-Soo
    • Proceedings of the Korean Society for Computational Fluids Engineering Conference / 2007.10a / pp.187-193 / 2007
  • Flight vehicles have cavities such as wheel wells and bomb bays. The flow around a cavity is characterized as unsteady because of the formation and dissipation of vortices caused by the interaction between the freestream shear layer and the cavity internal flow, and because of the generation of shock and expansion waves. Resonance phenomena can damage the structures around the cavity and negatively affect aerodynamic performance and stability. In the present study, numerical analysis was performed for cavity flows using the unsteady compressible three-dimensional Reynolds-Averaged Navier-Stokes (RANS) equations with Wilcox's $k$-$\omega$ turbulence model. The cavity has aspect ratios of 2.5, 3.5 and 4.5 in the two-dimensional case, and the same aspect ratios with a W/D ratio of 2 in the three-dimensional case. The Mach and Reynolds numbers are 0.53 and 1,600,000, respectively. The flow field is observed to oscillate in the "shear layer mode" with a feedback mechanism. Based on the SPL (Sound Pressure Level) analysis of the pressure variation at the cavity trailing edge, the dominant frequency was analyzed and compared with the results of Rossiter's formula. The MPI (Message Passing Interface) parallelized code was used for the calculations on a PC cluster.
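
As a rough illustration of the comparison mentioned in the abstract, Rossiter's semi-empirical formula gives the Strouhal number of the m-th cavity mode as St = fL/U = (m - α) / (M + 1/κ). The sketch below evaluates it at the stated Mach number of 0.53. The constants α = 0.25 and κ = 0.57 are commonly quoted empirical values and are assumptions here, since the abstract does not state which values the authors used; the function names are likewise hypothetical.

```python
def rossiter_strouhal(mode, mach, alpha=0.25, convection_ratio=0.57):
    """Strouhal number f*L/U of a Rossiter mode.

    alpha and convection_ratio (often written kappa) are empirical
    constants; the defaults are commonly quoted values, assumed here.
    """
    return (mode - alpha) / (mach + 1.0 / convection_ratio)


def rossiter_frequency(mode, mach, freestream_speed, cavity_length,
                       alpha=0.25, convection_ratio=0.57):
    """Dimensional frequency in Hz, given U in m/s and L in m."""
    strouhal = rossiter_strouhal(mode, mach, alpha, convection_ratio)
    return strouhal * freestream_speed / cavity_length


if __name__ == "__main__":
    # Mach number taken from the abstract; modes 1-3 shown for illustration.
    for mode in (1, 2, 3):
        print(mode, round(rossiter_strouhal(mode, 0.53), 4))
```

A dominant frequency obtained from the SPL spectrum can be non-dimensionalized with the cavity length and freestream speed and compared against these mode values.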

A NOVEL PARALLEL METHOD FOR SPECKLE MASKING RECONSTRUCTION USING THE OPENMP

  • LI, XUEBAO;ZHENG, YANFANG
    • Journal of The Korean Astronomical Society / v.49 no.4 / pp.157-162 / 2016
  • High-resolution reconstruction techniques, such as speckle masking, have been developed to enhance the spatial resolution of observational images from ground-based solar telescopes. Near real-time reconstruction performance has been achieved on a high performance cluster using the Message Passing Interface (MPI). However, much time is spent reconstructing solar subimages in such a speckle reconstruction. We design and implement a novel parallel method for speckle masking reconstruction of solar subimages on a shared memory machine using OpenMP. Real tests are performed to verify the correctness of our codes. We present the details of several parallel reconstruction steps. The parallel implementation of the various modules shows a large speed increase compared with the single-thread serial implementation, and a speedup of about 2.5 is achieved in one subimage reconstruction. The timing result for reconstructing one subimage of 256×256 pixels shows a clear advantage with a greater number of threads. This novel parallel method can be valuable in real-time reconstruction of solar images, especially after porting to a high performance cluster.

PERFORMANCE ANALYSIS OF THE PARALLEL CUPID CODE IN DISTRIBUTED MEMORY SYSTEM BASED ETHERNET AND INFINIBAND NETWORK

  • Jeon, B.J.;Choi, H.G.
    • Journal of computational fluids engineering / v.19 no.2 / pp.24-29 / 2014
  • In this study, the parallel performance of the CUPID code has been investigated on both Ethernet and InfiniBand network systems to examine the effect of cache memory and network speed. The bi-conjugate gradient solver of the CUPID code has been parallelized using the domain decomposition method and the message passing interface (MPI). It is shown that the parallel performance of the Ethernet system is worse than that of the InfiniBand system because of its slow network speed and small cache memory. It is also found that the parallel performance of each system deteriorates for a small problem because of communication overhead, but the InfiniBand system still performs better than the Ethernet system thanks to its much faster network. For a large problem, the parallel performance depends less on the network system.
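
To make the domain decomposition idea concrete, the sketch below splits a one-dimensional range of cells into contiguous blocks, one per process, and identifies the neighbors with which each process would exchange boundary (halo) values. It is a minimal illustration in plain Python without MPI calls; the helper names `block_range` and `halo_neighbors` are hypothetical, and the abstract does not describe how CUPID actually partitions its mesh.

```python
def block_range(n_cells, n_ranks, rank):
    """Half-open [start, stop) range of cells owned by `rank`.

    Cells are split into contiguous blocks; the first `n_cells % n_ranks`
    ranks receive one extra cell so block sizes differ by at most one.
    """
    base, extra = divmod(n_cells, n_ranks)
    start = rank * base + min(rank, extra)
    stop = start + base + (1 if rank < extra else 0)
    return start, stop


def halo_neighbors(n_ranks, rank):
    """Left and right neighbor ranks in a 1D decomposition (None at the ends)."""
    left = rank - 1 if rank > 0 else None
    right = rank + 1 if rank < n_ranks - 1 else None
    return left, right


if __name__ == "__main__":
    # Illustrative sizes only: 10 cells distributed over 3 processes.
    for rank in range(3):
        print(rank, block_range(10, 3, rank), halo_neighbors(3, rank))
```

In this one-dimensional picture each process exchanges a fixed number of boundary cells regardless of block size, so a smaller problem leaves less computation per message. That is consistent with the communication overhead the abstract reports for small problems.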

Developing a Bioinformatics Tool for Peptide Nucleic Acid (PNA) antisense Technique Utilizing Parallel Computing System

  • Kim Seong-Jo;Jeon Ho-Sang;Hong Seung-Pyo;Kim Hyon-Chang;Kim Han-Jip;Min Churl-K
    • Proceedings of the Korean Information Science Society Conference / 2006.06a / pp.43-45 / 2006
  • Unlike RNA interference, whose usage is limited to eukaryotic cells, the Peptide Nucleic Acid (PNA) technique is applicable to both eukaryotic and prokaryotic cells. PNA has been proven to be an effective agent for blocking gene expression and has several advantages over other antisense techniques. Here we developed parallel computing software that provides the ideal sequences for designing PNA oligos so as to prevent off-target effects. We applied a new approach in our location-finding algorithm, which finds a target gene in the whole genome sequence. The Message Passing Interface (MPI) was used to perform parallel computing in order to reduce the calculation time. The software will help biologists design more accurate and effective antisense PNA by minimizing the chance of off-target effects.
