• Title/Summary/Keyword: Domain decomposition and MPI

Search Result 31, Processing Time 0.024 seconds

Parallel Implementation of SIMPLER by Using Domain Decomposition Technique (영역분할법에 의한 SIMPLER 기법의 병렬화)

  • Kwak Ho Sang
    • 한국전산유체공학회:학술대회논문집
    • /
    • 1997.10a
    • /
    • pp.23-28
    • /
    • 1997
  • A parallel implementation is made of a two-dimensional finite volume model based on the SIMPLER. The solution domain is decomposed into several subdomains and the solution at each subdomain is acquired by parallel use of multiple processors. Communications between processors are accomplished by using the standard MPI and the Cray-specific SHMEM. The parallelization method for the overall solution procedure to the Navier-Stokes equations is described in detail, The parallel implementation is validated on the Cray T3E system for a benchmark problem of natural convection in a sidewall-heated cavity. The parallel performance is assessed and the issues encountered in achieving a high-performance parallel model are elaborated.

  • PDF

Parallelization and application of SACOS for whole core thermal-hydraulic analysis

  • Gui, Minyang;Tian, Wenxi;Wu, Di;Chen, Ronghua;Wang, Mingjun;Su, G.H.
    • Nuclear Engineering and Technology
    • /
    • v.53 no.12
    • /
    • pp.3902-3909
    • /
    • 2021
  • SACOS series of subchannel analysis codes have been developed by XJTU-NuTheL for many years and are being used for the thermal-hydraulic safety analysis of various reactor cores. To achieve fine whole core pin-level analysis, the input preprocessing and parallel capabilities of the code have been developed in this study. Preprocessing is suitable for modeling rectangular and hexagonal assemblies with less error-prone input; parallelization is established based on the domain decomposition method with the hybrid of MPI and OpenMP. For domain decomposition, a more flexible method has been proposed which can determine the appropriate task division of the core domain according to the number of processors of the server. By performing the calculation time evaluation for the several PWR assembly problems, the code parallelization has been successfully verified with different number of processors. Subsequent analysis results for rectangular- and hexagonal-assembly core imply that the code can be used to model and perform pin-level core safety analysis with acceptable computational efficiency.

The Optimum Design of Airfoil Shape with Parallel Computation (병렬연산을 이용한 익형의 최적 설계)

  • Jo,Jang-Geun;Park,Won-Gyu
    • Journal of the Korean Society for Aeronautical & Space Sciences
    • /
    • v.31 no.1
    • /
    • pp.1-7
    • /
    • 2003
  • The aerodynamic optimization method for airfoil design was described in this paper. The Navier-Stokes equations were solved to consider the viscous flow information around an airfoil. The Modified Method of Feasible Direction(MMFD) was used for sensitivity analysis and the polynomial interpolation was used for distance calculation of the minimization. The Message Passing Interface(MPI) library of parallel computation was adopted to reduce the computation time of flow solver by decomposing the entire computational domain into 8 sub-domains and one-to-one allocating 8 processors to 8 sub-domains. The parallel computation was also used to compute the sensitivity analysis by allocating each search direction to each processor. The present optimization reduced the drag of airfoil while the lift is maintained at the tolerable design value.

Finite Element Analysis of Shape Rolling Process using Destributive Parallel Algorithms on Cray T3E (병렬 컴퓨터를 이용한 형상 압연공정 유한요소 해석의 분산병렬처리에 관한 연구)

  • Gwon, Gi-Chan;Yun, Seong-Gi
    • Transactions of the Korean Society of Mechanical Engineers A
    • /
    • v.24 no.5 s.176
    • /
    • pp.1215-1230
    • /
    • 2000
  • Parallel Approaches using Cray T3E which is NIPP (Massively Parallel Processors) machine are presented for the efficient computation of the finite element analysis of 3-D shape rolling processes. D omain decomposition method coupled with parallel linear equation solver is used. Domain decomposition is applied for obtaining element tangent stifffiess matrices and residual vectors. Direct and iterative parallel algorithms are used for solving the linear equations. Direct algorithm is_parallel version of direct banded matrix solver. For iterative algorithms, the well-known preconditioned conjugate gradient solver with Jacobi preconditioner is also employed. Moreover a new effective iterative scheme with block inverse matrix preconditioner, which is named by present authors, is presented and its results are compared with the one using Jacobi preconditioner. PVM and MPI are used for message passing and synchronization between processors. The performance and efficiency of each algorithm is discussed and comparisons are made among different algorithms.

Numerical Simulation of Natural Convection in Annuli with Internal Fins

  • Ha, Man-Yeong;Kim, Joo-Goo
    • Journal of Mechanical Science and Technology
    • /
    • v.18 no.4
    • /
    • pp.718-730
    • /
    • 2004
  • The solution for the natural convection in internally finned horizontal annuli is obtained by using a numerical simulation of time-dependent and two-dimensional governing equations. The fins existing in annuli influence the flow pattern, temperature distribution and heat transfer rate. The variations of the On configuration suppress or accelerate the free convective effects compared to those of the smooth tubes. The effects of fin configuration, number of fins and ratio of annulus gap width to the inner cylinder radius on the fluid flow and heat transfer in annuli are demonstrated by the distribution of the velocity vector, isotherms and streamlines. The governing equations are solved efficiently by using a parallel implementation. The technique is adopted for reduction of the computation cost. The parallelization is performed with the domain decomposition technique and message passing between sub-domains on the basis of the MPI library. The results from parallel computation reveal in consistency with those of the sequential program. Moreover, the speed-up ratio shows linearity with the number of processor.

The effects of axial spacing on the unsteady secondary and performance in one-stage axial turbine (1단 터빈에서 축간격 변화가 비정상 이차유동 및 성능에 미치는 영향)

  • Park Junyoung;Baek JeHyun
    • Proceedings of the KSME Conference
    • /
    • 2002.08a
    • /
    • pp.537-540
    • /
    • 2002
  • Flow through turbomachinery has a very complex structure and is intrinsically unsteady. Especially, recent design trend to turbomachinery with short axial spacing makes the flow extremely complex due to the interaction between stator and rotor. Therefore, it is very necessary to clearly understand the complex flow structure to obtain the high efficiency turbomachinery. So, in this paper, the effects of axial spacing on the unsteady secondary flow performance in the one stage turbine are investigated by three-dimensional unsteady flow analysis. The three-dimensional solver is parallelized using domain decomposition and Message Passing Interface(MPI) standard to overcome the limitation of memory and the CPU time in three-dimensional unsteady calculation. A sliding mesh interface approach has been implemented to exchange flow information between blade rows.

  • PDF

비점성 압축성 코드의 병렬화 기법에 의한 슈퍼컴퓨터 CRAY T3E의 성능 분석

  • Go Deok-Gon
    • 한국전산유체공학회:학술대회논문집
    • /
    • 1997.10a
    • /
    • pp.17-22
    • /
    • 1997
  • The performances of the CRAYT3E and CRAYC90 were compared in the point of aerodynamics. The CRAYC90 with and without the highest vector option was run, respectively. The CRAYT3E was run with various processors (from 1pe to 32pes). The communication utilities of MPI and SHMEM were used to inform the boundary data to the other processors. The DADI Euler solver, which is implicit scheme and use central difference method, was used. The domain decomposition method was also used. As the result, the CRAYC90 with the highest vector option is 5.7 times faster than the CRAYT3E with 1 processor. However, because of the scalability of the CRAYT3E, the CRAYT3E with more than 6 processors is faster than CRAYC90. In case that 32 processors used, the CRAYT3E is 6 times faster than CRAYC90 with the highest vector option.

  • PDF

Computation of Compressor Flows Using Parallel Implementation of Preconditioning Method (예조건화 기법의 병렬화를 이용한 압축기 유동해석)

  • Lee Gee-Soo;Choi Jeong-Yeol;Kim Kui-Soon
    • 한국전산유체공학회:학술대회논문집
    • /
    • 2000.10a
    • /
    • pp.155-162
    • /
    • 2000
  • In this paper, preconditioning method is parallelized on fast-ethernet PC cluster. The algorithm is based on scaling the pressure terms in the momemtum equations and preconditioning the conservation equations to circumvent numerical difficulties at low Mach numbers. Parallelization is performed using a domain decomposition technique(DDT) and message passing between sub-domains are taken from the MPI library. The results are shown to have good convergence properties at all Mach number on the circular arc Bump and are capable of reasonable predicting two-dimensional turbulent flows on DCA compressor cascade.

  • PDF

Numerical Study on the Drag of a Car Model under Road Condition (주행조건에서의 자동차 모델 항력에 대한 수치해석적 연구)

  • Kim, Beom-Jun;Kang, Sung-Woo;Choi, Hyoung-gwon;Yoo, Jung-Yul
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.27 no.8
    • /
    • pp.1182-1190
    • /
    • 2003
  • A parallelized FEM code based on domain decomposition method has been recently developed for large-scale computational fluid dynamics. A 4-step splitting finite element algorithm is adopted for unsteady flow computation of the incompressible Navier-Stokes equation, and Smagorinsky LES model is chosen for turbulent flow computation. Both METIS and MPI Libraries are used for domain partitioning and data communication between processors, respectively. Tiburon model of Hyundai Motor Company is chosen as the computational model at Re=7.5 $\times$ 10$^{5}$ , which is based on the car height. The calculation is carried out under both the wind tunnel condition and the road condition using IBM SP parallel architecture at KISTI Super Computing Center. Compared with the existing experimental data, both the velocity and pressure fields are predicted reasonably well and the drag coefficient is in good agreement. Furthermore, it is confirmed that the drag under the road condition is smaller than that under the wind-tunnel condition.

Parallel Finite Element Simulation of the Incompressible Navier-stokes Equations (병렬 유한요소 해석기법을 이용한 유동장 해석)

  • Choi H. G.;Kim B. J.;Kang S. W.;Yoo J. Y.
    • 한국전산유체공학회:학술대회논문집
    • /
    • 2002.05a
    • /
    • pp.8-15
    • /
    • 2002
  • For the large scale computation of turbulent flows around an arbitrarily shaped body, a parallel LES (large eddy simulation) code has been recently developed in which domain decomposition method is adopted. METIS and MPI (message Passing interface) libraries are used for domain partitioning and data communication between processors, respectively. For unsteady computation of the incompressible Wavier-Stokes equation, 4-step splitting finite element algorithm [1] is adopted and Smagorinsky or dynamic LES model can be chosen fur the modeling of small eddies in turbulent flows. For the validation and performance-estimation of the parallel code, a three-dimensional laminar flow generated by natural convection inside a cube has been solved. Then, we have solved the turbulent flow around MIRA (Motor Industry Research Association) model at $Re = 2.6\times10^6$, which is based on the model height and inlet free stream velocity, using 32 processors on IBM SMP cluster and compared with the existing experiment.

  • PDF