• 제목/요약/키워드: Domain decomposition and MPI

검색결과 31건 처리시간 0.024초

영역분할법에 의한 SIMPLER 기법의 병렬화 (Parallel Implementation of SIMPLER by Using Domain Decomposition Technique)

  • 곽호상
    • 한국전산유체공학회:학술대회논문집
    • /
    • 한국전산유체공학회 1997년도 추계 학술대회논문집
    • /
    • pp.23-28
    • /
    • 1997
  • A parallel implementation is made of a two-dimensional finite volume model based on the SIMPLER. The solution domain is decomposed into several subdomains and the solution at each subdomain is acquired by parallel use of multiple processors. Communications between processors are accomplished by using the standard MPI and the Cray-specific SHMEM. The parallelization method for the overall solution procedure to the Navier-Stokes equations is described in detail, The parallel implementation is validated on the Cray T3E system for a benchmark problem of natural convection in a sidewall-heated cavity. The parallel performance is assessed and the issues encountered in achieving a high-performance parallel model are elaborated.

  • PDF

Parallelization and application of SACOS for whole core thermal-hydraulic analysis

  • Gui, Minyang;Tian, Wenxi;Wu, Di;Chen, Ronghua;Wang, Mingjun;Su, G.H.
    • Nuclear Engineering and Technology
    • /
    • 제53권12호
    • /
    • pp.3902-3909
    • /
    • 2021
  • SACOS series of subchannel analysis codes have been developed by XJTU-NuTheL for many years and are being used for the thermal-hydraulic safety analysis of various reactor cores. To achieve fine whole core pin-level analysis, the input preprocessing and parallel capabilities of the code have been developed in this study. Preprocessing is suitable for modeling rectangular and hexagonal assemblies with less error-prone input; parallelization is established based on the domain decomposition method with the hybrid of MPI and OpenMP. For domain decomposition, a more flexible method has been proposed which can determine the appropriate task division of the core domain according to the number of processors of the server. By performing the calculation time evaluation for the several PWR assembly problems, the code parallelization has been successfully verified with different number of processors. Subsequent analysis results for rectangular- and hexagonal-assembly core imply that the code can be used to model and perform pin-level core safety analysis with acceptable computational efficiency.

병렬연산을 이용한 익형의 최적 설계 (The Optimum Design of Airfoil Shape with Parallel Computation)

  • 조장근;박원규
    • 한국항공우주학회지
    • /
    • 제31권1호
    • /
    • pp.1-7
    • /
    • 2003
  • 익형에 대한 공력 최적설계 프로그램을 개발하였으며, 점성 유동장에 대한 보다 정확한 정보를 설계에 반영하기 위하여 나비어-스톡스 방정식을 사용하였다. 최적설계 방법으로는 민감도 해석을 위하여 수정유용방향탐색방법(Modified Method of Feasible Directions, MMFD)을 사용하였으며 이동거리 계산을 위하여 다항식 보간법을 사용하였다. 또한 설계시간을 단축하기 위하여 MPI를 사용하여 병렬화하였다. 전체 유동장을 8개의 영역으로 분할하였으며 분할된 영역은 지정된 프로세서에 할당하여 계산을 수행하였다. 민감도 계산을 위하여 각 프로세서에 할당하여 계산을 수행하였다. 민감도 계산을 위하여 각 프로세서에 각 탐색방향을 할당하여 민감도를 병렬계산하였다. 본 연구의 수행 결과 양력은 허용한도 내의 일정한 값을 유지하는 가운데 항력이 감소된 최적화된 익형의 형상을 설계할 수 있었다.

병렬 컴퓨터를 이용한 형상 압연공정 유한요소 해석의 분산병렬처리에 관한 연구 (Finite Element Analysis of Shape Rolling Process using Destributive Parallel Algorithms on Cray T3E)

  • 권기찬;윤성기
    • 대한기계학회논문집A
    • /
    • 제24권5호
    • /
    • pp.1215-1230
    • /
    • 2000
  • Parallel Approaches using Cray T3E which is NIPP (Massively Parallel Processors) machine are presented for the efficient computation of the finite element analysis of 3-D shape rolling processes. D omain decomposition method coupled with parallel linear equation solver is used. Domain decomposition is applied for obtaining element tangent stifffiess matrices and residual vectors. Direct and iterative parallel algorithms are used for solving the linear equations. Direct algorithm is_parallel version of direct banded matrix solver. For iterative algorithms, the well-known preconditioned conjugate gradient solver with Jacobi preconditioner is also employed. Moreover a new effective iterative scheme with block inverse matrix preconditioner, which is named by present authors, is presented and its results are compared with the one using Jacobi preconditioner. PVM and MPI are used for message passing and synchronization between processors. The performance and efficiency of each algorithm is discussed and comparisons are made among different algorithms.

Numerical Simulation of Natural Convection in Annuli with Internal Fins

  • Ha, Man-Yeong;Kim, Joo-Goo
    • Journal of Mechanical Science and Technology
    • /
    • 제18권4호
    • /
    • pp.718-730
    • /
    • 2004
  • The solution for the natural convection in internally finned horizontal annuli is obtained by using a numerical simulation of time-dependent and two-dimensional governing equations. The fins existing in annuli influence the flow pattern, temperature distribution and heat transfer rate. The variations of the On configuration suppress or accelerate the free convective effects compared to those of the smooth tubes. The effects of fin configuration, number of fins and ratio of annulus gap width to the inner cylinder radius on the fluid flow and heat transfer in annuli are demonstrated by the distribution of the velocity vector, isotherms and streamlines. The governing equations are solved efficiently by using a parallel implementation. The technique is adopted for reduction of the computation cost. The parallelization is performed with the domain decomposition technique and message passing between sub-domains on the basis of the MPI library. The results from parallel computation reveal in consistency with those of the sequential program. Moreover, the speed-up ratio shows linearity with the number of processor.

1단 터빈에서 축간격 변화가 비정상 이차유동 및 성능에 미치는 영향 (The effects of axial spacing on the unsteady secondary and performance in one-stage axial turbine)

  • 박준영;백제현
    • 대한기계학회:학술대회논문집
    • /
    • 대한기계학회 2002년도 학술대회지
    • /
    • pp.537-540
    • /
    • 2002
  • Flow through turbomachinery has a very complex structure and is intrinsically unsteady. Especially, recent design trend to turbomachinery with short axial spacing makes the flow extremely complex due to the interaction between stator and rotor. Therefore, it is very necessary to clearly understand the complex flow structure to obtain the high efficiency turbomachinery. So, in this paper, the effects of axial spacing on the unsteady secondary flow performance in the one stage turbine are investigated by three-dimensional unsteady flow analysis. The three-dimensional solver is parallelized using domain decomposition and Message Passing Interface(MPI) standard to overcome the limitation of memory and the CPU time in three-dimensional unsteady calculation. A sliding mesh interface approach has been implemented to exchange flow information between blade rows.

  • PDF

비점성 압축성 코드의 병렬화 기법에 의한 슈퍼컴퓨터 CRAY T3E의 성능 분석

  • 고덕곤
    • 한국전산유체공학회:학술대회논문집
    • /
    • 한국전산유체공학회 1997년도 추계 학술대회논문집
    • /
    • pp.17-22
    • /
    • 1997
  • The performances of the CRAYT3E and CRAYC90 were compared in the point of aerodynamics. The CRAYC90 with and without the highest vector option was run, respectively. The CRAYT3E was run with various processors (from 1pe to 32pes). The communication utilities of MPI and SHMEM were used to inform the boundary data to the other processors. The DADI Euler solver, which is implicit scheme and use central difference method, was used. The domain decomposition method was also used. As the result, the CRAYC90 with the highest vector option is 5.7 times faster than the CRAYT3E with 1 processor. However, because of the scalability of the CRAYT3E, the CRAYT3E with more than 6 processors is faster than CRAYC90. In case that 32 processors used, the CRAYT3E is 6 times faster than CRAYC90 with the highest vector option.

  • PDF

예조건화 기법의 병렬화를 이용한 압축기 유동해석 (Computation of Compressor Flows Using Parallel Implementation of Preconditioning Method)

  • 이기수;최정열;김귀순
    • 한국전산유체공학회:학술대회논문집
    • /
    • 한국전산유체공학회 2000년도 추계 학술대회논문집
    • /
    • pp.155-162
    • /
    • 2000
  • In this paper, preconditioning method is parallelized on fast-ethernet PC cluster. The algorithm is based on scaling the pressure terms in the momemtum equations and preconditioning the conservation equations to circumvent numerical difficulties at low Mach numbers. Parallelization is performed using a domain decomposition technique(DDT) and message passing between sub-domains are taken from the MPI library. The results are shown to have good convergence properties at all Mach number on the circular arc Bump and are capable of reasonable predicting two-dimensional turbulent flows on DCA compressor cascade.

  • PDF

주행조건에서의 자동차 모델 항력에 대한 수치해석적 연구 (Numerical Study on the Drag of a Car Model under Road Condition)

  • 김범준;강성우;최형권;유정열
    • 대한기계학회논문집B
    • /
    • 제27권8호
    • /
    • pp.1182-1190
    • /
    • 2003
  • A parallelized FEM code based on domain decomposition method has been recently developed for large-scale computational fluid dynamics. A 4-step splitting finite element algorithm is adopted for unsteady flow computation of the incompressible Navier-Stokes equation, and Smagorinsky LES model is chosen for turbulent flow computation. Both METIS and MPI Libraries are used for domain partitioning and data communication between processors, respectively. Tiburon model of Hyundai Motor Company is chosen as the computational model at Re=7.5 $\times$ 10$^{5}$ , which is based on the car height. The calculation is carried out under both the wind tunnel condition and the road condition using IBM SP parallel architecture at KISTI Super Computing Center. Compared with the existing experimental data, both the velocity and pressure fields are predicted reasonably well and the drag coefficient is in good agreement. Furthermore, it is confirmed that the drag under the road condition is smaller than that under the wind-tunnel condition.

병렬 유한요소 해석기법을 이용한 유동장 해석 (Parallel Finite Element Simulation of the Incompressible Navier-stokes Equations)

  • 최형권;김범준;강성우;유정열
    • 한국전산유체공학회:학술대회논문집
    • /
    • 한국전산유체공학회 2002년도 춘계 학술대회논문집
    • /
    • pp.8-15
    • /
    • 2002
  • For the large scale computation of turbulent flows around an arbitrarily shaped body, a parallel LES (large eddy simulation) code has been recently developed in which domain decomposition method is adopted. METIS and MPI (message Passing interface) libraries are used for domain partitioning and data communication between processors, respectively. For unsteady computation of the incompressible Wavier-Stokes equation, 4-step splitting finite element algorithm [1] is adopted and Smagorinsky or dynamic LES model can be chosen fur the modeling of small eddies in turbulent flows. For the validation and performance-estimation of the parallel code, a three-dimensional laminar flow generated by natural convection inside a cube has been solved. Then, we have solved the turbulent flow around MIRA (Motor Industry Research Association) model at $Re = 2.6\times10^6$, which is based on the model height and inlet free stream velocity, using 32 processors on IBM SMP cluster and compared with the existing experiment.

  • PDF