• 제목/요약/키워드: Parallel computing model

검색결과 172건 처리시간 0.021초

Development of a flux emergence simulation using parallel computing

  • 이환희;마가라 테츠야
    • 천문학회보
    • /
    • 제44권2호
    • /
    • pp.71.1-71.1
    • /
    • 2019
  • The solar magnetic field comes from the solar interior and is related to various phenomena on the Sun. To understand this process, many studies have been conducted to produce its evolution using a single flux rope. In this study, we are interested in the emergence of two flux ropes and their evolution, which takes longer than the emergence of a single flux rope. To construct it, we develop a flux emergence simulation by applying a parallel computing to reduce a computation time in a wider domain. The original simulation code had been written in Fortran 77. We modify it to a version of Fortran 90 with Message Passing Interface (MPI). The results of the original and new simulation are compared on the NEC SX-Aurora TSUBASA which is a vector engine processor. The parallelized version is faster than running on a single core and it shows a possibility to handle large amounts of calculation. Based on this model, we can construct a complex flux emergence system, such as an evolution of two magnetic flux ropes.

  • PDF

셀룰러 오토마타 기반 WCA2D 모형을 이용한 부산 온천천 유역 고해상도 도시 침수 해석 (High-resolution Urban Flood Modeling using Cellular Automata-based WCA2D in the Oncheon-cheon Catchment in Busan, South Korea)

  • 최현진;이송희;우현아;노성진
    • 대한토목학회논문집
    • /
    • 제43권5호
    • /
    • pp.587-599
    • /
    • 2023
  • 기후변화로 인해 전 세계 주요 도시에서 홍수의 빈도와 위험성이 증가함에 따라, 도시 침수에 대비한 선제적 대응을 위해 넓은 공간 영역에서 고해상도 2차원 침수 정보를 신속하고 정확하게 해석할 수 있는 모의 기술의 중요성이 대두되고 있다. 기존의 천수 방정식(shallow water equations)에 기반한 물리적 해석 방법은 고해상도 침수 예측을 위해 많은 컴퓨터 자원과 계산 시간이 소요되는 한계가 있다. 본 연구는 전환 규칙과 가중치 기반 시스템을 사용하여 침수의 시공간 변화를 모의하는 셀룰러 오토마타(cellular automata) 기반 2차원 침수 해석 모형 Weighted Cellular Automata 2D (WCA2D)의 이론적 배경을 고찰하고, 부산 온천천 유역의 침수 사상 모의를 통해 재현하여 국내 도시 유역에 대한 적용성을 검토하였다. 또한, Open Computing Language (OpenCL)와 Open Multi-Processing (OpenMP)과 같은 병렬계산(parallel computing)기술을 적용한 버전을 순차계산(sequential computing)결과와 비교하여 연산성능을 평가 하였다. 연구결과, WCA2D 모형에 의한 최대 침수심 분포는 과거침수 피해지도와 유사하게 모의되어, 복잡한 지형특성을 가지는 도시유역 침수의 시공간적 변화를 해석하기에 적절함을 확인하였다. 또한,병렬 계산 적용시 순차 계산 버전에 비해 OpenCL과 OpenMP는 약8배~14배, 5배~6배 연산 효율이 향상되어 효율적인 도시 침수 모의가 가능하였다.

Scalable Prediction Models for Airbnb Listing in Spark Big Data Cluster using GPU-accelerated RAPIDS

  • Muralidharan, Samyuktha;Yadav, Savita;Huh, Jungwoo;Lee, Sanghoon;Woo, Jongwook
    • Journal of information and communication convergence engineering
    • /
    • 제20권2호
    • /
    • pp.96-102
    • /
    • 2022
  • We aim to build predictive models for Airbnb's prices using a GPU-accelerated RAPIDS in a big data cluster. The Airbnb Listings datasets are used for the predictive analysis. Several machine-learning algorithms have been adopted to build models that predict the price of Airbnb listings. We compare the results of traditional and big data approaches to machine learning for price prediction and discuss the performance of the models. We built big data models using Databricks Spark Cluster, a distributed parallel computing system. Furthermore, we implemented models using multiple GPUs using RAPIDS in the spark cluster. The model was developed using the XGBoost algorithm, whereas other models were developed using traditional central processing unit (CPU)-based algorithms. This study compared all models in terms of accuracy metrics and computing time. We observed that the XGBoost model with RAPIDS using GPUs had the highest accuracy and computing time.

다중블록 유동해석에서 병렬처리를 위한 시스템의 구조 (A framework for parallel processing in multiblock flow computations)

  • 박상근;이건우
    • 대한기계학회논문집B
    • /
    • 제21권8호
    • /
    • pp.1024-1033
    • /
    • 1997
  • The past several years have witnessed an ever-increasing acceptance and adoption of parallel processing, both for high performance scientific computing as well as for more general purpose applications. Furthermore with increasing needs to perform the complex flow calculations in an efficient manner, the use of the message passing model on distributed networks has emerged as an important alternative to the expensive supercomputers. This work attempts to provide a generic framework to enable the parallelization of all CFD-related works using the master-slave model. This framework consists of (1) input geometry, (2) domain decomposition, (3) grid generation, (4) flow computations, (5) flow visualization, and (6) output display as the sequential components, but performs computations for (2) to (5) in parallel on the workstation clustering. The flow computations are parallized by having multiple copies of the flow-code to solve a PDE on different spatial regions on different processors, while their flow data are exchanged across the region boundaries, and the solution is time-stepped. The Parallel Virtual Machine (PVM) is used for distributed communication in this work.

병렬 유한요소 해석기법을 이용한 유동장 해석 (Parallel Finite Element Simulation of the Incompressible Navier-stokes Equations)

  • 최형권;김범준;강성우;유정열
    • 한국전산유체공학회:학술대회논문집
    • /
    • 한국전산유체공학회 2002년도 춘계 학술대회논문집
    • /
    • pp.8-15
    • /
    • 2002
  • For the large scale computation of turbulent flows around an arbitrarily shaped body, a parallel LES (large eddy simulation) code has been recently developed in which domain decomposition method is adopted. METIS and MPI (message Passing interface) libraries are used for domain partitioning and data communication between processors, respectively. For unsteady computation of the incompressible Wavier-Stokes equation, 4-step splitting finite element algorithm [1] is adopted and Smagorinsky or dynamic LES model can be chosen fur the modeling of small eddies in turbulent flows. For the validation and performance-estimation of the parallel code, a three-dimensional laminar flow generated by natural convection inside a cube has been solved. Then, we have solved the turbulent flow around MIRA (Motor Industry Research Association) model at $Re = 2.6\times10^6$, which is based on the model height and inlet free stream velocity, using 32 processors on IBM SMP cluster and compared with the existing experiment.

  • PDF

고속 네트웍 기반의 분산병렬시스템에서의 성능 향상 분석 모델 (Speedup Analysis Model for High Speed Network based Distributed Parallel Systems)

  • 김화성
    • 한국통신학회논문지
    • /
    • 제26권12C호
    • /
    • pp.218-224
    • /
    • 2001
  • 분산병렬처리의 목적은 다양한 내재 병렬 형태의 특징을 갖는 연산 집약적 문제를 고속 네트웍으로 연결되어진 다수의 고성능 및 병렬 컴퓨터들의 각기 다른 능력을 최대한 이용하여 해결함에 있다. 본 논문에서는 분산병렬시스템을 이용하는 경우의 성능 향상 분석을 위해 일반적인 그래프 표현 방법을 포함하는 계산 모델을 제안하고 프로그램의 수행을 위한 스케쥴링 시에 성능 향상이 어떠한 요인에 의해 달성되는지를 분석한다. 제안된 표현 방법은 동기종 및 이기종 시스템 모두에 적용되어질 수 있다. 분산병렬 시스템에서 스케줄링을 통하여 더 많은 속도향상을 얻기 위해서는 태스크와 병렬 컴퓨터간의 병렬특성의 일치가 주의 질게 다루어져야 하며 태스크의 이동으로 인한 통신 오버 헤드가 최소화 되어야 한다.

  • PDF

그리드 컴퓨팅에서 유효자원 동적 재배치 기반 작업 스케줄링 모델 (Dynamic Available-Resource Reallocation based Job Scheduling Model in Grid Computing)

  • 김재권;이종식
    • 한국시뮬레이션학회논문지
    • /
    • 제21권2호
    • /
    • pp.59-67
    • /
    • 2012
  • 그리드 컴퓨팅은 하나의 대용량 작업을 처리하도록 물리 자원을 구성하고 있지만 최근에는 데이터의 급속한 증가로 인해서 복수개의 작업을 처리하는 방법이 필요하다. 일반적으로 대용량 작업을 요청하면 각 물리 자원들이 작업을 분할하게 되며, 자원의 성능과 거리에 따라 처리 시간이 다르다. 성능에 따라 먼저 완료된 유효자원은 어떠한 작업도 하지 않으며, 모든 작업이 끝났을 경우에 다음 작업을 처리한다. 이에 본 논문에서는 먼저 처리가 완료된 자원을 다른 작업에 할당할 수 있는 동적 자원 재배치 스케줄링 모델(DRRSM: Dynamic Resource Reallocation Scheduling Model)을 제안한다. DRRSM은 먼저 처리가 완료된 자원을 다른 작업에 자원의 성능과 거리에 따라 작업을 재배치시키는 방법이다. DRRSM은 여러 개의 대용량 작업을 처리하는데 효과적이다.

외연적 시간적분법을 이용한 복합재료 섬유 파단 시 음향방출의 3차원 유한요소 해석 (Tree-dimensional FE Analysis of Acoustic Emission of Fiber Breakage using Explicit Time Integration Method)

  • 백승훈;박시형;김승조
    • 한국복합재료학회:학술대회논문집
    • /
    • 한국복합재료학회 2005년도 춘계학술발표대회 논문집
    • /
    • pp.172-175
    • /
    • 2005
  • The numerical simulation is performed for the acoustic emission and the wave propagation due to fiber breakage in single fiber composite plates by the finite element transient analysis. The acoustic emission and the following wave motions from a fiber breakage under a static loading is simulated to investigate the applicability of the explicit finite element method and the equivalent volume force model as a simulation tool of wave propagation and a modeling technique of an acoustic emission. For such a simple case of the damage event under static loading, various parameters affecting the wave motion are investigated for reliable simulations of the impact damage event. The high velocity and the small wave length of the acoustic emission require a refined analysis with dense distribution of the finite element and a small time step. In order to fulfill the requirement for capturing the exact wave propagation and to cover the 3-D simulation, we utilize the parallel FE transient analysis code and the parallel computing technology.

  • PDF

병렬 환경하의 진화 이론을 이용한 결함인식 (Crack Identification Using Evolutionary Algorithms in Parallel Computing Environment)

  • 심문보;서명원
    • 대한기계학회논문집A
    • /
    • 제26권9호
    • /
    • pp.1806-1813
    • /
    • 2002
  • It is well known that a crack has an important effect on the dynamic behavior of a structure. This effect depends mainly on the location and depth of the crack. To identify the location and depth of a crack in a structure, a classical optimization technique was adopted by previous researchers. That technique overcame the difficulty of finding the intersection point of the superposed contours that correspond to the eigenfrequency caused by the crack presence. However, it is hard to select a trial solution initially for optimization because the defined objective function is heavily multimodal. A method is presented in this paper, which uses continuous evolutionary algorithms(CEAs). CEAs are effective for solving inverse problems and implemented on PC clusters to shorten calculation time. With finite element model of the structure to calculate eigenfrequencies, it is possible to formulate the inverse problem in optimization format. CEAs are used to identify the crack location and depth minimizing the difference from the measured frequencies. We have tried this new idea on a simple beam structure and the results are promising with high parallel efficiency over about 94%.