• Title/Abstract/Keywords: speedup

Parallel Preconditioner for the Domain Decomposition Method of the Discretized Navier-Stokes Equation

  • 최형권;유정열;강성우
    • Transactions of the Korean Society of Mechanical Engineers B / Vol. 27, No. 6 / pp. 753-765 / 2003
  • A finite element code for the numerical solution of the Navier-Stokes equation is parallelized by vertex-oriented domain decomposition. To accelerate the convergence of iterative solvers such as the conjugate gradient method, parallel block ILU, iterative block ILU, and distributed ILU methods are tested as parallel preconditioners. The effectiveness of the algorithms is investigated when P1P1 finite element discretization is used for the parallel solution of the Navier-Stokes equation. Two-dimensional and three-dimensional Laplace equations are solved to estimate the speedup of the preconditioners. The calculation domain is partitioned by one- and multi-dimensional partitioning methods on structured grids and by the METIS library on unstructured grids. For the domain-decomposed parallel computation of the Navier-Stokes equation, three-dimensional lid-driven cavity and natural convection problems in a cube are solved as benchmark problems using a parallelized fractional 4-step finite element method. The speedup of each parallel preconditioning method is compared using up to 64 processors.
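
The idea of a domain-decomposed preconditioner for conjugate gradient can be sketched on a much smaller problem. The example below is only an illustration under stated assumptions, not the paper's code: it uses a 1D Laplace matrix instead of the 2D/3D cases, runs serially, and swaps in a block Jacobi preconditioner with exact subdomain solves in place of the block ILU variants named in the abstract. All function names are hypothetical.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def laplace_matvec(x):
    """y = A x for the 1D Laplace matrix tridiag(-1, 2, -1)."""
    n = len(x)
    return [
        2.0 * x[i]
        - (x[i - 1] if i > 0 else 0.0)
        - (x[i + 1] if i < n - 1 else 0.0)
        for i in range(n)
    ]


def solve_laplace_block(r):
    """Thomas algorithm for tridiag(-1, 2, -1) z = r on one subdomain."""
    n = len(r)
    c = [0.0] * n  # modified superdiagonal
    d = [0.0] * n  # modified right-hand side
    c[0] = -0.5
    d[0] = r[0] / 2.0
    for i in range(1, n):
        denom = 2.0 + c[i - 1]
        c[i] = -1.0 / denom
        d[i] = (r[i] + d[i - 1]) / denom
    z = [0.0] * n
    z[-1] = d[-1]
    for i in range(n - 2, -1, -1):
        z[i] = d[i] - c[i] * z[i + 1]
    return z


def block_jacobi(r, blocks):
    """Apply M^{-1} r, where M keeps only the diagonal blocks of A.

    Each contiguous block plays the role of one subdomain; the block
    solves are independent, which is what makes the idea parallel.
    """
    size = -(-len(r) // blocks)  # ceiling division
    z = []
    for start in range(0, len(r), size):
        z.extend(solve_laplace_block(r[start:start + size]))
    return z


def pcg(b, precond, tol=1e-10, max_iter=1000):
    """Preconditioned conjugate gradient; returns (solution, iterations)."""
    x = [0.0] * len(b)
    r = list(b)
    z = precond(r)
    p = list(z)
    rz = dot(r, z)
    b_norm = dot(b, b) ** 0.5 or 1.0
    for it in range(1, max_iter + 1):
        ap = laplace_matvec(p)
        alpha = rz / dot(p, ap)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * api for ri, api in zip(r, ap)]
        if dot(r, r) ** 0.5 <= tol * b_norm:
            return x, it
        z = precond(r)
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x, max_iter


b = [1.0] * 64
_, plain_iters = pcg(b, lambda r: list(r))             # no preconditioner
_, block_iters = pcg(b, lambda r: block_jacobi(r, 4))  # 4 subdomains
print(plain_iters, block_iters)
```

Comparing the two iteration counts gives a feel for why the choice of preconditioner matters before any parallel timing is measured; the speedup figures in the abstract additionally depend on communication cost, which this serial sketch does not model.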

A Parallel Algorithm for Merging Relaxed Min-Max Heaps

  • 민용식
    • The Transactions of the Korea Information Processing Society / Vol. 5, No. 5 / pp. 1162-1171 / 1998
  • This paper presents an improved relaxed min-max-pair heap, a new data structure for merging relaxed min-max heaps, together with a parallel algorithm for merging priority queues organized as two relaxed min-max heaps: a relaxed min-max nheap of size n and a relaxed min-max kheap of size k. The new method merges relaxed min-max heaps even after removing the blossomed tree and lazying techniques that the method of [9] uses for merging. As a result, with $\max(2^{i-1}, [(m+1/4)])$ processors, the proposed method has time complexity $O(\log(\log(n/k)) \times \log(k))$. In an experiment on a MasPar machine with 64 processors, merging 8 million data items organized as two relaxed min-max heaps of different sizes yielded a speedup of 35.205.
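
A reported speedup is easier to interpret alongside parallel efficiency, the speedup divided by the processor count. The short sketch below applies the standard definitions $S = T_1 / T_p$ and $E = S / p$ to the figures quoted in the abstract (speedup 35.205 on 64 processors); the function names are hypothetical and the underlying run times are not given in the source.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Speedup S = T_1 / T_p."""
    if t_parallel <= 0:
        raise ValueError("t_parallel must be positive")
    return t_serial / t_parallel


def efficiency(s: float, processors: int) -> float:
    """Parallel efficiency E = S / p."""
    if processors <= 0:
        raise ValueError("processors must be positive")
    return s / processors


# Figures quoted in the abstract: speedup 35.205 on 64 processors.
print(f"{efficiency(35.205, 64):.3f}")  # prints 0.550
```

An efficiency of about 0.55 means that, on average, each of the 64 processors contributed a little over half of what perfect linear scaling would give.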

Model Composition Methodology for High Speed Simulation

  • 이완복
    • The Journal of the Korea Contents Association / Vol. 6, No. 11 / pp. 258-265 / 2006
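  • Because the DEVS formalism can represent the structure of a discrete-event system hierarchically, it is well suited to modeling complex systems, and its readability makes models easier to maintain. However, the hierarchical model structure causes frequent message passing at simulation run time, which slows the simulation. This paper proposes a model composition method that merges hierarchical DEVS models into a single model to avoid the frequent message passing and improve simulation execution speed. The proposed method analyzes the message-passing relationships among the component models before generating the simulator's executable code, so that the model analysis step is skipped at run time; it achieves a much greater speed improvement than the existing model flattening technique. Experiments demonstrate the method's effectiveness, showing that simulation speed can be improved by about 18 times.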

Fault-tolerant Scheduling of Real-time Parallel Tasks with Energy Efficiency on Multicore Processors

  • 이관우
    • KIPS Transactions on Computer and Communication Systems / Vol. 3, No. 6 / pp. 173-178 / 2014
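  • The proposed scheduling scheme uses parallel processing to improve the energy efficiency of multicore processors while satisfying the deadline and fault-tolerance constraints of real-time tasks. Because finding the minimum-energy schedule is an NP-hard problem, the proposed scheme finds, in polynomial time, a schedule whose energy consumption is close to the minimum. Compared with the related state-of-the-art scheme, the proposed scheme consumed markedly less energy at both high and low parallel processing speeds, reducing energy consumption by up to 86%.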

A lumped parameter method of characteristics approach and multigroup kernels applied to the subgroup self-shielding calculation in MPACT

  • Stimpson, Shane;Liu, Yuxuan;Collins, Benjamin;Clarno, Kevin
    • Nuclear Engineering and Technology / Vol. 49, No. 6 / pp. 1240-1249 / 2017
  • An essential component of the neutron transport solver is the resonance self-shielding calculation used to determine equivalence cross sections. The neutron transport code, MPACT, currently uses the subgroup self-shielding method, in which the method of characteristics (MOC) is used to solve purely absorbing fixed-source problems. Recent efforts incorporating multigroup kernels into the MOC solvers in MPACT have reduced runtime by roughly $2\times$. By applying the same concepts to self-shielding and developing a novel lumped parameter approach to MOC, substantial improvements have also been made to the self-shielding computational efficiency without sacrificing any accuracy. These new multigroup and lumped parameter capabilities have been demonstrated on two test cases: (1) a single lattice with quarter symmetry known as VERA (Virtual Environment for Reactor Applications) Progression Problem 2a and (2) a two-dimensional quarter-core slice known as Problem 5a-2D. In these cases, self-shielding computational time was reduced by roughly $3$-$4\times$, with a corresponding 15-20% increase in overall memory burden. An azimuthal angle sensitivity study also shows that only half as many angles are needed, yielding an additional speedup of $2\times$. In total, the improvements yield roughly a $7$-$8\times$ speedup. Given these performance benefits, these approaches have been adopted as the default in MPACT.

Quantum-based exact pattern matching algorithms for biological sequences

  • Soni, Kapil Kumar;Rasool, Akhtar
    • ETRI Journal / Vol. 43, No. 3 / pp. 483-510 / 2021
  • In computational biology, desired patterns are searched in large text databases, and an exact match is preferable. Classical benchmark algorithms obtain competent solutions for pattern matching in $O(N)$ time, whereas quantum algorithm design is based on Grover's method, which completes the search in $O(\sqrt{N})$ time. This paper briefly explains existing quantum algorithms and defines their processing limitations. Our initial work overcomes existing algorithmic constraints by proposing the quantum-based combined exact (QBCE) algorithm for the pattern-matching problem to process exact patterns. Next, quantum random access memory (QRAM) processing is discussed, and based on it, we propose the QRAM processing-based exact (QPBE) pattern-matching algorithm. We show that to find all t occurrences of a pattern, the best case time complexities of the QBCE and QPBE algorithms are $O(\sqrt{t})$ and $O(\sqrt{N})$, and the exceptional worst case is bounded by $O(t)$ and $O(N)$. Thus, the proposed quantum algorithms achieve computational speedup. Our work is proved mathematically and validated with simulation, and complexity analysis demonstrates that our quantum algorithms are better than existing pattern-matching methods.
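
The gap between $O(N)$ and $O(\sqrt{N})$ can be made concrete by counting queries. The sketch below does not implement the QBCE or QPBE algorithms; it only compares the worst-case cost of a classical linear scan with the well-known approximate optimal number of Grover iterations, $\lfloor (\pi/4)\sqrt{N/t} \rfloor$ for $t$ marked items among $N$. The function names are hypothetical.

```python
import math


def classical_queries(n: int) -> int:
    """Worst-case number of positions a linear scan inspects."""
    return n


def grover_iterations(n: int, t: int = 1) -> int:
    """Approximate optimal Grover iteration count for t marked items in n."""
    if n <= 0 or not 0 < t <= n:
        raise ValueError("need n > 0 and 0 < t <= n")
    return max(1, math.floor((math.pi / 4) * math.sqrt(n / t)))


n = 1_000_000
print(classical_queries(n))     # prints 1000000
print(grover_iterations(n))     # prints 785
print(grover_iterations(n, 4))  # prints 392
```

The counts are oracle calls, not wall-clock time, so they indicate asymptotic advantage rather than a measured speedup.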

Vector Algorithm for RC Shell Element Stiffness Matrix

  • 민창식
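    • Computational Structural Engineering Institute of Korea: Conference Proceedings / Proceedings of the Computational Structural Engineering Institute of Korea 1994 Fall Conference / pp. 25-30 / 1994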
  • A vector algorithm for calculating the stiffness matrices of reinforced concrete shell elements is presented. The algorithm is based on establishing vector lengths equal to the number of elements. The computational efficiency of the proposed algorithm is assessed on a Cray Y-MP supercomputer. It is shown that the vector algorithm achieves scalar-to-vector speedup of 1.7 to 7.6 on three inelastic problems.

A 2D-FFT algorithm on mesh connected multiprocessor systems

  • Kunieda, Hiroaki;Itoh, Kazuhito
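    • Institute of Control, Robotics and Systems: Conference Proceedings / Proceedings of the 1987 Korean Automatic Control Conference (Korea-Japan Joint Session); Korea Institute of Technology, Chungnam; 16-17 Oct. 1987 / pp. 851-856 / 1987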
  • A direct computation algorithm for the two-dimensional fast Fourier transform (2D-FFT) is considered for implementation on mesh-connected multiprocessor arrays of both the 2D-toroidal and the rectangular type. Results are derived for a hardware algorithm, including data allocation and interprocessor communications. A performance comparison between the proposed direct 2D-FFT computation and the conventional one shows that the new algorithm gives higher speedup under a reasonable assumption on the speeds of operations.
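
For orientation, the conventional approach to a 2D transform is usually the row-column method: transform every row, then every column. The sketch below illustrates that baseline only, assuming this is what the abstract means by "the conventional one"; it is not the paper's direct algorithm, and for brevity it uses a plain $O(n^2)$ DFT instead of an FFT. `dft2_definition` evaluates the 2D DFT from its definition purely as a correctness check. All names are hypothetical.

```python
import cmath


def dft(x):
    """1D discrete Fourier transform by direct summation."""
    n = len(x)
    return [
        sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]


def dft2_row_column(a):
    """2D DFT via the row-column method."""
    rows = [dft(row) for row in a]                 # transform each row
    cols = [dft(list(col)) for col in zip(*rows)]  # transform each column
    return [list(row) for row in zip(*cols)]       # transpose back


def dft2_definition(a):
    """2D DFT straight from the definition (reference check only)."""
    m, n = len(a), len(a[0])
    return [
        [
            sum(
                a[r][c] * cmath.exp(-2j * cmath.pi * (u * r / m + v * c / n))
                for r in range(m)
                for c in range(n)
            )
            for v in range(n)
        ]
        for u in range(m)
    ]


a = [[1, 2], [3, 4]]
f = dft2_row_column(a)
g = dft2_definition(a)
print(all(abs(f[u][v] - g[u][v]) < 1e-9 for u in range(2) for v in range(2)))
```

On a processor mesh, the step between the row pass and the column pass is where data must move between processors, which is why data allocation and interprocessor communication dominate the comparison described in the abstract.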

Vector algorithm for reinforced concrete shell element stiffness matrix

  • Min, Chang Shik;Gupta, Ajaya Kumar
    • Structural Engineering and Mechanics / Vol. 2, No. 2 / pp. 125-139 / 1994
  • A vector algorithm for calculating the stiffness matrices of reinforced concrete shell elements is presented. The algorithm is based on establishing vector lengths equal to the number of elements. The computational efficiency of the proposed algorithm is assessed on a Cray Y-MP supercomputer. It is shown that the vector algorithm achieves a scalar-to-vector speedup of 1.7 to 7.6 on three moderate-sized inelastic problems.

Korean Railway Modernization

  • 김현웅
    • The Korean Society for Railway: Conference Proceedings / Proceedings of the Korean Society for Railway 2005 Spring Conference / pp. 1101-1104 / 2005
  • The overall objectives for the Korean railway system of the future are speedup, capacity management, increased safety, environmental protection, and enhanced efficiency. To achieve these, interconnection within the railway system must be optimised and designed on a customer-friendly basis. This principle applies to both passenger and freight traffic, for which cost-effective and time-efficient facilities must be provided. As a result, Korean railway network modernization is based on seeking out optimum conditions for improving the railway business.
