• Title/Summary/Keyword: Parallel Efficiency

Implementation and Performance Analysis of a Parallel SIMPLER Model Based on Domain Decomposition (영역 분할에 의한 SIMPLER 모델의 병렬화와 성능 분석)

  • Kwak Ho Sang;Lee Sangsan
    • Journal of computational fluids engineering / v.3 no.1 / pp.22-29 / 1998
  • A SIMPLER finite volume model is parallelized. The parallelization is based on domain decomposition and explicit message passing using MPI and SHMEM. Two parallel solvers for the tridiagonal matrix equations are employed. The implementation is verified on the Cray T3E system with a benchmark problem of natural convection in a sidewall-heated cavity. The test results show good scalability of the parallel models. Performance issues are discussed in terms of convergence as well as conventional parallel overheads and single-processor performance. The effectiveness of a localized matrix solution algorithm is demonstrated.

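The scalability and parallel-overhead discussion in the abstract above rests on the standard notions of speedup and parallel efficiency. A minimal sketch in Python, assuming the usual definitions (speedup = T(1)/T(p), efficiency = speedup/p). The function name and the timing values are hypothetical and are not taken from the paper:

```python
def parallel_efficiency(t_serial, t_parallel, n_procs):
    """Return (speedup, efficiency) for a run on n_procs processors.

    speedup    = t_serial / t_parallel
    efficiency = speedup / n_procs   (1.0 means ideal scaling)
    """
    speedup = t_serial / t_parallel
    return speedup, speedup / n_procs


if __name__ == "__main__":
    # Hypothetical wall-clock times in seconds, keyed by processor count.
    timings = {1: 100.0, 2: 52.0, 4: 27.0, 8: 15.0}
    for p, t in timings.items():
        s, e = parallel_efficiency(timings[1], t, p)
        print(f"P={p:2d}  speedup={s:5.2f}  efficiency={e:5.1%}")
```

An efficiency that falls as the processor count grows reflects the combined effect of communication overhead, load imbalance and, for iterative solvers, any change in convergence behavior caused by the decomposition.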

TBBench: A Micro-Benchmark Suite for Intel Threading Building Blocks

  • Marowka, Ami
    • Journal of Information Processing Systems / v.8 no.2 / pp.331-346 / 2012
  • Task-based programming is becoming the method of choice for extracting the desired performance from multi-core chips. It expresses a program in terms of lightweight logical tasks rather than heavyweight threads. Intel Threading Building Blocks (TBB) is a task-based parallel programming paradigm for multi-core processors. The performance gain of this paradigm depends to a great extent on the efficiency of its parallel constructs. The overheads incurred by these constructs determine the ability to create large-scale parallel programs, especially in the case of fine-grain parallelism. This paper presents a study of TBB parallelization overheads. For this purpose, a TBB micro-benchmark suite called TBBench has been developed. We use TBBench to evaluate the parallelization overheads of TBB on different multi-core machines and with different compilers. We report the relative overheads in detail and analyze the results.

Design and Verification of High-Performance Parallel Processor Hardware for JPEG Encoder (JPEG 인코더를 위한 고성능 병렬 프로세서 하드웨어 설계 및 검증)

  • Kim, Yong-Min;Kim, Jong-Myon
    • IEMEK Journal of Embedded Systems and Applications / v.6 no.2 / pp.100-107 / 2011
  • As the use of mobile multimedia devices has increased in recent years, the need for high-performance multimedia processors has grown. In this regard, we propose a SIMD (Single Instruction Multiple Data) based parallel processor that supports high-performance multimedia applications with low energy consumption. The proposed parallel processor consists of 16 processing elements (PEs) and operates with a 3-stage pipeline. Experimental results for the JPEG encoding algorithm indicate that the proposed parallel processor outperforms conventional parallel processors in terms of performance and energy efficiency. In addition, the proposed parallel processor architecture was developed and verified with Verilog HDL and an FPGA prototype system.

Battery Equalization Method for Parallel-connected Cells Using Dynamic Resistance Technique

  • La, Phuong-Ha;Choi, Sung-Jin
    • Proceedings of the KIPE Conference / 2018.11a / pp.36-38 / 2018
  • As battery capacity requirements increase, battery cells are connected in a parallel configuration. However, the current shared by each cell becomes unequal because of the imbalance between the cells' impedances, which results in mismatched states of charge (SOC). Conventional fixed-resistance balancing methods are limited in equalization performance and system efficiency. This paper proposes a battery equalization method based on a dynamic resistance technique, which can improve equalization performance and reduce power dissipation. Based on the SOC rate of the parallel-connected battery cells, the switches in the equalization circuit are controlled to change the equivalent series impedance of the parallel branch, which regulates the current flow to maximize SOC utilization. To verify the method, the operation of four parallel-connected 18650 Li-ion battery cells, each rated 3.7 V and 2.6 Ah, is simulated in MATLAB/Simulink. The results show that the SOCs are balanced to within a 1% difference with less power dissipation than the conventional method.

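The abstract above describes how changing the equivalent series impedance of a parallel branch regulates the current each cell carries. The underlying current-sharing relation can be sketched in Python under a strong simplification: each cell is modeled as an ideal voltage source (its open-circuit voltage) in series with a fixed resistance, and the load draws a constant current. The function name and all numeric values are hypothetical. This is basic circuit analysis, not the paper's control method:

```python
def branch_currents(ocv, resistance, load_current):
    """Discharge current (A) of each parallel branch feeding one load.

    ocv          -- open-circuit voltage of each cell (V)
    resistance   -- total series resistance of each branch (ohm)
    load_current -- current drawn by the load (A)
    """
    conductance = [1.0 / r for r in resistance]
    # Kirchhoff's current law at the common bus gives the bus voltage.
    bus_voltage = (
        sum(v * g for v, g in zip(ocv, conductance)) - load_current
    ) / sum(conductance)
    return [(v - bus_voltage) * g for v, g in zip(ocv, conductance)]


if __name__ == "__main__":
    ocv = [3.7, 3.7, 3.7, 3.7]           # hypothetical cell voltages
    mismatched = [0.05, 0.05, 0.05, 0.10]  # one branch has higher impedance
    equalized = [0.10, 0.10, 0.10, 0.10]   # resistance added to the other three
    print(branch_currents(ocv, mismatched, 7.0))
    print(branch_currents(ocv, equalized, 7.0))
```

With mismatched resistances the high-impedance branch carries less current than the others; raising the series resistance of the other branches evens out the sharing, at the cost of extra resistive loss.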

Performance Evaluation and Verification of MMX-type Instructions on an Embedded Parallel Processor (임베디드 병렬 프로세서 상에서 MMX타입 명령어의 성능평가 및 검증)

  • Jung, Yong-Bum;Kim, Yong-Min;Kim, Cheol-Hong;Kim, Jong-Myon
    • Journal of the Korea Society of Computer and Information / v.16 no.10 / pp.11-21 / 2011
  • This paper introduces a SIMD (Single Instruction Multiple Data) based parallel processor that efficiently processes the massive data inherent in multimedia. In addition, this paper implements MMX (MultiMedia eXtension)-type instructions on the data parallel processor and evaluates and analyzes their performance. The reference data parallel processor consists of 16 processors, each of which has a 32-bit datapath. Experimental results for a JPEG compression application with a 1280x1024 pixel image indicate that MMX-type instructions achieve a 50% performance improvement over the baseline instructions on the same data parallel architecture. In addition, MMX-type instructions achieve 100% and 51% improvements over the baseline instructions in energy efficiency and area efficiency, respectively. These results demonstrate that multimedia-specific instructions, including MMX-type instructions, have potential for widely used many-core GPUs (Graphics Processing Units) and any type of parallel processor.

Finite element analysis of welding process by parallel computation (병렬 처리를 이용한 용접 공정 유한 요소 해석)

  • 임세영;김주완;최강혁;임재혁
    • Proceedings of the KWS Conference / 2003.11a / pp.156-158 / 2003
  • An implicit finite element implementation of Leblond's transformation plasticity constitutive equations, which are widely used for welded steel structures, is proposed in the framework of parallel computing. The implementation is based on the multiplicative decomposition of the deformation gradient and a hyperelastic formulation. We examine the efficiency of parallel computation for the finite element analysis of a welded structure using a domain-wise multi-frontal solver.

Three-dimensional finite element analysis of arc-welding process via parallel computing (아크 용접 공정의 3차원 병렬처리 유한 요소 해석)

  • 임세영;김주완;김현규;조영삼
    • Proceedings of the KWS Conference / 2002.05a / pp.161-163 / 2002
  • An implicit finite element implementation of Leblond's transformation plasticity constitutive equations, which are widely used for welded steel structures, is proposed in the framework of parallel computing. The implementation is based on the updated Lagrangian formulation. We examine the efficiency of parallel computation for the finite element analysis of a welded structure using a multi-frontal solver.

Design and Implementation of 1.8kW bi-directional LDC with Parallel Control Strategy for Mild Hybrid Electric Vehicles (병렬제어기법이 적용된 1.8kW급 마일드 하이브리드 양방향 LDC 설계 및 구현)

  • Kim, Hyun-Bin;Jeong, Jea-Woong;Bae, Sungwoo;Kim, Jong-Soo
    • The Transactions of the Korean Institute of Power Electronics / v.22 no.1 / pp.75-81 / 2017
  • This paper presents the design and parallel control strategy of a 1.8 kW low-voltage DC-DC converter (LDC) for mild hybrid electric vehicles, aimed at improving power density, system efficiency, and operational stability. Topology and control scheme are important for achieving high system efficiency and power density in such an LDC because of the very low voltage and large current at its input and output terminals. Therefore, the optimal topological structure and control algorithm are examined, and a detailed design methodology for the power and control stages is presented. A working sample of the 1.8 kW LDC is designed and implemented by applying the adopted topology and control strategy. Experimental results indicate a maximum efficiency of 92.45% and a power density of 560 W/L.

A Study on the Evaluation of Air Change Efficiency of Multi-Air-Conditioner with Ventilation System for Heating Season (환기시스템이 적용된 히트펌프의 난방시 급기효율 평가에 관한 연구)

  • Kwon Yong-Il;Han Hwataik;Kim Kyung-Hwan;Chung Baik-Young;Lee Gam-Gue
    • Korean Journal of Air-Conditioning and Refrigeration Engineering / v.17 no.1 / pp.56-61 / 2005
  • Indoor air quality has recently become a concern for human health. This study investigates the air diffusion performance and the air change efficiency of a classroom when outdoor air is introduced in addition to the heating/cooling operation of a ceiling-mounted heat pump. A CFD analysis is performed to investigate the effect of the discharge angle of the air jets from the heat pump for both parallel and series types of outdoor air system. The series type is observed to create a more uniform indoor environment than the parallel type in general. It can be concluded that the discharge angle should not be larger than 40° for the parallel type, in order to avoid thermal stratification in the room.

High Efficiency Drive of Dual Inverter Driven SPMSM with Parallel Split Stator

  • Lee, Yongjae;Ha, Jung-Ik
    • Journal of international Conference on Electrical Machines and Systems / v.2 no.2 / pp.216-224 / 2013
  • This paper describes a dual inverter drive for a fractional-slot concentrated-winding permanent magnet synchronous machine (PMSM). PMSMs are widely used in applications ranging from small servo motors to generators of a few megawatts, thanks to their high efficiency and torque density. The fractional-slot concentrated-winding PMSM is especially popular in applications that require a wide operating range, because it offers a very wide constant power speed ratio. High-speed operation, however, requires a large negative d-axis current to reduce the back-EMF regardless of output torque. In a surface-mounted PMSM, this field-weakening current does not contribute to torque generation and causes inverter and copper losses. To reduce the losses from the field-weakening current, this paper proposes a PMSM with a split stator and a parallel dual inverter drive. The proposed drive reduces the back-EMF and enables efficient operation at high speed and light load. The control strategy of the proposed dual inverter system is established through loss analysis and simulation. The concept is verified experimentally.