• Title/Summary/Keyword: parallel communication

Performance Enhancement of Parallel Multifrontal Solver on Block Lanczos Method

  • Byun, Wan-Il;Kim, Seung-Jo
    • Journal of the Korean Society for Industrial and Applied Mathematics / v.13 no.1 / pp.13-20 / 2009
  • IPSAP, a finite element analysis program, has been developed for high-performance parallel computing. The program consists of various analysis modules, including stress, vibration, and thermal analysis. The M-orthogonal block Lanczos algorithm with shift-invert transformation is used to solve eigenvalue problems in the vibration module. The multifrontal algorithm, one of the most efficient direct linear equation solvers, is applied to the factorization and triangular system solving phases of this block Lanczos iteration routine. In this study, the performance enhancement of IPSAP consists of the following stages: 1) minimizing communication volume in the factorization phase by modifying parallel matrix subroutines; 2) minimizing idle time in the triangular system solving phase by using a partial inverse of the frontal matrix and the LCM (least common multiple) concept.
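
For orientation, the shift-invert transformation named in this abstract can be written in its standard textbook form for a generalized eigenproblem. The symbols below (K for the stiffness matrix, M for the mass matrix, σ for the shift) are assumed notation and do not appear in the abstract itself.

```latex
K x = \lambda M x
\quad\Longrightarrow\quad
(K - \sigma M)^{-1} M x = \theta x,
\qquad
\theta = \frac{1}{\lambda - \sigma}
```

Each application of the transformed operator requires a solve with K − σM. This is presumably where the factorization and triangular system solving phases mentioned in the abstract enter the Lanczos iteration.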

A two-level parallel algorithm for material nonlinearity problems

  • Lee, Jeeho;Kim, Min Seok
    • Structural Engineering and Mechanics / v.38 no.4 / pp.405-416 / 2011
  • An efficient two-level domain decomposition parallel algorithm is proposed for solving large-DOF structural problems with nonlinear material models that generate unsymmetric tangent matrices, such as the group of plastic-damage material models. A parallel version of the stabilized bi-conjugate gradient method is developed to solve the unsymmetric coarse problems iteratively. In the present approach, the coarse DOF system, rather than the whole system equation, is solved in parallel on each processor to minimize data communication between processors, which helps maintain computing performance on a non-supercomputer-level cluster system. The performance test results show that the proposed algorithm provides scalable computing performance and an efficient approach to solving large-DOF nonlinear structural problems on a cluster system.
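
As a rough illustration of the stabilized bi-conjugate gradient (BiCGSTAB) iteration mentioned in this abstract, the following is a minimal serial sketch for a small dense unsymmetric system. It is not the parallel version developed in the paper; the function names, the dense list-of-lists matrix format, and the tolerance are all assumptions made for the example, and no breakdown handling is included.

```python
def matvec(a, x):
    """Dense matrix-vector product for a list-of-rows matrix."""
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in a]


def dot(u, v):
    return sum(ui * vi for ui, vi in zip(u, v))


def bicgstab(a, b, tol=1e-10, max_iter=100):
    """Solve a x = b for a (possibly unsymmetric) matrix a.

    Textbook unpreconditioned BiCGSTAB with a zero initial guess.
    Hypothetical helper for illustration; no breakdown safeguards.
    """
    n = len(b)
    x = [0.0] * n
    r = list(b)          # residual b - a x for the zero initial guess
    r_hat = list(r)      # fixed shadow residual
    rho = alpha = omega = 1.0
    v = [0.0] * n
    p = [0.0] * n
    for _ in range(max_iter):
        if dot(r, r) ** 0.5 < tol:
            break
        rho_new = dot(r_hat, r)
        beta = (rho_new / rho) * (alpha / omega)
        p = [ri + beta * (pi - omega * vi) for ri, pi, vi in zip(r, p, v)]
        v = matvec(a, p)
        alpha = rho_new / dot(r_hat, v)
        s = [ri - alpha * vi for ri, vi in zip(r, v)]
        if dot(s, s) ** 0.5 < tol:
            # The half step already converged; skip the stabilizing step.
            x = [xi + alpha * pi for xi, pi in zip(x, p)]
            break
        t = matvec(a, s)
        omega = dot(t, s) / dot(t, t)
        x = [xi + alpha * pi + omega * si for xi, pi, si in zip(x, p, s)]
        r = [si - omega * ti for si, ti in zip(s, t)]
        rho = rho_new
    return x


# Small unsymmetric example; the exact solution is [0.1, 0.6].
solution = bicgstab([[4.0, 1.0], [2.0, 3.0]], [1.0, 2.0])
print(solution)
```

In a distributed setting, the matrix-vector products and dot products are the steps that require communication, which is why restricting the iteration to the coarse problem can reduce data exchange.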

An Implementation of High-Speed Parallel Processing System for Neural Network Design by Using the Multicomputer Network (다중 컴퓨터 망에서 신경회로망 설계를 위한 고속병렬처리 시스템의 구현)

  • 김진호;최흥문
    • Journal of the Korean Institute of Telematics and Electronics B / v.30B no.5 / pp.120-128 / 1993
  • In this paper, an implementation of a high-speed parallel processing system for neural network design on a multicomputer network is presented. Linear speedup expandability is increased by reducing the synchronization penalty and the communication overhead. We also present parallel processing models and their performance evaluation models for each of the parallelization methods of the neural network. Experimental results for neural network character recognition based on the proposed system show that the proposed approach has higher linear speedup expandability than other systems. The proposed parallel processing models and performance evaluation models could be used effectively for the design and performance estimation of neural networks on a multicomputer network.

Finite element analysis of strip rolling process using distributive parallel algorithm (평판압연공정 유한요소해석의 분산병렬처리에 관한 연구)

  • Gwon, Kie-Chan;Youn, Sung-Kie
    • Transactions of the Korean Society of Mechanical Engineers A / v.21 no.12 / pp.2096-2105 / 1997
  • A parallel approach using a network of engineering workstations is presented for efficient computation in the elastoplastic analysis of the strip rolling process. The domain decomposition method is used, coupled with the frontal solver for elimination of the internal degrees of freedom in each subdomain. PVM is used for message passing and synchronization between processors. A 2-D plane strain problem and the strip rolling process are analyzed to demonstrate the performance of the algorithm, and the factors that strongly affect efficiency are discussed. Despite the considerable communication time on the network, the results illustrate the advantages of this parallel algorithm over its corresponding sequential algorithm.

A Parallel Processing Structure for the Discrete Kalman Filter (이산 칼만 필터의 병렬처리 구조)

  • 김용준;이장규;김병중
    • The Transactions of the Korean Institute of Electrical Engineers / v.39 no.10 / pp.1057-1065 / 1990
  • A parallel processing algorithm is proposed for the discrete Kalman filter, which is one of the most commonly used filtering techniques in modern control, signal processing, and communication. To decrease the number of computations critical in the Kalman filter, previously proposed parallel algorithms use either a hierarchical structure based on distributed processing of measurements or a systolic structure that disperses the computational burden. In this paper, a new parallel Kalman filter employing a structure similar to recursive doubling is proposed. The state estimates produced by the new algorithm converge to the true values faster because the new algorithm can process data twice as fast as the conventional Kalman filter. Moreover, it maintains the optimality of the conventional Kalman filter.
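
The abstract refers to "a structure similar to recursive doubling" without giving details. As background only, the sketch below shows generic recursive doubling applied to prefix sums; it is not the paper's filter structure, and the function name is hypothetical.

```python
def prefix_sums_recursive_doubling(values):
    """Inclusive prefix sums in about log2(n) rounds.

    In each round, every position i >= step adds the value held `step`
    positions to its left. On a parallel machine, all positions within
    one round could be updated simultaneously; here they run in a loop.
    """
    result = list(values)
    step = 1
    while step < len(result):
        previous = list(result)  # snapshot so each update reads last round's data
        for i in range(step, len(result)):
            result[i] = previous[i] + previous[i - step]
        step *= 2
    return result


print(prefix_sums_recursive_doubling([1, 2, 3, 4, 5]))  # [1, 3, 6, 10, 15]
```

The point of the pattern is that a recurrence over n items finishes in roughly log2(n) parallel rounds instead of n sequential steps.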

A Parallel Kalman Filter for Discrete Linear Time-invariant System (이산 선형 시불변시스템에 대한 병렬칼만필터)

  • Lee, Jang-Gyu;Kim, Yong-Joon;Kim, Hyoung-Joong
    • Proceedings of the KIEE Conference / 1990.07a / pp.64-67 / 1990
  • A parallel processing algorithm is proposed for the discrete Kalman filter, which is one of the most commonly used filtering techniques in modern control, signal processing, and communication. Previously proposed parallel algorithms for decreasing the number of computations needed in the Kalman filter use either hierarchical structures based on distributed processing of measurements or systolic structures that disperse the computational burden. In this paper, a new parallel Kalman filter employing a structure similar to recursive doubling is proposed. The state estimates produced by the new algorithm converge with a data processing speed twice that of the conventional Kalman filter. Moreover, it maintains the optimality of the conventional Kalman filter.
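
For readers unfamiliar with the "conventional Kalman filter" used here as the baseline, the following is a minimal scalar sketch of one predict/update cycle. The function name, parameter names, and default values are assumptions chosen for illustration, and the sketch is sequential (one measurement per step), not the parallel structure the paper proposes.

```python
def kalman_step(x, p, z, a=1.0, h=1.0, q=0.0, r=1.0):
    """One predict/update cycle of a scalar discrete Kalman filter.

    x, p : previous state estimate and its error variance
    z    : new measurement
    a, h : state-transition and measurement coefficients
    q, r : process and measurement noise variances
    """
    # Predict the state and its variance one step ahead.
    x_pred = a * x
    p_pred = a * p * a + q
    # Update with the measurement using the Kalman gain.
    k = p_pred * h / (h * p_pred * h + r)
    x_new = x_pred + k * (z - h * x_pred)
    p_new = (1.0 - k * h) * p_pred
    return x_new, p_new


# Estimate a constant from a stream of noisy measurements.
estimate, variance = 0.0, 1.0
for measurement in [2.0, 1.8, 2.2]:
    estimate, variance = kalman_step(estimate, variance, measurement)
print(estimate, variance)
```

Because each step depends on the previous estimate, the sequential filter handles measurements one at a time, which is the dependency a parallel formulation has to restructure.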

Performance Enhancement of Parallel Prime Sieving with Hybrid Programming and Pipeline Scheduling (혼합형 병렬처리 및 파이프라이닝을 활용한 소수 연산 알고리즘)

  • Ryu, Seung-yo;Kim, Dongseung
    • KIPS Transactions on Computer and Communication Systems / v.4 no.10 / pp.337-342 / 2015
  • We develop a new parallelization method for the Sieve of Eratosthenes algorithm that improves both computation speed and energy efficiency. Pipeline scheduling is included for better load balancing after proper workload partitioning. The method runs on multicore CPUs with a hybrid parallel programming model that uses both message passing and multithreaded computation. Experiments on small-scale clusters and on a PC with a mobile processor show significant improvements in execution time and energy consumption.
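
The workload partitioning idea can be sketched with a segmented sieve, in which each segment of the range could be handed to a separate worker. This is a sequential sketch under assumed names (`base_primes`, `sieve_segment`, `partitioned_primes`) with equal-size segments; it does not reproduce the paper's hybrid message-passing and multithreading scheme or its pipeline scheduling.

```python
from math import isqrt


def base_primes(limit):
    """Primes up to `limit` by the plain Sieve of Eratosthenes."""
    if limit < 2:
        return []
    is_prime = [True] * (limit + 1)
    is_prime[0] = is_prime[1] = False
    for i in range(2, isqrt(limit) + 1):
        if is_prime[i]:
            for j in range(i * i, limit + 1, i):
                is_prime[j] = False
    return [i for i, flag in enumerate(is_prime) if flag]


def sieve_segment(lo, hi, primes):
    """Primes in [lo, hi), given every prime p with p * p < hi."""
    flags = [True] * (hi - lo)
    for p in primes:
        if p * p >= hi:
            break
        # First multiple of p inside the segment, but never below p * p.
        start = max(p * p, ((lo + p - 1) // p) * p)
        for j in range(start, hi, p):
            flags[j - lo] = False
    return [lo + i for i, flag in enumerate(flags) if flag and lo + i >= 2]


def partitioned_primes(n, num_parts):
    """Primes <= n, computed one segment at a time."""
    primes = base_primes(isqrt(n))
    size = -(-(n + 1) // num_parts)  # ceiling division
    found = []
    for part in range(num_parts):
        lo = part * size
        hi = min(lo + size, n + 1)
        if lo < hi:
            # Each call is independent and could run on its own worker.
            found.extend(sieve_segment(lo, hi, primes))
    return found


print(partitioned_primes(50, 4))
```

Segments only share the small list of base primes, so the per-segment work needs no communication once that list has been distributed. Equal-size segments do not give equal work, though, which is presumably the imbalance that load balancing addresses.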

Parallel Control of Mini Ice Storage Air Conditioner Using LonWorks System (LonWorks 시스템을 이용한 소형빙축열 에어컨의 병렬 제어)

  • Kim, Hong-Ryeol;Cho, Sung-Kyu;Chung, Nam-Jong;Kim, Dae-Won
    • Proceedings of the KIEE Conference / 2003.07d / pp.2358-2361 / 2003
  • In this paper, a parallel control method that coordinates the operation of multiple mini ice storage air conditioners is proposed based on the LonWorks system. The proposed method is a master/slave control method based on LonTalk network communication. In this method, the master controller that leads whole-system operation is determined by date to avoid uneven use of the outdoor units. In addition, a duty shift control method is proposed to avoid redundant operation of the outdoor units and to equalize their operation frequencies. The proposed parallel control method and duty shift control method are evaluated through simulation tests, and their efficiency is validated by comparison with existing methods.
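
The abstract states only that the master controller "is determined by date". One simple way to realize such a rule is a daily rotation, sketched below; the rotation formula and the function name `master_for_date` are assumptions for illustration, not the rule used in the paper.

```python
from datetime import date


def master_for_date(day, num_units):
    """Index of the unit acting as master on `day`.

    Hypothetical rule: rotate the master role by one unit per calendar
    day so that, over time, every unit leads equally often.
    """
    return day.toordinal() % num_units


# Over four consecutive days, each of four units is master exactly once.
for offset in range(4):
    day = date.fromordinal(date(2003, 7, 1).toordinal() + offset)
    print(day.isoformat(), master_for_date(day, 4))
```

Because every controller can evaluate the rule from the shared calendar date, no negotiation over the network is needed to agree on the master.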

Control Method for Reducing Circulating Current in Parallel Operation of DC Distribution System for Building Applications (빌딩용 DC 배전 시스템의 병렬 운전 시 발생하는 순환전류를 저감시키기 위한 제어 기법)

  • Kim, Hack-Seong;Shin, Soo-Cheol;Lee, Hee-Jun;Jung, Chul-Ho;Han, Dong-Woo;Won, Chung-Yuen
    • The Transactions of the Korean Institute of Power Electronics / v.18 no.3 / pp.256-262 / 2013
  • In large systems such as DC distribution for buildings, operating a number of converter modules in parallel is common. In parallel operation, circulating current is directly related to the loss of the entire system. Accordingly, having each module share the same current is most important for the safety of the power system. In this paper, a control method for reducing circulating current in parallel operation is proposed. Furthermore, the response and steady-state operation of the parallel system are verified by simulation and experimental results.

High-speed simulation for fossil power plants using a parallel DSP system (병렬 DSP 시스템을 이용한 화력발전소 고속 시뮬레이션)

  • 박희준;김병국
    • Journal of the Korean Institute of Telematics and Electronics C / v.35C no.4 / pp.38-49 / 1998
  • A fossil power plant can be modeled by a large number of algebraic and differential equations. When a large, complicated fossil power plant is simulated on a computer such as a workstation or PC, it takes a long time to evaluate all of the equations. New processing systems with high computing speed are therefore needed for real-time or high-speed (faster than real-time) simulators. This paper presents an enhanced strategy in which high computing power is provided by parallel processing on DSP processors with communication links. The DSP system is designed for general-purpose use, and the parallel DSP system can be expanded simply by connecting new DSP modules to it. General-purpose DSP modules and a VME interface module were developed. A new model and techniques for task allocation are also presented, which take into account the special characteristics of parallel I/O and computation. As a realistic cost function for task allocation, we suggest the 'simulation period', which represents the period of the simulation output intervals. Based on the developed parallel DSP system and realistic task allocation techniques, we could achieve good parallel processing efficiency and simulation speed faster than real time.
