• Title/Summary/Keyword: Eight-parallel

Search Result 121, Processing Time 0.033 seconds

Parallel Process System and its Application to Steam Generator Structural Analysis

  • Chang Yoon-Suk;Ko Han-Ok;Choi Jae-Boong;Kim Young-Jin
    • Journal of Mechanical Science and Technology
    • /
    • v.19 no.11
    • /
    • pp.2007-2015
    • /
    • 2005
  • A large-scale analysis to evaluate complex material and structural behaviors is one of interesting topic in diverse engineering and scientific fields. Also, the utilization of massively parallel processors has been a recent trend of high performance computing. The objective of this paper is to introduce a parallel process system which consists of general purpose finite element analysis solver as well as parallelized PC cluster. The later was constructed using eight processing elements and the former was developed adopting both hierarchical domain decomposition method and balancing domain decomposition method. Then, to verify the efficiency of the established system, it was applied for structural analysis of steam generator in nuclear power plant. Since the prototypal evaluation results agreed well to the corresponding reference solutions it is believed that, after reinforcement of PC cluster by increasing number of processing elements, the promising parallel process system can be utilized as a useful tool for advanced structural integrity evaluation.

An Efficient Interpolation Hardware Architecture for HEVC Inter-Prediction Decoding

  • Jin, Xianzhe;Ryoo, Kwangki
    • Journal of information and communication convergence engineering
    • /
    • v.11 no.2
    • /
    • pp.118-123
    • /
    • 2013
  • This paper proposes an efficient hardware architecture for high efficiency video coding (HEVC), which is the next generation video compression standard. It adopts several new coding techniques to reduce the bit rate by about 50% compared with the previous one. Unlike the previous H.264/AVC 6-tap interpolation filter, in HEVC, a one-dimensional seven-tap and eight-tap filter is adopted for luma interpolation, but it also increases the complexity and gate area in hardware implementation. In this paper, we propose a parallel architecture to boost the interpolation performance, achieving a luma $4{\times}4$ block interpolation in 2-4 cycles. The proposed architecture contains shared operations reducing the gate count increased due to the parallel architecture. This makes the area efficiency better than the previous design, in the best case, with the performance improved by about 75.15%. It is synthesized with the MagnaChip $0.18{\mu}m$ library and can reach the maximum frequency of 200 MHz.

High-Performance Variable-Length Reed-Solomon Decoder Architecture for Gigabit WPAN Applications (기가비트 WPAN용 고성능 가변길이 리드-솔로몬 복호기 구조)

  • Choi, Chang-Seok;Lee, Han-Ho
    • Journal of the Institute of Electronics Engineers of Korea SD
    • /
    • v.49 no.1
    • /
    • pp.25-34
    • /
    • 2012
  • This paper presents a universal architecture for variable-length eight-parallel Reed-Solomon (RS) decoder for high-rate WPAN systems. The proposed architecture can support not only RS(255,239) code but various shortened RS codes. Moreover, variable-length architecture provides variable low latency for various shortened RS codes and the eight-parallel design also provides high data processing rate. Using 90-$nm$ CMOS standard cell technology, the proposed RS decoder has been synthesized and measured for performance. The proposed RS decoder can provide a maximum 19-$Gbps$ data rate at clock frequency 300 $MHz$.

Thirty-two-tupling frequency millimeter-wave generation based on eight Mach-Zehnder modulators connected in parallel

  • Xinqiao Chen;Siyuan Dai;Zhihan Li;Wenyao Ba;Xu Chen
    • ETRI Journal
    • /
    • v.46 no.2
    • /
    • pp.194-204
    • /
    • 2024
  • A new method is proposed to generate a 32-tupling frequency millimeter wave (MMW) with eight Mach-Zehnder modulators (MZMs) connected in parallel. Theoretical analyses and simulation experiments are conducted. The optical sideband suppression ratio (OSSR) of the obtained ±16th order optical sidebands are 61.54 dB and 61.42 dB, and the radio frequency spurious suppression ratios (RFSSRs) of the generated 32-tupling frequency MMW are 55.52 dB and 55.27 dB based on the theoretical analysis and simulation experiments, respectively; these outcomes verified the feasibility of the new method. The main parameters used to affect the stability of the generated signal are the modulation index and extinction ratio of MZM. Their effects on the OSSR and RFSSR of the generated signals are investigated when they deviate from their designed values. Compared with the other proposed methods for the generation of 32-tupling frequency MMW by MZM, our method has the best spectral purity and stability, and it is expected to have important MMW over fiber applications.

Parallel Processing of k-Means Clustering Algorithm for Unsupervised Classification of Large Satellite Images: A Hybrid Method Using Multicores and a PC-Cluster (대용량 위성영상의 무감독 분류를 위한 k-Means Clustering 알고리즘의 병렬처리: 다중코어와 PC-Cluster를 이용한 Hybrid 방식)

  • Han, Soohee;Song, Jeong Heon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.37 no.6
    • /
    • pp.445-452
    • /
    • 2019
  • In this study, parallel processing codes of k-means clustering algorithm were developed and implemented in a PC-cluster for unsupervised classification of large satellite images. We implemented intra-node code using multicores of CPU (Central Processing Unit) based on OpenMP (Open Multi-Processing), inter-nodes code using a PC-cluster based on message passing interface, and hybrid code using both. The PC-cluster consists of one master node and eight slave nodes, and each node is equipped with eight multicores. Two operating systems, Microsoft Windows and Canonical Ubuntu, were installed in the PC-cluster in turn and tested to compare parallel processing performance. Two multispectral satellite images were tested, which are a medium-capacity LANDSAT 8 OLI (Operational Land Imager) image and a high-capacity Sentinel 2A image. To evaluate the performance of parallel processing, speedup and efficiency were measured. Overall, the speedup was over N / 2 and the efficiency was over 0.5. From the comparison of the two operating systems, the Ubuntu system showed two to three times faster performance. To confirm that the results of the sequential and parallel processing coincide with the other, the center value of each band and the number of classified pixels were compared, and result images were examined by pixel to pixel comparison. It was found that care should be taken to avoid false sharing of OpenMP in intra-node implementation. To process large satellite images in a PC-cluster, code and hardware should be designed to reduce performance degradation caused by file I / O. Also, it was found that performance can differ depending on the operating system installed in a PC-cluster.

The Flow Characteristics of Parallel Plane Jets Using Particle Image Velocimetry Technique (I) - Unventilated Jet - (PIV기법을 이용한 병렬 평면제트의 유동특성 (I) - 유입이 제한된 제트 -)

  • Kim, Dong-Keon;Yoon, Soon-Hyun
    • Transactions of the Korean Society of Mechanical Engineers B
    • /
    • v.27 no.3
    • /
    • pp.302-310
    • /
    • 2003
  • Experiments were conducted to show the characteristics of the flow on unventilated parallel plane jets. Measurements of mean velocity components and turbulent intensities were carried out with a particle image velocimetry to investigate the flow field generated by the air issued from two identical plane parallel nozzles and mixed with the ambient air. The measurements range of these experiments were Reynolds number of 5300 based on the nozzle width and the cases of nozzle-to-nozzle distance were four times. six times and eight times the width of the nozzle. Results show that a recirculation zone with a sub-atmospheric static pressure was bounded by the inner shear layers of the individual jets and the nozzles plate. The positions. where maximum value of mean turbulent intensities and mean turbulent kinetic energy show, were at the same position with the merging point. The spread of jets in the merging region increases more rapidly than that of Jets in the converging and the combined region. As nozzle-to-nozzle distances were increased. it was shown that merging and combined lengths were shorter.

Design to Chip with Multi-Access Memory System and Parallel Processor for 16 Processing Elements of Image Processing Purpose (영상처리용 16개의 처리기를 위한 다중접근기억장치 및 병렬처리기의 칩 설계)

  • Lim, Jae-Ho;Park, Seong-Mi;Park, Jong-Won
    • Journal of Korea Multimedia Society
    • /
    • v.14 no.11
    • /
    • pp.1401-1408
    • /
    • 2011
  • This dissertation present a chip with Multi-Access Memory System(MAMS) and parallel processor for 16 Processing Elements of image processing purpose. MAMS is a kind of parallel access memory system and can simultaneously access to random pixel datas with eight types. It is possible to set a interval about pixel datas to access, too. The parallel processor built-in MAMS actually has been realized in 2003 but its performance fell short of a real time process for high-definition images. I designed a improved parallel processing system by means of addition and expansion of Memory Modules and Processing Elements of previous one. It is feasible to perform a Morphological Closing at the speed of 3 times of the previous one and 6 times of serial system.

Numerical procedures for extreme impulsive loading on high strength concrete structures

  • Danielson, Kent T.;Adley, Mark D.;O'Daniel, James L.
    • Computers and Concrete
    • /
    • v.7 no.2
    • /
    • pp.159-167
    • /
    • 2010
  • This paper demonstrates numerical techniques for complex large-scale modeling with microplane constitutive theories for reinforced high strength concrete, which for these applications, is defined to be around the 7000 psi (48 MPa) strength as frequently found in protective structural design. Applications involve highly impulsive loads, such as an explosive detonation or impact-penetration event. These capabilities were implemented into the authors' finite element code, ParaAble and the PRONTO 3D code from Sandia National Laboratories. All materials are explicitly modeled with eight-noded hexahedral elements. The concrete is modeled with a microplane constitutive theory, the reinforcing steel is modeled with the Johnson-Cook model, and the high explosive material is modeled with a JWL equation of state and a programmed burn model. Damage evolution, which can be used for erosion of elements and/or for post-analysis examination of damage, is extracted from the microplane predictions and computed by a modified Holmquist-Johnson-Cook approach that relates damage to levels of inelastic strain increment and pressure. Computation is performed with MPI on parallel processors. Several practical analyses demonstrate that large-scale analyses of this type can be reasonably run on large parallel computing systems.

Current Calculation in Three-Dimensional Finite Element Model (3차원 유한요소 모델에서의 전류계산)

  • Kim, Deok-Won
    • Proceedings of the KIEE Conference
    • /
    • 1987.07b
    • /
    • pp.1288-1291
    • /
    • 1987
  • An finite element code has been developed to calculate current flowing through an 8-node trilinear cubic element from the calculated potentials on the eight node. This code was implemented to the three-dimensional thoracic model for impedance cardiography to find the total currents in the z-direction flowing through the layers which are parallel to x-y plane. The accuracy of the total current was estimated from its variation among the layers. It was found that the accuracy of the total currents in the layers was less than 0.6%.

  • PDF

A fuzzy logic based bin picking technique (퍼지논리를 이용한 Bin picking 방법)

  • 김태원;서일홍;김기엽
    • 제어로봇시스템학회:학술대회논문집
    • /
    • 1991.10a
    • /
    • pp.979-983
    • /
    • 1991
  • A novel 2-dimensional matched filter of the parallel-jaw type using fuzzy logics is proposed for bin picking. Specifically, averaged pixel intensity of the windowed region for the filtering is considered to be fuzzy. Also membership function for darkness and brightness are designed by employing the intensite histogram of image. Then a rule is given to know how much a windowed region can be a possible holdsite. Furthermore eight rules are made to determine the part orientation, where Mamadi's resoning method is applied. To show the validities of our proposed technique. some experimental results are illustrated and compared with the results by conventional matched filter technique.

  • PDF