• Title/Summary/Keyword: Parallel interface


Parallel Finite Element Simulation of the Incompressible Navier-Stokes Equations (병렬 유한요소 해석기법을 이용한 유동장 해석)

  • Choi H. G.;Kim B. J.;Kang S. W.;Yoo J. Y.
    • Korean Society for Computational Fluids Engineering: Conference Proceedings / 2002.05a / pp.8-15 / 2002
  • For large-scale computation of turbulent flows around an arbitrarily shaped body, a parallel LES (large eddy simulation) code based on the domain decomposition method has recently been developed. The METIS and MPI (Message Passing Interface) libraries are used for domain partitioning and for data communication between processors, respectively. For unsteady computation of the incompressible Navier-Stokes equations, a 4-step splitting finite element algorithm [1] is adopted, and either the Smagorinsky or the dynamic LES model can be chosen for modeling the small eddies in turbulent flows. To validate the parallel code and estimate its performance, a three-dimensional laminar flow generated by natural convection inside a cube has been solved. We then solved the turbulent flow around the MIRA (Motor Industry Research Association) model at $Re = 2.6\times10^6$, based on the model height and inlet free-stream velocity, using 32 processors on an IBM SMP cluster, and compared the results with existing experimental data.
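
For reference, the Smagorinsky closure named in this abstract is usually written in the textbook form below. This is the standard formulation, not necessarily the exact variant used by the authors; the symbols $C_s$ (model constant), $\Delta$ (filter width) and $\bar{S}_{ij}$ (resolved strain-rate tensor) are introduced here for illustration and do not appear in the abstract.

```latex
\begin{equation}
  \nu_t = (C_s \Delta)^2 \, |\bar{S}|, \qquad
  |\bar{S}| = \sqrt{2\, \bar{S}_{ij} \bar{S}_{ij}}, \qquad
  \bar{S}_{ij} = \frac{1}{2}\left(
      \frac{\partial \bar{u}_i}{\partial x_j}
    + \frac{\partial \bar{u}_j}{\partial x_i}\right)
\end{equation}
```

In the dynamic model, $C_s$ is computed from the resolved field during the simulation instead of being fixed in advance.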


TEXTURE AND RELATED MICROSTRUCTURE AND SURFACE TOPOGRAPHY OF VAPOR DEPOSITS

  • Lee, Dong-Nyung
    • Journal of Surface Science and Engineering / v.29 no.5 / pp.301-313 / 1996
  • The texture of vapor deposits (PVD and CVD) changes from the orientation that places the lowest-energy lattice plane parallel to the substrate, when the atom or ion concentration adjacent to the deposit is low, to the orientation that places higher-energy crystal planes parallel to the substrate as that concentration increases. However, in the early stage of deposition, the deposit-substrate interface energy and the surface energy are the most important energies of the system. Therefore, if a lattice match is established between the substrate and the deposit without generating much strain energy, epitaxial growth takes place to reduce the interfacial energy. When epitaxial growth does not take place, the surface energy is dominant in the early stage of deposition, and the lowest-energy crystal plane tends to be placed parallel to the substrate up to a critical thickness. This thickness depends on the deposition condition. If the deposition condition does not favor placing the lowest-energy crystal plane parallel to the substrate, the initial texture will change to one compatible with the deposition condition as the film thickness increases, and the texture turnover thickness will be short. The microstructure and surface topography of deposits are related to their texture.


Implementation of MPI-based WiMAX Base Station for SDR System (SDR 시스템을 위한 MPI 기반 WiMAX 기지국의 구현)

  • Ahn, Chi Young;Kim, Hyo Han;Choi, Seung Won
    • Journal of Korea Society of Digital Industry and Information Management / v.9 no.4 / pp.59-67 / 2013
  • Compared to conventional hardware-oriented base stations, a Software Defined Radio (SDR)-based base station provides various advantages, especially in flexibility and expandability. It enables the multimode capability required in the 4th-generation (4G) environment, which aims at a convergence network of various communication standards. However, since a single base station processes all the data required by multiple waveforms, the SDR base station faces a data processing speed problem. In this paper, we propose a new concept of SDR base station system that adopts parallel processing in a cluster environment. We implemented a WiMAX system based on the SDR concept, using the Message Passing Interface (MPI) to speed up operations. To maximize the efficiency of parallel processing in signal processing, we analyze how the algorithm in each module is related to the data to be processed. With the implemented system, we show a drastic improvement in operation time due to parallel processing using the proposed MPI technology. In addition, we demonstrate the feasibility of the SDR system for 4G and beyond.

A PARALLEL PRECONDITIONER FOR GENERALIZED EIGENVALUE PROBLEMS BY CG-TYPE METHOD

  • MA, SANGBACK;JANG, HO-JONG
    • Journal of the Korean Society for Industrial and Applied Mathematics / v.5 no.2 / pp.63-69 / 2001
  • In this study, we are concerned with computing in parallel a few of the smallest eigenvalues and their corresponding eigenvectors of the eigenvalue problem $Ax={\lambda}Bx$, where $A$ is symmetric and $B$ is symmetric positive definite. Both $A$ and $B$ are large and sparse. Recently, iterative algorithms based on the optimization of the Rayleigh quotient have been developed, and the CG scheme for optimizing the Rayleigh quotient has proven a very attractive and promising technique for computing small extreme eigenvalues of large sparse eigenproblems. As in the case of a system of linear equations, successful application of the CG scheme to eigenproblems also depends on the preconditioning technique. A proper choice of preconditioner significantly improves the convergence of the CG scheme. The idea underlying the present work is a parallel computation of the Multi-Color Block SSOR preconditioning for the CG optimization of the Rayleigh quotient, together with deflation techniques. Multi-Coloring is a simple technique to obtain parallelism of order $n$, where $n$ is the dimension of the matrix. Block SSOR is a symmetric preconditioner that is expected to minimize interprocessor communication due to the blocking. We implemented the method on the CRAY-T3E with 128 nodes. The MPI (Message Passing Interface) library was adopted for interprocessor communication. The test problems were drawn from discretizations of partial differential equations by finite difference methods.
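
The multi-coloring idea mentioned in this abstract can be illustrated with a minimal sketch, under stated assumptions: a 5-point finite-difference grid stands in for the sparse matrix, and a plain greedy coloring stands in for whatever ordering the authors actually used. The function names `grid_adjacency`, `greedy_coloring` and `color_classes` are hypothetical. Unknowns that share a color have no direct coupling, so they could be updated in parallel within one sweep.

```python
def grid_adjacency(nx, ny):
    """Adjacency of a 5-point stencil on an nx-by-ny grid (row-major node ids)."""
    adj = {}
    for i in range(nx):
        for j in range(ny):
            nbrs = []
            if i > 0:
                nbrs.append((i - 1) * ny + j)
            if i < nx - 1:
                nbrs.append((i + 1) * ny + j)
            if j > 0:
                nbrs.append(i * ny + (j - 1))
            if j < ny - 1:
                nbrs.append(i * ny + (j + 1))
            adj[i * ny + j] = nbrs
    return adj


def greedy_coloring(adj):
    """Give each node the smallest color not used by an already-colored neighbor."""
    color = {}
    for node in sorted(adj):
        used = {color[n] for n in adj[node] if n in color}
        c = 0
        while c in used:
            c += 1
        color[node] = c
    return color


def color_classes(color):
    """Group node ids by color; each group is one parallel update step."""
    classes = {}
    for node, c in color.items():
        classes.setdefault(c, []).append(node)
    return classes


adj = grid_adjacency(4, 4)
color = greedy_coloring(adj)
classes = color_classes(color)
```

On this grid the greedy pass reproduces the familiar red-black ordering, so a sweep over all unknowns splits into two steps in which every update is independent of the others.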


Conditions of Applications, Situations and Functions Applicable to Gesture Interface

  • Ryu, Tae-Beum;Lee, Jae-Hong;Song, Joo-Bong;Yun, Myung-Hwan
    • Journal of the Ergonomics Society of Korea / v.31 no.4 / pp.507-513 / 2012
  • Objective: This study developed a hierarchy of conditions of applications (devices), situations and functions that are applicable to gesture interfaces. Background: The gesture interface is one of the promising interfaces for natural and intuitive interaction with intelligent machines and environments. Although many studies have developed new gesture-based devices and gesture interfaces, little is known about which applications, situations and functions are suited to gesture interfaces. Method: This study surveyed about 120 papers relevant to designing and applying gesture interfaces and vocabulary to find the conditions under which gestures are applicable to applications, situations and functions. The conditions extracted from 16 closely related papers were rearranged, and a hierarchy of them was developed to evaluate the applicability of applications, situations and functions to gesture interfaces. Results: This study summarized 10, 10 and 6 conditions of applications, situations and functions, respectively. In addition, the gesture-applicable condition hierarchy of applications, situations and functions was developed based on the semantic similarity, ordering and serial or parallel relationships among them. Conclusion: This study collected gesture-applicable conditions of applications, situations and functions, and developed a hierarchy of them to evaluate the applicability of gesture interfaces. Application: The gesture-applicable conditions and hierarchy can be used in developing a framework and detailed criteria to evaluate the applicability of applications, situations and functions. Moreover, they can enable designers of gesture interfaces and vocabulary to determine which applications, situations and functions are applicable to gesture interfaces.

The Performance Analysis of A High-speed Mechanism for SNMP Connection Management in Centralized Network Control Platform (중앙 집중형 네트워크 제어 플랫폼에서 SNMP 연결 관리의 고속화 방안 및 성능 분석)

  • Ko, Young-Suk;Kwon, Tae-Hyun;Kim, Choon-Hee;Nam, Hyun-Soon;Jeong, You-Hyeon;Cha, Young-Wook
    • The KIPS Transactions: Part C / v.14C no.6 / pp.525-536 / 2007
  • The Network Control Platform (NCP) and Quality of Service Switch (QSS) are being developed to realize centralized control and management technology, which is essential for guaranteeing traffic engineering and service quality in a next-generation network. This paper adopts a parallel mechanism together with a thread pool and an object pool to achieve high-speed connection management over the existing SNMP interface between the NCP and QSS. We built a connection management test-bed in a laboratory environment to validate the functionality of high-speed connection management. Using the test-bed, we also measured and analyzed the connection setup delay and the completion ratio. We found that the parallel mechanism and the object pool are the most important performance parameters for achieving high-speed connection management in the SNMP interface between the NCP and QSS.
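
The combination of a thread pool and an object pool described in this abstract can be sketched as follows. This is a generic illustration using the Python standard library, not the NCP/QSS implementation: `ObjectPool`, `ConnectionRequest`, `setup_connection` and `run_batch` are hypothetical names, and the SNMP exchange itself is replaced by a placeholder string.

```python
import queue
from concurrent.futures import ThreadPoolExecutor


class ConnectionRequest:
    """Hypothetical reusable request object (stands in for an SNMP request)."""

    def __init__(self):
        self.conn_id = None

    def reset(self):
        self.conn_id = None


class ObjectPool:
    """Pre-allocates objects once so worker threads reuse them."""

    def __init__(self, factory, size):
        self._items = queue.Queue()
        for _ in range(size):
            self._items.put(factory())

    def acquire(self):
        # Blocks until an object is free; queue.Queue is thread-safe.
        return self._items.get()

    def release(self, obj):
        self._items.put(obj)


def setup_connection(pool, conn_id):
    """Borrow a request object, 'send' it, and return it to the pool."""
    req = pool.acquire()
    try:
        req.conn_id = conn_id
        return f"setup-ok:{req.conn_id}"  # placeholder for the real exchange
    finally:
        req.reset()
        pool.release(req)


def run_batch(conn_ids, workers=4, pool_size=4):
    """Process connection setups in parallel with a fixed set of threads."""
    pool = ObjectPool(ConnectionRequest, pool_size)
    with ThreadPoolExecutor(max_workers=workers) as executor:
        return list(executor.map(lambda c: setup_connection(pool, c), conn_ids))


results = run_batch(range(10))
```

The thread pool avoids creating a thread per request, and the object pool avoids allocating a request object per request; both costs would otherwise be paid on every connection setup.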

Parallel Processing of Airborne Laser Scanning Data Using a Hybrid Model Based on MPI and OpenMP (MPI와 OpenMP기반 하이브리드 모델을 이용한 항공 레이저 스캐닝 자료의 병렬 처리)

  • Han, Soo-Hee;Park, Il-Suk;Heo, Joon
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography / v.30 no.2 / pp.135-142 / 2012
  • In the present study, a parallel processing method running on a multi-core PC cluster is introduced to produce a digital surface model (DSM) and a digital terrain model (DTM) from very large airborne laser scanning data sets. A hybrid model using both the Message Passing Interface (MPI) and OpenMP was devised by revising a conventional model that uses only MPI, and was tested on a multi-core PC cluster for performance validation. In the results, the hybrid model did not perform better in the interpolation process used to produce the DSM, but its overall performance was better thanks to the reduced number of MPI calls. Additionally, the scheduling function of OpenMP was shown to enhance performance by balancing the unequal loads placed on cores by the irregular distribution of airborne laser scanning data.
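
The load-balancing effect attributed to OpenMP scheduling in this abstract can be illustrated with a small simulation, under stated assumptions. It is not the authors' code and uses no OpenMP: `static_makespan` mimics a contiguous block partition (in the spirit of `schedule(static)`), `dynamic_makespan` hands each task to whichever worker is free first (in the spirit of `schedule(dynamic)` with chunk size 1), and the task costs are made-up numbers standing in for tiles with irregular point density.

```python
import heapq


def static_makespan(costs, workers):
    """Finish time when tasks are split into contiguous, equal-count blocks."""
    chunk = -(-len(costs) // workers)  # ceiling division
    return max(sum(costs[i:i + chunk]) for i in range(0, len(costs), chunk))


def dynamic_makespan(costs, workers):
    """Finish time when each task goes to the worker that becomes idle first."""
    finish = [0] * workers  # min-heap of per-worker finish times
    heapq.heapify(finish)
    for c in costs:
        earliest = heapq.heappop(finish)
        heapq.heappush(finish, earliest + c)
    return max(finish)


# Hypothetical per-tile costs: a few dense tiles followed by sparse ones.
costs = [8, 8, 8, 8, 1, 1, 1, 1, 1, 1, 1, 1]
static_time = static_makespan(costs, workers=4)
dynamic_time = dynamic_makespan(costs, workers=4)
```

With these costs the block partition gives one worker three dense tiles (24 time units), while the dynamic assignment finishes in 10, which is the kind of imbalance that irregularly distributed data produces.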

Design and Implementation of a Scalable Framework for Parallel Program Performance Visualization (병렬 프로그램 성능가시화를 위한 확장성 있는 프레임워크 설계 및 구현)

  • Moon, Sang-Su;Moon, Young-Shik;Kim, Jung-Sun
    • Journal of KIISE: Computing Practices and Letters / v.7 no.2 / pp.109-120 / 2001
  • In this paper, we propose the design and implementation of a portable, extensible, and efficient performance visualization framework for high-performance parallel program development. The framework adopts a layered architecture consisting of three independent layers: an instrumentation layer, a trace interface layer, and a visualization layer. The instrumentation layer was constructed as an ECL, which captures generated events, and the EDL/JPAL constitutes the trace interface layer, which provides problem-oriented interfaces between the visualization layer and the instrumentation layer. Finally, the visualization layer was designed in a plug-and-play style for easy removal, addition and composition of various filters, views and view groups. The proposed performance visualization framework is expected to be used as an independent performance debugging and analysis tool and as a core component in an integrated parallel programming environment.


Comparison of Parallel Computation Performances for 3D Wave Propagation Modeling using a Xeon Phi x200 Processor (제온 파이 x200 프로세서를 이용한 3차원 음향 파동 전파 모델링 병렬 연산 성능 비교)

  • Lee, Jongwoo;Ha, Wansoo
    • Geophysics and Geophysical Exploration / v.21 no.4 / pp.213-219 / 2018
  • In this study, we performed 3D wave propagation modeling using a Xeon Phi x200 processor and compared its parallel computation performance with that of a Xeon CPU. Unlike the 1st-generation Xeon Phi coprocessor, codenamed Knights Corner, the 2nd-generation Xeon Phi x200 processor requires no additional communication between the internal memory and the main memory, since it can run an operating system directly. The Xeon Phi x200 processor can run large-scale computations independently, using its large main memory and high-bandwidth memory. To compare parallel computation performance, we performed the modeling using the MPI (Message Passing Interface) and OpenMP (Open Multi-Processing) libraries. Numerical examples using the SEG/EAGE salt model demonstrated that modeling on the Xeon Phi, with its large number of computational cores and high-bandwidth memory, is 2.69 to 3.24 times faster than on the 12-core CPU.

Effect of Heat Treatment on Microstructure and Mechanical Properties of Electromagnetic Duo-Cast Al Hybrid Material

  • Suh, Jun Young;Park, Sung Jin;Kwon, Do-Kyun;Chang, Si Young
    • Korean Journal of Materials Research / v.28 no.9 / pp.499-505 / 2018
  • This study investigates the microstructure and mechanical properties of an Al hybrid material prepared by electromagnetic duo-casting to determine the effect of heat treatment. The hybrid material is composed of an Al-Mg-Si alloy, pure Al, and the interface between the Al-Mg-Si alloy and pure Al. It is heat-treated at 373, 573 and 773 K for 1 h, and T6 treated (solution treatment at 773 K for 1 h and aging at 433 K for 5 h). As the temperature increases, the grain size of the Al-Mg-Si alloy in the hybrid material increases. The grain size of the T6-treated Al-Mg-Si alloy is similar to that of the alloy heat-treated at 773 K for 1 h. The interface region, where the micro-hardness increases from the pure Al to the Al-Mg-Si alloy, widens with increasing heat treatment temperature. The hybrid material with a macro-interface parallel to the tensile direction shows increased tensile strength and 0.2 % proof stress, and decreased elongation, after T6 heat treatment. On the other hand, when the macro-interface is perpendicular to the tensile direction, heat treatment makes no great difference. The bending strength of the hybrid material with a long macro-interface to the bending direction is higher than that with a short macro-interface, and is improved by heat treatment. The hybrid material with a long macro-interface to the bending direction fractures by cracking through the eutectic structure in the Al-Mg-Si alloy. However, in the hybrid material with a short macro-interface, bending deformation is observed only in the pure Al region.