• 제목/요약/키워드: Parallel Process

검색결과 1,455건 처리시간 0.033초

병렬 컴퓨터를 이용한 형상 압연공정 유한요소 해석의 분산병렬처리에 관한 연구 (Finite Element Analysis of Shape Rolling Process using Destributive Parallel Algorithms on Cray T3E)

  • 권기찬;윤성기
    • 대한기계학회논문집A
    • /
    • 제24권5호
    • /
    • pp.1215-1230
    • /
    • 2000
  • Parallel Approaches using Cray T3E which is NIPP (Massively Parallel Processors) machine are presented for the efficient computation of the finite element analysis of 3-D shape rolling processes. D omain decomposition method coupled with parallel linear equation solver is used. Domain decomposition is applied for obtaining element tangent stifffiess matrices and residual vectors. Direct and iterative parallel algorithms are used for solving the linear equations. Direct algorithm is_parallel version of direct banded matrix solver. For iterative algorithms, the well-known preconditioned conjugate gradient solver with Jacobi preconditioner is also employed. Moreover a new effective iterative scheme with block inverse matrix preconditioner, which is named by present authors, is presented and its results are compared with the one using Jacobi preconditioner. PVM and MPI are used for message passing and synchronization between processors. The performance and efficiency of each algorithm is discussed and comparisons are made among different algorithms.

분산 협업 환경에서의 유한요소 해석에 관한 연구 (Finite Element Analysis with STEP in Distributive and Collaborative Environment)

  • 조성욱;권기억
    • 한국CDE학회논문집
    • /
    • 제11권5호
    • /
    • pp.384-392
    • /
    • 2006
  • In this research, the feasibility of distributed finite element analysis system with STEP and CORBA has been investigated. The enabling technologies such as CORBA and Java play key roles in the development of integrated and geographically distributed application software. In addition to the distribution of analysis modules, numerical solution process itself is again divided into parallel processes using multi-frontal method for computational efficiency. In contrast to the specially designed parallel process for specific hardware, CORBA-based parallel process is well suited for heterogeneous platforms over the network. The idea of Web-based distributed analysis system may be applied to the engineering ASP for design and analysis in the product development processes. We believe that the proposed approach for the analysis can be extended to the entire product development process for sharing and utilizing common product data in the distributed engineering environment, thus eventually provide basis for virtual enterprise.

자바를 위한 분산된 병렬 컴퓨팅 환경 (Distributed Parallel Computing Environment for Java)

  • 이상윤;김승호
    • 전자공학회논문지CI
    • /
    • 제41권6호
    • /
    • pp.23-37
    • /
    • 2004
  • 자바의 쓰레드는 다중 처리 환경에서 하나의 프로그램 공간 내의 독립적인 프로세스로 취급되는 객체 요소이므로 병렬처리를 위한 독립적인 프로세스로 활용할 수 있다. 또한, 자바의 동기화 메커니즘과 쓰레드를 활용하면 병렬 처리를 수행하는 응용프로그램을 쉽게 작성할 수 있다. 이에 따라, 자바의 병렬 처리 지원 기능을 분산된 컴퓨팅 환경에 적용하기 위한 많은 연구 결과가 있다. 본 논문에서는 레거시 자바 프로그램에 포함된 쓰레드를 분산된 컴퓨팅 환경에서 병렬 수행 하도록 지원하는 시스템 환경을 제안한다. TORB(Transparent Object Request Broker)라고 명명된 본 시스템은 프로그래밍 투명성을 지원하므로 이미 작성된 레거시 자바 프로그램을 간단한 변환 과정을 거친 후 병렬 수행 하도록 지원한다. TORB는 본 연구팀에서 이미 발표한 분산 프로그래밍 도구의 기능을 확장한 것이며, 이는 지정된 기능을 지정된 컴퓨터에서 수행하도록 지원하는 전형적인 분산처리 기능만을 보유하고 있었다.

Performance Study of Satellite Image Processing on Graphics Processors Unit Using CUDA

  • Jeong, In-Kyu;Hong, Min-Gee;Hahn, Kwang-Soo;Choi, Joonsoo;Kim, Choen
    • 대한원격탐사학회지
    • /
    • 제28권6호
    • /
    • pp.683-691
    • /
    • 2012
  • High resolution satellite images are now widely used for a variety of mapping applications including photogrammetry, GIS data acquisition and visualization. As the spectral and spatial data size of satellite images increases, a greater processing power is needed to process the images. The solution of these problems is parallel systems. Parallel processing techniques have been developed for improving the performance of image processing along with the development of the computational power. However, conventional CPU-based parallel computing is often not good enough for the demand for computational speed to process the images. The GPU is a good candidate to achieve this goal. Recently GPUs are used in the field of highly complex processing including many loop operations such as mathematical transforms, ray tracing. In this study we proposed a technique for parallel processing of high resolution satellite images using GPU. We implemented a spectral radiometric processing algorithm on Landsat-7 ETM+ imagery using CUDA, a parallel computing architecture developed by NVIDIA for GPU. Also performance of the algorithm on GPU and CPU is compared.

산업용 6관절 로봇의 원격제어를 위한 실시간 병렬데이터통신 인터페이스 (Development of Realtime Parallel Data Communication Interface for Remote Control of 6-DOF Industrial Robot)

  • 최명환;이우원
    • 산업기술연구
    • /
    • 제21권A호
    • /
    • pp.97-103
    • /
    • 2001
  • This paper presents the development of the I/O Interface for the real time parallel data communication between controller of a six-axis industrial robot(CRS-A460) and an external computer. The proposed I/O Interface consists of the hardware I/O interface and the software that is downloaded to the robot controller and executed by the controller operating system. The constitution of the digital I/O Port for CRS-A460 robot controller and the digital I/O board for IBM-PC are presented as well as the Process Control Program of the robot controller. The developed protocol for the parallel data communication is described. The data communication is tested, and the performance is analysed. In particular, it is shown that the real-time constraint of the robot controller process is satisfied.

  • PDF

픽셀-병렬 영상처리에 있어서 포맷 컨버터 설계에 관한 연구 (A Study on the Design of Format Converter for Pixel-Parallel Image Processing)

  • 김현기;김현호;하기종;최영규;류기환;이천희
    • 대한전자공학회:학술대회논문집
    • /
    • 대한전자공학회 2001년도 하계종합학술대회 논문집(2)
    • /
    • pp.269-272
    • /
    • 2001
  • In this paper we proposed the format converter design and implementation for real time image processing. This design method is based on realized the large processor-per-pixel array by integrated circuit technology in which this two types of integrated structure is can be classify associative parallel processor and parallel process with DRAM cell. Layout pitch of one-bit-wide logic is identical memory cell pitch to array high density PEs in integrate structure. This format converter design has control path implementation efficiently, and can be utilized the high technology without complicated controller hardware. Sequence of array instruction are generated by host computer before process start, and instructions are saved on unit controller. Host computer is executed the pixel-parallel operation starting at saved instructions after processing start

  • PDF

병렬기구 공직기계의 머신시뮬레이션 및 가공정밀도 평가 (The Evaluation of Machining Accuracy and the Machine Simulation for Parallel Kinematic Machine Tool(PKMT))

  • 신혁;유한식;고해주;정윤교
    • 한국기계가공학회지
    • /
    • 제8권4호
    • /
    • pp.41-47
    • /
    • 2009
  • This research deals with evaluation of machining accuracy for Parallel Kinematic Machine Tool(PKMT) applied parallel type robot system with high precision and stiffness. For this purpose, machine simulation is carried out to foreknow collision and interference between workpiece and tool. Furthermore, on the basis of machine simulation data, PKMT is manufactured. Machining accuracy such as cylindricity straightness, squareness, parallelism circularity, concentricity pitch error and yaw error, is measured by using coordinate measuring machine. Test piece for evaluation of machining accuracy is designed and manufactured under the standard of ISO 10791-7.

  • PDF

화상의 골격화에 대한 최적화 방법 (The Optimal Skeleton Method of an Image)

  • 신충호;오무송
    • 한국멀티미디어학회논문지
    • /
    • 제6권2호
    • /
    • pp.224-229
    • /
    • 2003
  • 골격선 추출에 대한 효율적인 방법을 이용해서 골격화방법을 제안한다 제안한 방법은 윤곽선 검출과정을 거친 후 본 방법을 적용함으로써 영상 골격화 효율을 높이고자한다 기존의 골격화 방법은 Zhang, Lu/Wang, 그리고 Paul방법을 사용하였다. 제안된 방법은 병렬적인 처리 방법과 더 해서 반복과정 에서 마스크의 네(화소) 이웃의 값과 네(화소) 이웃외의 주변 화소값을 비교하여 원본이미지와 일치시켜 처리하는 방식이 다. 결론적으로, 제안된 방법은 향상된 골격선의 연결과 품질 등을 보였다.

  • PDF

David II: 효과적인 메모리 시스템을 가지는 병렬 렌더링 프로세서 (David II: A new architecture for parallel rendering processors with effective memory system)

  • 이길환;박우찬;김일산;한탁돈
    • 한국정보처리학회:학술대회논문집
    • /
    • 한국정보처리학회 2004년도 춘계학술발표대회
    • /
    • pp.1655-1658
    • /
    • 2004
  • Current rendering processors are organized mainly to process a triangle as fast as possible and recently parallel 3D rendering processors, which can process multiple triangles in parallel with multiple rasterizers, begin to appear. For high performance in processing triangles, it is desirable for each rasterizer have its own local pixel cache. However, the consistency problem may occur in accessing the data at the same address simultaneously by more than one rasterizer. In this paper, we propose a parallel rendering processor architecture, called DAVID II, resolving such consistency problem effectively. Moreover, the proposed architecture reduces the latency due to a pixel cache miss significantly. The experimental results show that DAVID II achieves almost linear speedup at best case even in sixteen rasterizers.

  • PDF

An XPDL-Based Workflow Control-Structure and Data-Sequence Analyzer

  • Kim, Kwanghoon Pio
    • KSII Transactions on Internet and Information Systems (TIIS)
    • /
    • 제13권3호
    • /
    • pp.1702-1721
    • /
    • 2019
  • A workflow process (or business process) management system helps to define, execute, monitor and manage workflow models deployed on a workflow-supported enterprise, and the system is compartmentalized into a modeling subsystem and an enacting subsystem, in general. The modeling subsystem's functionality is to discover and analyze workflow models via a theoretical modeling methodology like ICN, to graphically define them via a graphical representation notation like BPMN, and to systematically deploy those graphically defined models onto the enacting subsystem by transforming into their textual models represented by a standardized workflow process definition language like XPDL. Before deploying those defined workflow models, it is very important to inspect its syntactical correctness as well as its structural properness to minimize the loss of effectiveness and the depreciation of efficiency in managing the corresponding workflow models. In this paper, we are particularly interested in verifying very large-scale and massively parallel workflow models, and so we need a sophisticated analyzer to automatically analyze those specialized and complex styles of workflow models. One of the sophisticated analyzers devised in this paper is able to analyze not only the structural complexity but also the data-sequence complexity, especially. The structural complexity is based upon combinational usages of those control-structure constructs such as subprocesses, exclusive-OR, parallel-AND and iterative-LOOP primitives with preserving matched pairing and proper nesting properties, whereas the data-sequence complexity is based upon combinational usages of those relevant data repositories such as data definition sequences and data use sequences. Through the devised and implemented analyzer in this paper, we are able eventually to achieve the systematic verifications of the syntactical correctness as well as the effective validation of the structural properness on those complicate and large-scale styles of workflow models. As an experimental study, we apply the implemented analyzer to an exemplary large-scale and massively parallel workflow process model, the Large Bank Transaction Workflow Process Model, and show the structural complexity analysis results via a series of operational screens captured from the implemented analyzer.