• Title/Summary/Keyword: parallel computers

Assessment of computational performance for a vector parallel implementation: 3D probabilistic model discrete cracking in concrete

  • Paz, Carmen N.M.;Alves, Jose L.D.;Ebecken, Nelson F.F.
    • Computers and Concrete / v.2 no.5 / pp.345-366 / 2005
  • This work presents an assessment of the computational performance of a vector-parallel implementation of a probabilistic model for concrete cracking in 3D. It continues the code optimization efforts reported in earlier works, Paz et al. (2002a,b and 2003). The probabilistic crack approach is based on the direct Monte Carlo method. Cracking is accounted for by means of 3D interface elements, and all nonlinearities are assumed to be restricted to the interface elements that model cracks. Heterogeneity governs the overall cracking behavior and the related size effects on concrete fracture. The computational kernels of the implementation are an inexact Newton iterative driver, which solves the nonlinear problem, and a preconditioned conjugate gradient (PCG) driver, which solves the linearized equations using an element-by-element (EBE) strategy to compute matrix-vector products. In particular, the paper analyzes code behavior using OpenMP directives on parallel vector processors (PVP) such as the CRAY SV1 and CRAY T94. The impact of the memory architecture on code performance, and some strategies devised to circumvent this issue, are addressed through numerical experiments.
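
The PCG/EBE kernel described in this abstract can be illustrated with a small sketch. It is not the authors' code: it assumes a Jacobi (diagonal) preconditioner, which the abstract does not specify, and replaces the 3D interface elements with a 1D chain of springs. All function names are hypothetical. The point it shows is that the matrix-vector product is accumulated element by element, so the global matrix is never assembled.

```python
# Sketch only: Jacobi-preconditioned conjugate gradient with element-by-element
# (EBE) matrix-vector products. The 1D spring chain and the Jacobi
# preconditioner are illustrative assumptions, not taken from the paper.

def build_chain(n_nodes, k=1.0):
    """Return a list of (node_indices, element_matrix) for a grounded spring chain."""
    elements = [((0,), [[k]])]  # spring tying node 0 to the ground (makes K positive definite)
    for i in range(n_nodes - 1):
        elements.append(((i, i + 1), [[k, -k], [-k, k]]))
    return elements

def ebe_matvec(elements, x):
    """Compute y = K x by summing element contributions; K is never assembled."""
    y = [0.0] * len(x)
    for nodes, ke in elements:
        for a, i in enumerate(nodes):
            for b, j in enumerate(nodes):
                y[i] += ke[a][b] * x[j]
    return y

def ebe_diagonal(elements, n):
    """Accumulate the diagonal of K from the element matrices (Jacobi preconditioner)."""
    diag = [0.0] * n
    for nodes, ke in elements:
        for a, i in enumerate(nodes):
            diag[i] += ke[a][a]
    return diag

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def pcg(elements, b, tol=1e-10, max_iter=200):
    """Solve K x = b for symmetric positive definite K given only its elements."""
    n = len(b)
    diag = ebe_diagonal(elements, n)
    x = [0.0] * n
    r = list(b)                                   # residual for the zero initial guess
    z = [ri / di for ri, di in zip(r, diag)]      # preconditioned residual
    p = list(z)
    rz = dot(r, z)
    b_norm = dot(b, b) ** 0.5 or 1.0
    for _ in range(max_iter):
        if dot(r, r) ** 0.5 <= tol * b_norm:      # converged on relative residual
            break
        kp = ebe_matvec(elements, p)
        alpha = rz / dot(p, kp)
        x = [xi + alpha * pi for xi, pi in zip(x, p)]
        r = [ri - alpha * kpi for ri, kpi in zip(r, kp)]
        z = [ri / di for ri, di in zip(r, diag)]
        rz_new = dot(r, z)
        p = [zi + (rz_new / rz) * pi for zi, pi in zip(z, p)]
        rz = rz_new
    return x
```

For a three-node chain with unit stiffness and a unit load on the last node, `pcg(build_chain(3), [0.0, 0.0, 1.0])` should return displacements close to 1, 2 and 3. The loop over elements in `ebe_matvec` is the part that a vector or OpenMP implementation would parallelize.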

A Study on Distributed System Construction and Numerical Calculation Using Raspberry Pi

  • Ko, Young-ho;Heo, Gyu-Seong;Lee, Sang-Hyun
    • International journal of advanced smart convergence / v.8 no.4 / pp.194-199 / 2019
  • As system performance increases, data are increasingly processed in parallel rather than one item at a time. Today's CPU architectures have been developed to exploit multiple cores, and data processing methods are accordingly being developed to enable parallel processing. In recent years, desktop CPUs have gained more cores, data volumes have grown exponentially, and the need for data processing has grown with the development of artificial intelligence. The neural networks used in artificial intelligence consist of matrices, which makes them well suited to parallel processing. Against this background, this paper aims to speed up processing by building a cluster of Raspberry Pi computers and implementing a parallel processing system on it. The Raspberry Pi is a credit-card-sized single-board computer made by the Raspberry Pi Foundation in England, developed for education in schools and developing countries. It is cheap, and because many people use it, the information needed to work with it is easy to find. Distributed processing systems need to be supported by programs that connect multiple computers in parallel and run on the resulting system. The Raspberry Pi boards are connected to a switching hub, communicate with each other over the internal network, and implement parallel processing using the Message Passing Interface (MPI). Parallel processing programs can be written in Python, and C or Fortran can also be used. The system was tested for parallel processing by multiplying a two-dimensional array of size 10000 by 0.1. The tests showed a reduction in computation time and that the work can be parallelized up to the maximum number of cores in the system. The systems in this paper are built on a Linux-based single-board computer and are thought to require testing in different environments.
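
The test described in this abstract (scaling a two-dimensional array by 0.1 across several workers) follows a row-block decomposition pattern that can be sketched as below. This is an assumption-laden illustration, not the paper's program: threads from the Python standard library stand in for MPI ranks so that the example runs on one machine, and the function names are hypothetical. On a real cluster the split and merge steps would correspond to MPI scatter and gather operations.

```python
# Sketch only: row-block decomposition of "multiply a 2D array by a constant".
# Threads stand in for MPI ranks here; this is an assumption made so the
# example runs without a cluster or an MPI installation.
from concurrent.futures import ThreadPoolExecutor

def split_rows(n_rows, n_workers):
    """Return (start, stop) row ranges, as evenly sized as possible."""
    base, extra = divmod(n_rows, n_workers)
    ranges, start = [], 0
    for w in range(n_workers):
        stop = start + base + (1 if w < extra else 0)
        ranges.append((start, stop))
        start = stop
    return ranges

def scale_block(block, factor):
    """Work done by one worker: scale every value in its block of rows."""
    return [[value * factor for value in row] for row in block]

def parallel_scale(matrix, factor, n_workers):
    """Scatter row blocks to workers, scale them, and gather the results in order."""
    ranges = split_rows(len(matrix), n_workers)
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        futures = [pool.submit(scale_block, matrix[a:b], factor) for a, b in ranges]
        blocks = [f.result() for f in futures]
    return [row for block in blocks for row in block]
```

Because each worker needs at least one row to be useful, the number of workers that helps is bounded by both the row count and the number of available cores, which is consistent with the abstract's remark about the maximum number of cores.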

The teleautonomous control of an integrated FRHC-PUMA telerobot control system

  • Lee, Jin-S.;Kan, Edwin-P.
    • Institute of Control, Robotics and Systems: Conference Proceedings / 1990.10b / pp.974-979 / 1990
  • The system discussed in this paper is an integrated stand-alone system with the full functional capabilities required of a telerobot system. It is complete with a force-reflecting 6-DOF hand controller driving a PUMA 560 or 762 robot, an integrated force-torque wrist sensor, and a servo-driven parallel jaw gripper. A mix of custom and standard electronics, distributed computers and microprocessors, and embedded and downloadable software has been integrated into the system, giving rise to a powerful and flexible teleautonomous control system.

Design of an Optoelectronic Database Filter Chip (고성능 병렬 광 데이터처리 가속기)

  • 나종화
    • Proceedings of the Korean Information Science Society Conference / 2000.04a / pp.36-38 / 2000
  • An optoelectronic database filter chip for high-performance database computers and applications is proposed. The device is designed to perform the selection and projection operations of relational databases on the fly, in a page-parallel manner, to increase the overall performance of a database system. It uses a CMOS smart pixel array, consisting of detectors and combinational logic circuits, to perform the selection and projection operations.
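
For readers unfamiliar with the two relational operations named in this abstract, the following software sketch shows what a filter applied to one page of records computes. It says nothing about the optoelectronic hardware itself; the record layout and the function name are assumptions made for illustration.

```python
# Sketch only: selection (keep matching rows) and projection (keep chosen
# columns) applied to one page of records as it streams past. The dict-based
# record layout and the name filter_page are illustrative assumptions.

def filter_page(page, predicate, columns):
    """Yield the projected columns of every row on the page that satisfies the predicate."""
    for row in page:
        if predicate(row):                          # selection
            yield tuple(row[c] for c in columns)    # projection

page = [
    {"id": 1, "dept": "sales", "salary": 40},
    {"id": 2, "dept": "eng", "salary": 70},
    {"id": 3, "dept": "eng", "salary": 55},
]
matches = list(filter_page(page, lambda r: r["dept"] == "eng", ("id", "salary")))
```

Each row is tested independently of the others, which is what makes the operation a natural fit for a page-parallel device.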

Parallel Processing Algorithm of JPEG2000 Using GPU (GPU를 이용한 JPEG2000 병렬 알고리즘)

  • Lee, Dong-Ha;Cho, Shi-Won;Lee, Dong-Wook
    • The Transactions of The Korean Institute of Electrical Engineers / v.57 no.6 / pp.1075-1080 / 2008
  • Most modern computers and game consoles are equipped with powerful graphics processing units (GPUs) to accelerate graphics operations. However, since the graphics engines in these GPUs are specially designed for graphics operations, their computing power could not be exploited for more general, non-graphics operations. In this paper, we studied the GPU's graphics engine in order to accelerate image processing. Specifically, we implemented a JPEG2000 decoding/encoding framework that involves both OpenMP and the GPU. Initial experimental results show that a significant speed-up can be achieved by utilizing the GPU's power.

Genetic Scheduling Algorithm for FFT Data Flows in Parallel Computers (병렬 컴퓨터 시스템에서의 FFT 데이터 흐름도에 관한 유전 스케줄링 알고리즘)

  • 박월선;김금호;서루비;윤성대
    • Proceedings of the IEEK Conference / 2000.06c / pp.161-164 / 2000
  • We propose a genetic algorithm for the multi-scheduling problem on parallel computers, applied to three kinds of FFT data flows and taking into account the overhead of data exchange between processors. In the design of the genetic algorithm, we propose a chromosome representation that can simply encode and decode a solution without any heuristic information, an evaluation function that considers processor efficiency, and a genetic operator that inherits superior genes from the parents. Simulation results show better execution-time performance than the existing binary exchange algorithm (BEA).

PC networked parallel processing system for figures and letters

  • Kitazawa, M.;Sakai, Y.
    • Institute of Control, Robotics and Systems: Conference Proceedings / 1993.10b / pp.277-282 / 1993
  • In understanding concepts, there are two aspects: image and language. This paper discusses what is fundamental to finding proper relations between objects in a scene so that the meaning of the whole scene can be represented properly through experience in image and language. It is assumed that one of the objects in a scene has letters as objects inside its contour. Because the present system can deal with both figures and letters in a scene, this assumption makes it easy for the system to infer the context of a scene. Several personal computers on a LAN are used, and they process items in parallel.

Simulation of a CIM Workflow System Using Parallel Virtual Machine (PVM)

  • Chang-Ouk Kim
    • Journal of the Korea Society for Simulation / v.5 no.2 / pp.13-24 / 1996
  • A workflow is an ordered sequence of interdependent component data activities, each of which can be executed on an integrated information system by accessing a remote information system. In our previous research [4], we proposed a distributed CIM Workflow system consisting of a workflow execution model called DAF-Net and an agent-based information system called AIMIS. Given a component data activity, an interaction protocol among agents is needed to allocate that activity to a relevant information system. The objective of this research is to propose and test two protocols: the ARR (Asynchronous Request and Response) protocol and the NCL (Negotiation with Case-based Learning) protocol. To test the effectiveness of the protocols, we used the PVM (Parallel Virtual Machine) software to simulate the distributed CIM Workflow system. PVM provides a distributed computing environment in which users can run different software processes on different computers while allowing communication among the processes.

Two-Step Suboptimal Filters for Linear Dynamic Systems

  • Ahn, Jun-Il;Minhas, Rashid;Shin, Vladimir
    • Institute of Control, Robotics and Systems: Conference Proceedings / 2005.06a / pp.16-21 / 2005
  • This paper considers the problem of state estimation in linear continuous-time systems with a multi-sensor environment and observation uncertainties. We propose two suboptimal filtering algorithms for these types of systems. The filtering algorithms consist of two steps: local optimal Kalman estimates are computed in the first step, and these local estimates are linearly fused in the second step. Implementing the two-step filtering algorithms requires less memory than the optimal Kalman and adaptive Lainiotis-Kalman filters. Owing to the parallel structure of the proposed filters, parallel computers can be used for their design. The examples exhibit the effect of common noise on the performance of the fusion of local Kalman estimates based on observations from different sensors and in the presence of uncertainties.
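
The linear fusion step mentioned in this abstract can be written compactly. The abstract does not give the weights or the notation, so the following is only a generic sketch under assumed symbols: $N$ sensors, local Kalman estimates $\hat{x}_t^{(i)}$ of an $n$-dimensional state, and matrix weights $C_t^{(i)}$ constrained to sum to the identity so that the fused estimate stays unbiased when the local estimates are unbiased.

```latex
\hat{x}_t^{\mathrm{fus}} = \sum_{i=1}^{N} C_t^{(i)} \, \hat{x}_t^{(i)},
\qquad
\sum_{i=1}^{N} C_t^{(i)} = I_n
```

Since each $\hat{x}_t^{(i)}$ depends only on the observations of sensor $i$, the $N$ local filters can run independently, which is the parallel structure the abstract refers to.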

Dependability Analysis of Parallel Video Servers Using Fault Injection Simulation (결함 주입 시뮬레이션을 이용한 병렬 비디오 서버의 의존도 분석)

  • 정지영;김성수
    • Journal of the Korea Society for Simulation / v.9 no.2 / pp.51-61 / 2000
  • In recent years, significant advances in computer and communication technologies have made multimedia services feasible. As a result, various queueing models and cost models for the architecture and data placement of multimedia servers have been proposed. However, most of these models do not evaluate the dependability of the systems. In the design phase of a system, simulation is an important experimental means of performance and dependability analysis, and fault injection simulation has been used to evaluate dependability metrics. In this paper, we develop a fault injection simulation model to analyze the dependability of parallel video servers. In addition, we evaluate the reliability and MTTF (Mean Time To Failure) of the systems using the simulator.
