• Title/Summary/Keyword: Parallel interface


PVM Performance Enhancement over a High-Speed Myrinet (초고속 Myrinet 통신망에서의 PVM 성능 개선)

  • Kim, In-Soo;Shim, Jae-Hong;Choi, Kyung-Hee;Jung, Gi-Hyun;Moon, Kyeong-Deok;Kim, Tae-Geun
    • The Transactions of the Korea Information Processing Society / v.7 no.1 / pp.74-87 / 2000
  • PVM (Parallel Virtual Machine) provides a programming environment that allows a collection of networked workstations to appear as a single parallel computational resource. The performance of parallel applications in this environment depends on the performance of data transfers between tasks. In this paper, we present a new Myrinet-based communication model for PVM that improves PVM communication performance over a high-speed Myrinet LAN. The proposed model adopts a communication mechanism that allows any user-level process to access the network interface board directly, without going through the UDP/IP protocol stack in the kernel. This mechanism provides faster data transfers between PVM tasks over the Myrinet because it avoids the overhead of copying data between kernel space and user space and reduces the communication latency caused by the network protocol software layers. We implemented EPVM (Enhanced PVM), an updated version of the traditional UDP/IP-based PVM, on top of the proposed communication model over the Myrinet. Performance results show that EPVM achieves a communication speed-up of one to two times over the traditional PVM.


Parallel Computing Strategies for High-Speed Impact into Ceramic/Metal Plates (세라믹/금속판재의 고속충돌 파괴 유한요소 병렬 해석기법)

  • Moon, Ji-Joong;Kim, Seung-Jo;Lee, Min-Hyung
    • Journal of the Computational Structural Engineering Institute of Korea / v.22 no.6 / pp.527-532 / 2009
  • This paper discusses simulations of impact into ceramic and/or metal materials. To model the discrete nature of fracture and damage in brittle materials, we implemented a cohesive-law fracture model with a node separation algorithm for tensile failure and a Mohr-Coulomb model for compressive loading. The drawback of this scheme is its heavy computational cost, because new nodes are generated whenever a new crack surface is created. To reduce the computation time, the code was parallelized with the MPI library. In high-speed impact problems, the mesh configuration and the contact calculation change continuously as the time steps advance, which unbalances the computational load across processors. A dynamic load balancing technique, which re-allocates the load at run time, is used to achieve good parallel performance. Several impact problems are simulated, and the parallel performance and the accuracy of the solutions are discussed.
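
The abstract does not say which load balancing scheme the authors used. As a rough illustration of the general idea only, the sketch below re-assigns elements to processors with a greedy "heaviest first" heuristic, so that the per-processor totals stay close to each other. The element costs, the function names `rebalance` and `loads`, and the use of plain Python instead of MPI are all assumptions made for this example.

```python
import heapq


def rebalance(element_costs, n_procs):
    """Assign each element to a processor, heaviest element first.

    element_costs maps an element id to its (hypothetical) cost estimate.
    Returns a dict mapping element id -> processor id.
    """
    # Min-heap of (current load, processor id): the least loaded
    # processor is always at the top.
    heap = [(0.0, proc) for proc in range(n_procs)]
    heapq.heapify(heap)

    assignment = {}
    by_cost = sorted(element_costs.items(), key=lambda kv: kv[1], reverse=True)
    for elem, cost in by_cost:
        load, proc = heapq.heappop(heap)
        assignment[elem] = proc
        heapq.heappush(heap, (load + cost, proc))
    return assignment


def loads(element_costs, assignment, n_procs):
    """Total cost carried by each processor under a given assignment."""
    totals = [0.0] * n_procs
    for elem, proc in assignment.items():
        totals[proc] += element_costs[elem]
    return totals


# Hypothetical per-element costs after a crack has created new nodes.
costs = {0: 5.0, 1: 1.0, 2: 1.0, 3: 4.0, 4: 2.0, 5: 3.0}
assignment = rebalance(costs, 2)
balanced = loads(costs, assignment, 2)
```

In a real MPI code the re-assignment would also have to migrate element data between ranks and would be triggered only when the measured imbalance exceeds some threshold; neither step is shown here.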

The Optimum Design of Airfoil Shape with Parallel Computation (병렬연산을 이용한 익형의 최적 설계)

  • Jo, Jang-Geun;Park, Won-Gyu
    • Journal of the Korean Society for Aeronautical & Space Sciences / v.31 no.1 / pp.1-7 / 2003
  • This paper describes an aerodynamic optimization method for airfoil design. The Navier-Stokes equations were solved to account for the viscous flow around the airfoil. The Modified Method of Feasible Direction (MMFD) was used for the sensitivity analysis, and polynomial interpolation was used for the distance calculation of the minimization. The Message Passing Interface (MPI) library was adopted to reduce the computation time of the flow solver by decomposing the entire computational domain into 8 sub-domains and allocating 8 processors to them one-to-one. Parallel computation was also used for the sensitivity analysis by allocating each search direction to its own processor. The optimization reduced the drag of the airfoil while maintaining the lift at the tolerable design value.
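
The one-to-one mapping of 8 sub-domains to 8 processors can be pictured with a minimal sketch. It assumes a one-dimensional index range of grid cells and an even split; the abstract does not describe the actual grid topology or partitioning, and the function name `split_domain` and the cell count of 100 are hypothetical.

```python
def split_domain(n_cells, n_parts):
    """Split cell indices 0..n_cells-1 into n_parts contiguous ranges.

    Returns a list of (start, stop) pairs, one per sub-domain, with
    stop exclusive. Sub-domain sizes differ by at most one cell.
    """
    base, extra = divmod(n_cells, n_parts)
    ranges = []
    start = 0
    for rank in range(n_parts):
        # The first `extra` sub-domains take one additional cell.
        size = base + (1 if rank < extra else 0)
        ranges.append((start, start + size))
        start += size
    return ranges


# One sub-domain per processor: list index == processor rank.
subdomains = split_domain(100, 8)
```

With MPI, each process would typically compute only its own range from its rank and exchange boundary values with its neighbours after every iteration; that communication step is left out of this sketch.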

Design and Implementation of a Java Package for Sharing Array Data by the DSM Interface on a Cluster of Workstations (워크스테이션 클러스터 상에서 분산공유메모리 인터페이스로 배열 데이터의 공유를 지원하는 Java 패키지의 설계와 구현)

  • Lim, Hae-Jung;Kim, Myung
    • Journal of Korea Multimedia Society / v.2 no.3 / pp.355-365 / 1999
  • In this paper, we present JPAS (Java Package for Array Sharing), a Java package for sharing arrays of data on a cluster of workstations. It allows an array to be divided into several pieces, with each piece placed on a different host. JPAS uses Java RMI so that the entire array can be accessed through a location-transparent interface similar to that of a distributed shared memory system. JPAS is portable and easy to use because it is implemented in pure Java. To reduce network overhead, JPAS lets programmers exploit their prior knowledge of the application. Data consistency can be maintained through the value-updating methods defined for all the elements of an array. We developed parallel programs that use JPAS and tested them on a cluster of workstations. The test results show that JPAS is a parallel programming tool with reasonably good performance.
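
The location-transparent access described here can be illustrated with a toy model. It is written in Python, not Java, it is not the JPAS API, and it keeps every block in local lists instead of on remote hosts reached through RMI. The class name `BlockPartitionedArray` and its methods are hypothetical.

```python
class BlockPartitionedArray:
    """Toy stand-in for an array whose blocks live on different hosts.

    Callers use a single global index; the class translates it to a
    (host, local index) pair, which is the location-transparent part.
    """

    def __init__(self, data, n_hosts):
        data = list(data)
        self.length = len(data)
        # Ceiling division: elements per host, last host may hold fewer.
        self.block = -(-self.length // n_hosts)
        self.blocks = [
            data[h * self.block:(h + 1) * self.block] for h in range(n_hosts)
        ]

    def locate(self, index):
        """Map a global index to (host id, index inside that host's block)."""
        if not 0 <= index < self.length:
            raise IndexError(index)
        return divmod(index, self.block)

    def __getitem__(self, index):
        host, local = self.locate(index)
        return self.blocks[host][local]  # would be a remote read in a real system

    def __setitem__(self, index, value):
        host, local = self.locate(index)
        self.blocks[host][local] = value  # would be a remote update


shared = BlockPartitionedArray(range(10), 3)
shared[5] = 50  # the caller never names the host that owns element 5
```

In a distributed version each access could cost a network round trip, which is presumably why the abstract stresses letting programmers use application knowledge to reduce network overhead.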


Real-Time Compressed Video Acquisition System for Stereo 360 VR (Stereo 360 VR을 위한 실시간 압축 영상 획득 시스템)

  • Choi, Minsu;Paik, Joonki
    • Journal of Broadcast Engineering / v.24 no.6 / pp.965-973 / 2019
  • In this paper, a real-time stereo 4K@60fps 360 VR video capture system, consisting of video stream capture, video encoding, and stitching modules, is designed. The system produces stereo 4K@60fps 360 VR video by stitching six 2K@60fps streams captured in real time from six cameras through an HDMI interface. In the video capture phase, video is captured from each camera in real time using multiple threads. In the video encoding phase, raw frame memory transmission and parallel encoding are used to reduce the resource usage of data transmission between the video capture and video stitching modules. In the video stitching phase, real-time stitching is ensured by stitching calibration preprocessing.

Mass transfer kinetics using two-site interface model for removal of Cr(VI) from aqueous solution with cassava peel and rubber tree bark as adsorbents

  • Vasudevan, M.;Ajithkumar, P.S.;Singh, R.P.;Natarajan, N.
    • Environmental Engineering Research / v.21 no.2 / pp.152-163 / 2016
  • The present study investigates the potential of cassava peel and rubber tree bark for the removal of Cr(VI) from aqueous solution. A removal efficiency of more than 99% was obtained during the kinetic adsorption experiments with a dosage of 3.5 g/L for cassava peel and 8 g/L for rubber tree bark. A comparison of popular isotherm and kinetic models for evaluating the kinetics of mass transfer showed that the Redlich-Peterson model and the Langmuir model fitted well ($R^2 > 0.99$), giving maximum adsorption capacities of 79.37 mg/g and 43.86 mg/g for cassava peel and rubber tree bark, respectively. Validation of the pseudo-second-order model and the Elovich model indicated that chemisorption may be the rate-limiting step. The multi-linearity in the diffusion model was further addressed using multi-site models (the two-site series interface (TSSI) and two-site parallel interface (TSPI) models). Considering the influence of interface properties on the kinetic nature of sorption, the TSSI model resulted in a low mass transfer rate (5% for cassava peel and 10% for rubber tree bark) compared to the TSPI model. The study highlights the usefulness of the two-site sorption model for representing different stages of kinetic sorption simultaneously in order to find the rate-limiting process, compared to separate equilibrium and kinetic modeling attempts.
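
For readers unfamiliar with the models named in this abstract, the sketch below evaluates the standard textbook forms of the Langmuir isotherm and the pseudo-second-order kinetic model. Only the maximum capacity of 79.37 mg/g comes from the abstract; the Langmuir constant `K_L`, the rate constant `K2`, and the function names are hypothetical values chosen for illustration, not fitted parameters from the study.

```python
def langmuir(c_e, q_max, k_l):
    """Equilibrium uptake q_e (mg/g) at equilibrium concentration c_e (mg/L).

    q_e = q_max * K_L * C_e / (1 + K_L * C_e)
    """
    return q_max * k_l * c_e / (1.0 + k_l * c_e)


def pseudo_second_order(t, q_e, k2):
    """Uptake q_t (mg/g) at time t for the pseudo-second-order model.

    q_t = k2 * q_e**2 * t / (1 + k2 * q_e * t)
    """
    return k2 * q_e ** 2 * t / (1.0 + k2 * q_e * t)


Q_MAX = 79.37  # mg/g, cassava peel, as reported in the abstract
K_L = 0.1      # L/mg, hypothetical Langmuir constant
K2 = 0.002     # g/(mg*min), hypothetical rate constant

# At C_e = 1/K_L the Langmuir uptake is half of the maximum capacity.
half_capacity = langmuir(1.0 / K_L, Q_MAX, K_L)

# Uptake after 60 minutes if the equilibrium uptake were Q_MAX.
uptake_60min = pseudo_second_order(60.0, Q_MAX, K2)
```

Both curves saturate: the Langmuir uptake approaches `q_max` as the concentration grows, and the pseudo-second-order uptake approaches `q_e` as time grows.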

Static and Dynamic Fracture Analysis for the Interface Crack of Isotropic-Orthotropic Bimaterial

  • Lee, Kwang-Ho;Arun Shukla;Venkitanarayanan Parameswaran;Vijaya Chalivendra;Hawong, Jae-Sug
    • Journal of Mechanical Science and Technology / v.16 no.2 / pp.165-174 / 2002
  • In the present study, interfacial cracks between an isotropic and an orthotropic material, subjected to static far-field tensile loading, are analyzed using the technique of photoelasticity. The fracture parameters are extracted from the full-field isochromatic data and compared with those obtained using the boundary collocation method. Dynamic photoelasticity combined with high-speed digital photography is employed to capture the isochromatics of propagating interfacial cracks. The normalized stress intensity factors for static cracks are greater when $\alpha = 90^{\circ}$ (fibers perpendicular to the interface) than when $\alpha = 0^{\circ}$ (fibers parallel to the interface), and those for $\alpha = 90^{\circ}$ are similar to those of an isotropic material. The dynamic stress intensity factors for propagating interfacial cracks are greater when $\alpha = 0^{\circ}$ than when $\alpha = 90^{\circ}$. For the velocity range ($0.1 < c/c_{s1} < 0.7$) observed in this study, the complex dynamic stress intensity factor $|K_D|$ increases with crack speed $c$; however, the rate of increase of $|K_D|$ with crack speed is not as drastic as that reported for homogeneous materials.

ASIC Design of OpenRISC-based Multimedia SoC Platform (OpenRISC 기반 멀티미디어 SoC 플랫폼의 ASIC 설계)

  • Kim, Sun-Chul;Ryoo, Kwang-Ki
    • Proceedings of the Korean Institute of Information and Communication Sciences Conference / 2008.10a / pp.281-284 / 2008
  • This paper describes the ASIC design of a multimedia SoC platform. The implemented platform consists of a 32-bit OpenRISC1200 microprocessor, a WISHBONE on-chip bus, a VGA controller, a debug interface, an SRAM interface, and a UART. The 32-bit OpenRISC1200 processor has a 5-stage pipeline and a Harvard architecture with separate instruction and data buses. The VGA controller can display RGB data on a CRT or LCD monitor. The debug interface supports a debugging function for the platform. The SRAM interface supports an 18-bit address bus and a 32-bit data bus. The UART provides the RS232 protocol for serial communication. The platform is designed and verified on a Xilinx Virtex-4 XC4VLX80 FPGA board. Test code is generated by a cross compiler, and JTAG utility software and gdb are used to download the test code to the FPGA board through a parallel cable. Finally, the platform is implemented as a single ASIC chip using the Chartered 0.18 µm process, and it can operate at a clock frequency of 100 MHz.


Soil and ribbed concrete slab interface modeling using large shear box and 3D FEM

  • Qian, Jian-Gu;Gao, Qian;Xue, Jian-feng;Chen, Hong-Wei;Huang, Mao-Song
    • Geomechanics and Engineering / v.12 no.2 / pp.295-312 / 2017
  • Cast-in-situ and grouted concrete helical piles with 150-200 mm diameter half-cylindrical ribs have become an economical and effective choice in Shanghai, China for uplift piles in deep soft soils. Although this type of pile has been used successfully in practice, the reinforcing mechanism and the contribution of the ribs to the total resistance are not clear, and there is no clear guideline for the design of such piles. To study the contribution of the ribs to shear resistance, the shear behaviour between silty sand and concrete slabs with parallel ribs at different spacings and angles was tested in a large direct shear box ($600\,\mathrm{mm} \times 400\,\mathrm{mm} \times 200\,\mathrm{mm}$). The front panels of the shear box are detachable so that the soil deformation can be observed after the test. The tests were modelled with the three-dimensional finite element method in ABAQUS. It was found that passive zones can develop ahead of the ribs to form undulated failure surfaces. The shear resistance and failure mode are affected by the ratio of rib spacing to rib diameter. Based on the shape and continuity of the failure zones at the interface, the failure modes can be classified as "punching", "local" or "general" shear failure. With the inclusion of the ribs, the pull-out resistance can increase by up to 17%. The optimum ratio of rib spacing to rib diameter was found to be around 7, based on the experimental observations and the numerical modelling.

Comparative Investigation of Interfacial Characteristics between HfO2/Al2O3 and Al2O3/HfO2 Dielectrics on AlN/p-Ge Structure

  • Kim, Hogyoung;Yun, Hee Ju;Choi, Seok;Choi, Byung Joon
    • Korean Journal of Materials Research / v.29 no.8 / pp.463-468 / 2019
  • The electrical and interfacial properties of $HfO_2/Al_2O_3$ and $Al_2O_3/HfO_2$ dielectrics on an AlN/p-Ge interface prepared by thermal atomic layer deposition are investigated by capacitance-voltage (C-V) and current-voltage (I-V) measurements. In the C-V measurements, humps related to mid-gap states are observed when the ac frequency is below 100 kHz, revealing lower mid-gap states for the $HfO_2/Al_2O_3$ sample. Higher frequency dispersion in the inversion region is observed for the $Al_2O_3/HfO_2$ sample, indicating the presence of slow interface states. A higher interface trap density, calculated with the high-low frequency method, is observed for the $Al_2O_3/HfO_2$ sample. The parallel conductance method, applied to the accumulation region, shows border traps at 0.3~0.32 eV for the $Al_2O_3/HfO_2$ sample, which are not observed for the $HfO_2/Al_2O_3$ sample. I-V measurements show a reduction in leakage current of about three orders of magnitude for the $HfO_2/Al_2O_3$ sample. Using Fowler-Nordheim emission, the barrier height is calculated and found to be about 1.08 eV for the $HfO_2/Al_2O_3$ sample. Based on these results, it is suggested that $HfO_2/Al_2O_3$ is a better dielectric stack than $Al_2O_3/HfO_2$ on an AlN/p-Ge interface.
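
The abstract names the high-low frequency method without giving details. As a hedged illustration, the sketch below uses the commonly cited textbook form of that method, which estimates the interface trap capacitance from the difference between the low- and high-frequency capacitances measured at the same gate bias. The capacitance values and the function name `dit_high_low` are hypothetical; they are not data from the paper.

```python
Q = 1.602176634e-19  # elementary charge in coulombs


def dit_high_low(c_lf, c_hf, c_ox):
    """Interface trap density D_it in cm^-2 eV^-1.

    All capacitances are per unit area (F/cm^2) at the same gate bias:
    c_lf is the low-frequency value, c_hf the high-frequency value,
    and c_ox the oxide capacitance.

    C_it = C_ox*C_lf/(C_ox - C_lf) - C_ox*C_hf/(C_ox - C_hf)
    D_it = C_it / q
    """
    if not (c_lf < c_ox and c_hf < c_ox):
        raise ValueError("measured capacitances must be below c_ox")
    c_it = c_ox * c_lf / (c_ox - c_lf) - c_ox * c_hf / (c_ox - c_hf)
    return c_it / Q


# Hypothetical values in F/cm^2, chosen only to exercise the formula.
d_it = dit_high_low(c_lf=0.6e-6, c_hf=0.4e-6, c_ox=1.0e-6)
```

A larger gap between the low- and high-frequency curves gives a larger estimated trap density, which matches the abstract's qualitative comparison between the two stacks.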