• Title/Summary/Keyword: Multi-cores

Search Result 148, Processing Time 0.023 seconds

Parallel Cell-Connectivity Information Extraction Algorithm for Ray-casting on Unstructured Grid Data (비정렬 격자에 대한 광선 투사를 위한 셀 사이 연결정보 추출 병렬처리 알고리즘)

  • Lee, Jihun;Kim, Duksu
    • Journal of the Korea Computer Graphics Society
    • /
    • v.26 no.1
    • /
    • pp.17-25
    • /
    • 2020
  • We present a novel multi-core CPU based parallel algorithm for the cell-connectivity information extraction algorithm, which is one of the preprocessing steps for volume rendering of unstructured grid data. We first check the synchronization issues when parallelizing the prior serial algorithm naively. Then, we propose a 3-step parallel algorithm that achieves high parallelization efficiency by removing synchronization in each step. Also, our 3-step algorithm improves the cache utilization efficiency by increasing the spatial locality for the duplicated triangle test process, which is the core operation of building cell-connectivity information. We further improve the efficiency of our parallel algorithm by employing a memory pool for each thread. To check the benefit of our approach, we implemented our method on a system consisting of two octa-core CPUs and measured the performance. As a result, our method shows continuous performance improvement as we add threads. Also, it achieves up to 82.9 times higher performance compared with the prior serial algorithm when we use thirty-two threads (sixteen physical cores). These results demonstrate the high parallelization efficiency and high cache utilization efficiency of our method. Also, it validates the suitability of our algorithm for large-scale unstructured data.

A Numerical Study on the Flow and Heat Transfer Characteristics of Aluminum Pyramidal Truss Core Sandwich (알루미늄 피라미드 트러스 심재 샌드위치의 열유동 특성에 관한 수치해석 연구)

  • Kang, Jong-Su;Kim, Sang-Woo;Lim, Jae-Yong
    • Journal of the Korea Academia-Industrial cooperation Society
    • /
    • v.20 no.3
    • /
    • pp.638-644
    • /
    • 2019
  • In this study, the fluid flow and heat transfer characteristics within sandwich panels are investigated using computational fluid dynamics. Within the sandwich panels having periodic cellular cores, air can freely move inside the core section so that the structure is able to perform multi-functional roles such as simultaneous load bearing and heat dissipation. Thus, there needs to examine the thermal and flow analysis with respect to design variables and various conditions. In this regard, ANSYS Fluent was utilized to explore the flow and heat transfer within the pyramidal truss sandwich structures by varying the truss angle and inlet velocity. Without the entry effect in the first unitcell, the constant rate of pressure and the constant rate of Nusselt number was observed. As a result, it was demonstrated that Nusselt number increases and friction factor decreases as the inlet velocity increases. Moreover, the rate of Nusselt number and friction factor was appreciable in the range of V=1-5m/s due to the transition from laminar to turbulent flow. Regarding the effect of design variable, the variation of truss angle did not significantly influence the characteristics.

Assessment of Parallel Computing Performance of Agisoft Metashape for Orthomosaic Generation (정사모자이크 제작을 위한 Agisoft Metashape의 병렬처리 성능 평가)

  • Han, Soohee;Hong, Chang-Ki
    • Journal of the Korean Society of Surveying, Geodesy, Photogrammetry and Cartography
    • /
    • v.37 no.6
    • /
    • pp.427-434
    • /
    • 2019
  • In the present study, we assessed the parallel computing performance of Agisoft Metashape for orthomosaic generation, which can implement aerial triangulation, generate a three-dimensional point cloud, and make an orthomosaic based on SfM (Structure from Motion) technology. Due to the nature of SfM, most of the time is spent on Align photos, which runs as a relative orientation, and Build dense cloud, which generates a three-dimensional point cloud. Metashape can parallelize the two processes by using multi-cores of CPU (Central Processing Unit) and GPU (Graphics Processing Unit). An orthomosaic was created from large UAV (Unmanned Aerial Vehicle) images by six conditions combined by three parallel methods (CPU only, GPU only, and CPU + GPU) and two operating systems (Windows and Linux). To assess the consistency of the results of the conditions, RMSE (Root Mean Square Error) of aerial triangulation was measured using ground control points which were automatically detected on the images without human intervention. The results of orthomosaic generation from 521 UAV images of 42.2 million pixels showed that the combination of CPU and GPU showed the best performance using the present system, and Linux showed better performance than Windows in all conditions. However, the RMSE values of aerial triangulation revealed a slight difference within an error range among the combinations. Therefore, Metashape seems to leave things to be desired so that the consistency is obtained regardless of parallel methods and operating systems.

Hardware-Software Cosynthesis of Multitask Multicore SoC with Real-Time Constraints (실시간 제약조건을 갖는 다중태스크 다중코어 SoC의 하드웨어-소프트웨어 통합합성)

  • Lee Choon-Seung;Ha Soon-Hoi
    • Journal of KIISE:Computer Systems and Theory
    • /
    • v.33 no.9
    • /
    • pp.592-607
    • /
    • 2006
  • This paper proposes a technique to select processors and hardware IPs and to map the tasks into the selected processing elements, aming to achieve high performance with minimal system cost when multitask applications with real-time constraints are run on a multicore SoC. Such technique is called to 'Hardware-Software Cosynthesis Technique'. A cosynthesis technique was already presented in our early work [1] where we divide the complex cosynthesis problem into three subproblems and conquer each subproblem separately: selection of appropriate processing components, mapping and scheduling of function blocks to the selected processing component, and schedulability analysis. Despite good features, our previous technique has a serious limitation that a task monopolizes the entire system resource to get the minimum schedule length. But in general we may obtain higher performance in multitask multicore system if independent multiple tasks are running concurrently on different processor cores. In this paper, we present two mapping techniques, task mapping avoidance technique(TMA) and task mapping pinning technique(TMP), which are applicable for general cases with diverse operating policies in a multicore environment. We could obtain significant performance improvement for a multimedia real-time application, multi-channel Digital Video Recorder system and for randomly generated multitask graphs obtained from the related works.

Geophysical Imaging of Alluvial Water Table and the underlying Layers of Weathered and Soft Rocks (충적층 지하수면 및 그 하부의 풍화암/연암의 경계면 파악을 위한 복합 지구물리탐사)

  • Ju, Hyeon-Tae;Lee, Chul-Hee;Kim, Ji-Soo
    • The Journal of Engineering Geology
    • /
    • v.25 no.3
    • /
    • pp.349-356
    • /
    • 2015
  • Although geophysical methods are useful and generally provide valuable information about the subsurface, it is important to recognize their limitations. A common limitation is the lack of sufficient contrast in physical properties between different layers. Thus, multiple methods are commonly used to best constrain the physical properties of different layers and interpret each section individually. Ground penetrating radar (GPR) and shallow seismic reflection (SSR) methods, used for shallow and very shallow subsurface imaging, respond to dielectric and velocity contrasts between layers, respectively. In this study, we merged GPR and SSR data from a test site within the Cheongui granitic mass, where the water table is ~3 m deep all year. We interpreted the data in combination with field observations and existing data from drill cores and well logs. GPR and SSR reflections from the tops of the sand layer, water table, and weathered and soft rocks are successfully mapped in a single section, and they correlate well with electrical resistivity data and SPS (suspension PS) well-logging profiles. In addition, subsurface interfaces in the integrated section correlate well with S-wave velocity structures from multi-channel analysis shear wave (MASW) data, a method that was recently developed to enhance lateral resolution on the basis of CMP (common midpoint) cross-correlation (CMPCC) analysis.

A Performance Improvement of Linux TCP/IP Stack based on Flow-Level Parallelism in a Multi-Core System (멀티코어 시스템에서 흐름 수준 병렬처리에 기반한 리눅스 TCP/IP 스택의 성능 개선)

  • Kwon, Hui-Ung;Jung, Hyung-Jin;Kwak, Hu-Keun;Kim, Young-Jong;Chung, Kyu-Sik
    • The KIPS Transactions:PartA
    • /
    • v.16A no.2
    • /
    • pp.113-124
    • /
    • 2009
  • With increasing multicore system, much effort has been put on the performance improvement of its application. Because multicore system has multiple processing devices in one system, its processing power increases compared to the single core system. However in many cases the advantages of multicore can not be exploited fully because the existing software and hardware were designed to be suitable for single core. When the existing software runs on multicore, its performance improvement is limited by the bottleneck of sharing resources and the inefficient use of cache memory on multicore. Therefore, according as the number of core increases, it doesn't show performance improvement and shows performance drop in the worst case. In this paper we propose a method of performance improvement of multicore system by applying Flow-Level Parallelism to the existing TCP/IP network application and operating system. The proposed method sets up the execution environment so that each core unit operates independently as much as possible in network application, TCP/IP stack on operating system, device driver, and network interface. Moreover it distributes network traffics to each core unit through L2 switch. The proposed method allows to minimize the sharing of application data, data structure, socket, device driver, and network interface between each core. Also it allows to minimize the competition among cores to take resources and increase the hit ratio of cache. We implemented the proposed methods with 8 core system and performed experiment. Experimental results show that network access speed and bandwidth increase linearly according to the number of core.

Speed-up Techniques for High-Resolution Grid Data Processing in the Early Warning System for Agrometeorological Disaster (농업기상재해 조기경보시스템에서의 고해상도 격자형 자료의 처리 속도 향상 기법)

  • Park, J.H.;Shin, Y.S.;Kim, S.K.;Kang, W.S.;Han, Y.K.;Kim, J.H.;Kim, D.J.;Kim, S.O.;Shim, K.M.;Park, E.W.
    • Korean Journal of Agricultural and Forest Meteorology
    • /
    • v.19 no.3
    • /
    • pp.153-163
    • /
    • 2017
  • The objective of this study is to enhance the model's speed of estimating weather variables (e.g., minimum/maximum temperature, sunshine hour, PRISM (Parameter-elevation Regression on Independent Slopes Model) based precipitation), which are applied to the Agrometeorological Early Warning System (http://www.agmet.kr). The current process of weather estimation is operated on high-performance multi-core CPUs that have 8 physical cores and 16 logical threads. Nonetheless, the server is not even dedicated to the handling of a single county, indicating that very high overhead is involved in calculating the 10 counties of the Seomjin River Basin. In order to reduce such overhead, several cache and parallelization techniques were used to measure the performance and to check the applicability. Results are as follows: (1) for simple calculations such as Growing Degree Days accumulation, the time required for Input and Output (I/O) is significantly greater than that for calculation, suggesting the need of a technique which reduces disk I/O bottlenecks; (2) when there are many I/O, it is advantageous to distribute them on several servers. However, each server must have a cache for input data so that it does not compete for the same resource; and (3) GPU-based parallel processing method is most suitable for models such as PRISM with large computation loads.

Evaluation of Hydrogeologic Seal Capacity of Mudstone in the Yeongil Group, Pohang Basin, Korea: Focusing on Mercury Intrusion Capillary Pressure Analysis (포항분지 영일층군 이암층의 수리지질학적 차폐능 평가: 수은 모세관 압입 시험의 결과 분석을 중심으로)

  • Kim, Seon-Ok;Wang, Sookyun;Lee, Minhee
    • Economic and Environmental Geology
    • /
    • v.53 no.1
    • /
    • pp.23-32
    • /
    • 2020
  • Geological CO2 sequestration is a global warming response technology to limit atmospheric emissions by injecting CO2 captured on a large scale into deep geological formations. The presented results concern mineralogical and hydrogeological investigations (FE-SEM, XRD, XRF, and MICP) of mudstone samples from drilling cores of the Pohang basin, which is the research area for the first demonstration-scale CO2 storage project in Korea. They aim to identify the mineral properties of the mudstone constituting the caprock and to quantitatively evaluate the hydrogeologic sealing capacity that directly affects the stability and reliability of geological CO2 storage. Mineralogical analysis showed that the mudstone samples are mainly composed of quartz, K-feldspar, plagioclase and a small amount of pyrite, calcite, clay minerals, etc. Mercury intrusion capillary pressure analysis also showed that the samples generally had uniform particle configurations and pore distribution and there was no distinct correlation between the estimated porosity and air permeability. The allowable CO2 column heights based on the estimated pore-entry pressures and breakthrough pressures were found to be significantly higher than the thickness of the targeting CO2 injection layer. These results showed that the mudstone layers in the Yeongil group, Pohang basin, Korea have sufficient sealing capacity to suppress the leakage of CO2 injected during the demonstration-scale CO2 storage project. It should be noticed, however, that the applicability of results and analyses in this study is limited by the lack of available samples. For rigorous assessment of the sealing efficiency for geological CO2 storage operations, significant efforts on collection and multi-aspect evaluation for core samples over entire caprock formations should be accompanied.