• Title/Summary/Keyword: Multi-Core

Search Result 1,168, Processing Time 0.027 seconds

Implementation of IQ/IDCT in H.264/AVC Decoder Using Mobile Multi-Core GPGPU (모바일 멀티 코어 GP-GPU를 이용한 H.264/AVC 디코더 구현)

  • Kim, Dong-Han;Lee, Kwang-Yeob;Jeong, Jun-Mo
    • Proceedings of the Korean Institute of Information and Commucation Sciences Conference
    • /
    • 2010.10a
    • /
    • pp.321-324
    • /
    • 2010
  • There have been lots of researches on a multi-core processor. The enhancement has been performed through parallelization method. Multi-core architecture in the mobile environment has emerged. But, there is a limit to a mobile CPU's performance. GP-GPU(General-Purpose computing on Graphics Processing Units) can improve performance without adding other dedicated hardware. This paper presents the implementation of Inverse Quantization, Inverse DCT and Color Space Conversion module in H.264/AVC decoder using Multi-Core GP-GPU for a mobile environments. The proposed architecture improves approximately 50% of performance when it use all the features.

  • PDF

Polymerization of dual cured composites by different thickness (두께에 따른 이중 중합형 복합레진의 중합)

  • Kim, Yun-Ju;Jin, Myoung-Uk;Kim, Sung-Kyo;Kwon, Tae-Yub;Kim, Young-Kyung
    • Proceedings of the KACD Conference
    • /
    • 2008.05a
    • /
    • pp.169-176
    • /
    • 2008
  • The purpose of this study was to evaluate the effect of thickness, filling methods and curing methods on the polymerization of dual cured core materials by means of microhardness test. Two dual cured core materials, MultiCore Flow (Ivoclar Vivadent AG, Schaan, Liechtenstein) and Bis-Core (Bisco Inc., Schaumburg, IL, USA) were used in this study. 2 mm (bulky filled), 4 mm (bulky filled), 6 mm (bulky and incrementally filled) and 8 mm (bulky and incrementally filled)-thickness specimens were prepared with light cure or self cure mode. After storage at $37^{\circ}C$ for 24 hours, the Knoop hardness values (KHN) of top and bottom surfaces were measured and the microhardness ratio of top and bottom surfaces was calculated. The data were analyzed using one-way ANOVA and Scheffe multiple comparison test, with ${\alpha}=0.05$. The effect of thickness on the polymerization of dual cured composites showed material specific results. In 2, 4 and 6 mm groups, the KHN of two materials were not affected by thickness. However, in 8 mm group of MultiCore Flow, the KHN of the bottom surface was lower than those of other groups (p < 0.05). The effect of filling methods on the polymerization of dual cured composites was different by their thickness or materials. In 6 mm thickness, there was no significant difference between bulk and incremental filling groups. In 8 mm thickness, Bis-Core showed no significant difference between groups. However, in MultiCore Flow, the microhardness ratio of bulk filling group was lower than that of incremental filling group (p < 0.05). The effect of curing methods on the polymerization of dual cured composites showed material specific results. In Bis-Core, the KHN of dual cured group were higher than those of self cured group at both surfaces (p < 0.05). However, in MultiCore Flow, the results were not similar at both surfaces. At the top surface, dual cured group showed higher KHN than that of self cured group (p < 0.05). However, in the bottom surface, dual cured group showed lower value than that of self cured group (p < 0.05).

  • PDF

A Study of Trace-driven Simulation for Multi-core Processor Architectures (멀티코어 프로세서의 명령어 자취형 모의실험에 대한 연구)

  • Lee, Jong-Bok
    • The Journal of the Institute of Internet, Broadcasting and Communication
    • /
    • v.12 no.3
    • /
    • pp.9-13
    • /
    • 2012
  • In order to overcome the complexity and power problems of superscalar processors, the multi-core architecture has been prevalent recently. Although the execution-driven simulation is wide spread, the trace-driven simulation has speed advantages over the execution-driven simulation. We present a methodology to simulate multi-core architecture using trace-driven simulator. Using SPEC 2000 benchmarks as input, the trace-driven simulation has been performed for the cores ranging from 2 to 16 extensively. As a result, the 16-core processor resulted in 4.1 IPC and 13.3 times speed up over single-core processor on the average.

Load Unbalancing Scheduling Method for Energy-Efficient Multi-core Embedded Systems (에너지 효율적인 멀티코어 임베디드 시스템을 위한 부하 불균형 스케줄링 방법)

  • Choi, YoungJin
    • IEMEK Journal of Embedded Systems and Applications
    • /
    • v.11 no.1
    • /
    • pp.1-8
    • /
    • 2016
  • We proposed a load unbalancing scheduling method for energy-efficient multi-core embedded systems considering DVFS (Dynamic Voltage/Frequency Scaling) power consumption and task characteristics. It is a new kind of scheduler which combines load balancing and load unbalancing technique. The purpose of the method is to effectively utilize energy without much effect in performance. In this paper, we conduct experiments on energy consumption and performance using the previous load balancing and unbalancing techniques and our proposed technique. The proposed technique reduced energy consumption more than 13.7% when compared to other algorithms. As a result, the proposed technique shows low energy consumption without much decline in the performance and is adequate for energy-efficient multi-core embedded systems.

A Technique for Fast Process Creation Based on Creation Location

  • Kim, Byung-Jin;Ahn, Young-Ho;Chung, Ki-Seok
    • Journal of Computing Science and Engineering
    • /
    • v.5 no.4
    • /
    • pp.283-287
    • /
    • 2011
  • Due to the proliferation of software parallelization on multi-core CPUs, the number of concurrently executing processes is rapidly increasing. Unlike processes running in a server environment, those executing in a multi-core desktop or a multi-core mobile platform have various correlations. Therefore, it is crucial to consider correlations among concurrently running processes. In this paper, we exploit the property that for a given created location in the binary image of the parent process, the average running time of child processes residing in the run-queue differs. We claim that this property can be exploited to improve the overall system performance by running processes that have a relatively short running time before those with a longer running time. Experimental results verified that the running time was actually improved by 11%.

Bounding Worst-Case Performance for Multi-Core Processors with Shared L2 Instruction Caches

  • Yan, Jun;Zhang, Wei
    • Journal of Computing Science and Engineering
    • /
    • v.5 no.1
    • /
    • pp.1-18
    • /
    • 2011
  • As the first step toward real-time multi-core computing, this paper presents a novel approach to bounding the worst-case performance for threads running on multi-core processors with shared L2 instruction caches. The idea of our approach is to compute the worst-case instruction access interferences between different threads based on the program control flow information of each thread, which can be statically analyzed. Our experiments indicate that the proposed approach can reasonably estimate the worst-case shared L2 instruction cache misses by considering the inter-thread instruction conflicts. Also, the worst-case execution time (WCET) of applications running on multi-core processors estimated by our approach is much better than the estimation by simply assuming all L2 instruction accesses are misses.

Mechanical behaviors of multi-layered foam core sandwich composite (다층 구조 폼 코아 샌드위치 복합재의 기계적 거동 연구)

  • Oh J.O.;Yoon S.H.
    • Proceedings of the Korean Society of Precision Engineering Conference
    • /
    • 2006.05a
    • /
    • pp.381-382
    • /
    • 2006
  • The mechanical behaviors of multi-layered foam core sandwich composite were investigated through a 3-point bending test. The sandwich specimens were obtained from sandwich panel consisting of aluminum faces and urethane foam core. Three types of sandwich specimens such as a single structure, a double structure and a triple structure were considered. The span of sandwich specimens were varied from 170mm to 350mm. According to the results, the flexural and shear properties of multi-layered sandwich composite were found to be higher than those of single-layered sandwich composite.

  • PDF

TBBench: A Micro-Benchmark Suite for Intel Threading Building Blocks

  • Marowka, Ami
    • Journal of Information Processing Systems
    • /
    • v.8 no.2
    • /
    • pp.331-346
    • /
    • 2012
  • Task-based programming is becoming the state-of-the-art method of choice for extracting the desired performance from multi-core chips. It expresses a program in terms of lightweight logical tasks rather than heavyweight threads. Intel Threading Building Blocks (TBB) is a task-based parallel programming paradigm for multi-core processors. The performance gain of this paradigm depends to a great extent on the efficiency of its parallel constructs. The parallel overheads incurred by parallel constructs determine the ability for creating large-scale parallel programs, especially in the case of fine-grain parallelism. This paper presents a study of TBB parallelization overheads. For this purpose, a TBB micro-benchmarks suite called TBBench has been developed. We use TBBench to evaluate the parallelization overheads of TBB on different multi-core machines and different compilers. We report in detail in this paper on the relative overheads and analyze the running results.

Tile Level Rate Control for High Efficiency Video Coding (HEVC) on Multi-core Platform

  • Marzuki, Ismail;Ahn, Yong-Jo;Sim, Donggyu
    • Proceedings of the Korean Society of Broadcast Engineers Conference
    • /
    • 2015.11a
    • /
    • pp.147-148
    • /
    • 2015
  • This paper proposes a tile level rate control for High Efficiency Video Coding (HEVC). The proposed tile level rate control is designed by considering the multi-core platform of tile in HEVC. The proposed tile level rate control allocates the number of bits for each tile based on the predetermined weight generated from the current picture level rate control. According to the experimental results, the proposed tile level rate control for HEVC on multi-core platform loses negligibly the bitrate accuracy about 0.07% on average over the reference software HM-14.0.

  • PDF

Development of Runner System for Filling Balance in Multi Cavity Injection Mold (다수 캐비티 사출금형에서 균형 충전용 러너 시스템 개발)

  • Jeong Y. D.
    • Proceedings of the Korean Society for Technology of Plasticity Conference
    • /
    • 2005.09a
    • /
    • pp.13-16
    • /
    • 2005
  • For mass production, usually injection mold has multi-cavity which is filled through geometrical balanced runner system. Despite geometrical balanced runner system, filling imbalances between cavity to cavity have always been observed. These filing imbalances are one of the most significant factors to affect quality of plastic parts when molding plastic parts in multi-cavity injection mold. Filling imbalances are results from non-symmetrical shear rate distribution within melt as it flows through the runner system. It has been possible to decrease filling imbalance by optimizing processing conditions, but it has not completely eliminated this phenomenon during injection molding processing. This paper presents a solution of these filling imbalances through using 'runner core pin'. The runner core pin which is developed in this study creates a symmetrical shear distribution within runner. As a result of using runner core pin, a remarkable improvement in reducing filling imbalance was confirmed.

  • PDF