Search | Korea Science

Performance Analysis and Identifying Characteristics of Processing-in-Memory System with Polyhedral Benchmark Suite (프로세싱 인 메모리 시스템에서의 PolyBench 구동에 대한 동작 성능 및 특성 분석과 고찰)

Jeonggeun Kim
- Journal of the Semiconductor & Display Technology
- /
- v.22 no.3
- /
- pp.142-148
- /
- 2023
In this paper, we identify performance issues in executing compute kernels from PolyBench, which includes compute kernels that are the core computational units of various data-intensive workloads, such as deep learning and data-intensive applications, on Processing-in-Memory (PIM) devices. Therefore, using our in-house simulator, we measured and compared the various performance metrics of workloads based on traditional out-of-order and in-order processors with Processing-in-Memory-based systems. As a result, the PIM-based system improves performance compared to other computing models due to the short-term data reuse characteristic of computational kernels from PolyBench. However, some kernels perform poorly in PIM-based systems without a multi-layer cache hierarchy due to some kernel's long-term data reuse characteristics. Hence, our evaluation and analysis results suggest that further research should consider dynamic and workload pattern adaptive approaches to overcome performance degradation from computational kernels with long-term data reuse characteristics and hidden data locality.
PDF

Analysis of the GPGPU Performance for Various Combinations of Workloads Executed Concurrently (동시에 실행되는 워크로드 조합에 따른 GPGPU 성능 분석)

Kim, Dongwhan;Eom, Hyeonsang
- KIISE Transactions on Computing Practices
- /
- v.23 no.3
- /
- pp.165-170
- /
- 2017
Many studies have utilized GPGPU (General-Purpose Graphic Processing Unit) and its high computing power to compute complex tasks. The characteristics of GPGPU programs necessitate the operations of memory copy between the host and device. A high latency period can affect the performance of the program. Thus, it is required to significantly improve the performance of GPGPU programs by optimizations. By executing multiple GPGPU programs simultaneously, the latency hiding effect of memory copy is achieved by overlapping the memory copy and computing operations in GPGPU. This paper presents the results of analyzing the latency hiding effect for memory copy operations. Furthermore, we propose a performance anticipation model and an algorithm for the limitations of using pinned memory, and show that the use of the proposed algorithm results in a 41% performance increase.
https://doi.org/10.5626/KTCP.2017.23.3.165 인용 KSCI

I/O Translation Layer Technology for High-performance and Compatibility Using New Memory (뉴메모리를 이용한 고성능 및 호환성을 위한 I/O 변환 계층 기술)

Song, Hyunsub;Moon, Young Je;Noh, Sam H.
- Journal of KIISE
- /
- v.42 no.4
- /
- pp.427-433
- /
- 2015
The rapid advancement of computing technology has triggered the need for fast data I/O processing and high-performance storage technology. Next generation memory technology, which we refer to as new memory, is anticipated to be used for high-performance storage as they have excellent characteristics as a storage device with non-volatility and latency close to DRAM. This research proposes NTL (New memory Translation layer) as a technology to make use of new memory as storage. With the addition of NTL, conventional I/O is served with existing mature disk-based file systems providing compatibility, while new memory I/O is serviced through the NTL to take advantage of the byte-addressability feature of new memory. In this paper, we describe the design of NTL and provide experiment measurement results that show that our design will bring performance benefits.
https://doi.org/10.5626/JOK.2015.42.4.427 인용 KSCI

An efficient Storage Reclamation Algorithm for RISC Parallel Processing (RISC 병렬 처리를 위한 기억공간의 효율적인 활용 알고리즘)

이철원;임인칠
- Journal of the Korean Institute of Telematics and Electronics B
- /
- v.28B no.9
- /
- pp.703-711
- /
- 1991
In this paper, an efficient storage reclamation algorithm for RISC parallel processing in the object orented programming environments is presented. The memory management for the dynamic memory allocation and the frequent memory access in object oriented programming is the main factor that decreases RISC parallel processing performance. The proposed algorithm can be efficiently allocated the memory space of RISCy computer which is required the frequent memory access, so it can be increased RISC parallel processing performance. The proposed algorithm is verified the efficiency by implementing C language on SUN SPARC(4.3 BSD UNIX).
PDF

Development and Application of Conducting Shape Memory Polyurethane Actuators (전도성 형상 기억 폴리우레탄 작동기의 개발 및 응용)

Paik, Il-Hyun;Jung, Yong-Chae;Cho, Jae-Hwan;Goo, Nam-Seo
- Proceedings of the Korean Society For Composite Materials Conference
- /
- 2005.04a
- /
- pp.226-230
- /
- 2005
This paper presents the actuation performance of a conducting shape memory polyurethane (CSMPU) actuator. We introduced a concept of shape memory polyurethane activated by electric power while conventional shape memory polyurethanes are activated by external heat source. A conducting shape memory polyurethane actuator was manufactured by adding cabon nano-tube to conventional shape memory polyurethane. The main problem of the previous CSMPU was bad dispersion of carbon nano-tubes in polyurethane. In this paper, we have tried to find manufacturing method to solve the dispersion problem. With a lot of elaborative works, we have developed conducting shape memory polyurethane actuator with good electrical performance. The actuation performance of the developed conducting shape memory polyurethane actuator was measured and assessed.
PDF

A Flash Memory Management Method for Enhancing the Recovery Performance (복구 성능 향상을 위한 플래시 메모리 관리 기법)

Park, Song-Hwa;Lee, Jung-Hoon;Cho, Sung-Woo;Kim, Sang-Hyun
- IEMEK Journal of Embedded Systems and Applications
- /
- v.13 no.5
- /
- pp.235-243
- /
- 2018
NAND flash memory has been widely used for embedded systems as storage device and the flash memory file systems such as JFFS2, YAFFS/YAFFS2 have been adopted by these embedded systems. The flash memory file systems provide the high performance and overcome the limitations of flash memory. However, these file systems don't solve the slow mount time problem when a sudden power failure happens. In this paper, we proposed a flash memory management method for enhancing the recovery performance. The proposed method manages the flash memory block type and stores the block type information at recovery image block. When file operations are occurred, our method stores the file information at the metadata block before and after the file operation. When mounting the flash memory, our method only scans the recovery image blocks and metadata blocks. The proposed method reduces the mount time by seeking the metadata block locations fast by using the recovery image blocks. We implemented the proposed method and evaluation results show that our method reduces the mount time 13 ~ 46 % compared with YAFFS2.
https://doi.org/10.14372/IEMEK.2018.13.5.235 인용 PDF KSCI

Analysis of the Influence of the Conflict Management Policy of the Transactional Memory on the System Performance and Bus Traffic (시스템 성능 및 버스 트래픽에 대한 트랜잭셔널 메모리의 충돌 관리 정책 영향 분석)

Kim, Young-Kyu;Moon, Byungin
- The Journal of Korean Institute of Communications and Information Sciences
- /
- v.37B no.11
- /
- pp.1041-1049
- /
- 2012
The transactional memory was proposed to solve the problems of the conventional lock-based synchronization methods in the shared memory multiprocessor system. Various implementation methods for putting the high performance transactional memory to practical use have been continuously studied. However, these studies focus only on the commercialization and performance enhancement of the transactional memory. Besides, there have been few studies to analyze the system overhead of the transactional memory according to the conflict management policy. Thus this paper classifies hardware transactional memory, which is one kind of transactional memories, into four types according to the conflict management policy, and then compares and analyzes their performance and system bus traffic through their modeling and simulation. In addition, the most effective conflict management policy for the hardware transactional memory is presented through these comparison and analysis.
https://doi.org/10.7840/kics.2012.37B.11.1041 인용 PDF KSCI

A Study on Efficient Use of Dual Data Memory Banks in Flight Control Computers

Cho, Doosan
- International Journal of Internet, Broadcasting and Communication
- /
- v.9 no.1
- /
- pp.29-34
- /
- 2017
Over the past several decades, embedded system and flight control computer technologies have been evolved to meet the diverse needs of the mobile device market. Current embedded systems are at the heart of technologies that can take advantage of small-sized specialized hardware while still providing high-efficiency performance at low cost. One of these key technologies is multiple memory banks. For example, a dual memory bank can provide two times more memory bandwidth in the same memory space. This benefit take lower cost to provide the same bandwidth. However, there is still few software technologies to support the efficient use of multiple memory banks. In this study, we present a technique to efficiently exploit multiple memory banks by software support. Specifically, our technique use an interference graph to optimally allocate data to different memory banks by an optimizing compiler. As a result, the execution time can be improved upto 7% with the proposed technique.
https://doi.org/10.7236/IJIBC.2017.9.1.29 인용 PDF

Location-Aware Hybrid SLC/MLC Management for Compressed Phase-Change Memory Systems (압축 기반 상변화 메모리 시스템에서 저장 위치를 고려한 하이브리드 SLC/MLC 관리 기법)

Park, Jaehyun;Lee, Hyung Gyu
- IEMEK Journal of Embedded Systems and Applications
- /
- v.11 no.2
- /
- pp.107-116
- /
- 2016
Density of Phase-Change Memory (PCM) devices has been doubled through the employment of multi-level cell (MLC) technology. However, this doubled-capacity comes in the expense of severe performance degradation, as compared to the conventional single-level cell (SLC) PCM. This negative effect on the performance of the MLC PCM detracts from the potential benefits of the MLC PCM. This paper introduces an efficient way of minimizing the performance degradation while maximizing the capacity benefits of the MLC PCM. To this end, we propose a location-aware hybrid management of SLC and MLC in compressed PCM main memory systems. Our trace-driven simulations using real application workloads demonstrate that the proposed technique enhances the performance and energy consumption by 45.1% and 46.5%, respectively, on the average, over the conventional technique that only uses a MLC PCM.
https://doi.org/10.14372/IEMEK.2016.11.2.107 인용 PDF KSCI

GPU Memory Management Technique to Improve the Performance of GPGPU Task of Virtual Machines in RPC-Based GPU Virtualization Environments (RPC 기반 GPU 가상화 환경에서 가상머신의 GPGPU 작업 성능 향상을 위한 GPU 메모리 관리 기법)

Kang, Jihun
- KIPS Transactions on Computer and Communication Systems
- /
- v.10 no.5
- /
- pp.123-136
- /
- 2021
RPC (Remote Procedure Call)-based Graphics Processing Unit (GPU) virtualization technology is one of the technologies for sharing GPUs with multiple user virtual machines. However, in a cloud environment, unlike CPU or memory, general GPUs do not provide a resource isolation technology that can limit the resource usage of virtual machines. In particular, in an RPC-based virtualization environment, since GPU tasks executed in each virtual machine are performed in the form of multi-process, the lack of resource isolation technology causes performance degradation due to resource competition. In addition, the GPU memory competition accelerates the performance degradation as the resource demand of the virtual machines increases, and the fairness decreases because it cannot guarantee equal performance between virtual machines. This paper, in the RPC-based GPU virtualization environment, analyzes the performance degradation problem caused by resource contention when the GPU memory requirement of virtual machines exceeds the available GPU memory capacity and proposes a GPU memory management technique to solve this problem. Also, experiments show that the GPU memory management technique proposed in this paper can improve the performance of GPGPU tasks.
https://doi.org/10.3745/KTCCS.2021.10.5.123 인용 PDF KSCI

Search Result 3,126, Processing Time 0.028 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)