Search | Korea Science

An Integrated Power Management Framework for WiFi-based Mobile Embedded Systems (WiFi기반 모바일 임베디드 시스템을 위한 통합 전력 제어 기법)

Min Jung-Hi;Cha Ho-Jung
- Journal of KIISE:Computer Systems and Theory
- /
- v.33 no.9
- /
- pp.658-665
- /
- 2006
In these days, the demand of users to extend available period of mobile systems is increased according as the functions of mobile systems have been varied and the use of multimedia application has been increased. This paper proposes an integrated power management framework that considers executed workload types for effective energy management. The conventional methods use DVFS technique for CPU and DPM technique for WNIC separately or simply combine them based on the assumption that they are orthogonal one another. However, the proposed mechanism determines the kind of workload under analysis of the characteristics of workloads incoming through a WNIC. The proposed method can reduce energy consumption of system level effectively by controlling CPU and WNIC to proper power mode based on analyzed characteristics of workload. The experimental result shows the proposed method reduces energy consumption by 9% for BE (Best Effort) workload, CBR (Constant Bit Rate) workload, and Interactive workload on average and by 16% to maximum when compared with the conventional methods which simply combine DVFS technique for CPU and DPM technique for WNIC.
PDF KSCI

An Investigation of the Performance of the Colored Gauss-Seidel Solver on CPU and GPU (Coloring이 적용된 Gauss-Seidel 해법을 통한 CPU와 GPU의 연산 효율에 관한 연구)

Yoon, Jong Seon;Jeon, Byoung Jin;Choi, Hyoung Gwon
- Transactions of the Korean Society of Mechanical Engineers B
- /
- v.41 no.2
- /
- pp.117-124
- /
- 2017
The performance of the colored Gauss-Seidel solver on CPU and GPU was investigated for the two- and three-dimensional heat conduction problems by using different mesh sizes. The heat conduction equation was discretized by the finite difference method and finite element method. The CPU yielded good performance for small problems but deteriorated when the total memory required for computing was larger than the cache memory for large problems. In contrast, the GPU performed better as the mesh size increased because of the latency hiding technique. Further, GPU computation by the colored Gauss-Siedel solver was approximately 7 times that by the single CPU. Furthermore, the colored Gauss-Seidel solver was found to be approximately twice that of the Jacobi solver when parallel computing was conducted on the GPU.
https://doi.org/10.3795/KSME-B.2017.41.2.117 인용 PDF KSCI

Performance Comparison of Join Operations Parallelization by using GPGPU (GPGPU 기반 조인 연산 병렬화 성능 비교)

Lee, Jong-Sub;Lee, Sang-Back;Lee, Kyu-Chul
- Database Research
- /
- v.34 no.3
- /
- pp.28-44
- /
- 2018
In a database system, the most expensive operation among relational operations is a join operation. Generally, CPU-based join operations uses parallel processing with either 1 core or 16 cores at most, which does not significantly improve the function. On the other hand, GPGPU(General-Purpose computing on Graphics Processing Units) allows parallel processing through thousands of processing units, greatly reducing the time required to perform join operations. Parallelization of the operation using GPGPU uses NVIDIA's CUDA SDK. In this paper, we implement parallelization of the join operation using GPGPU and compare the performances. The used join operations are Nested Loop Join (NLJ), Sort Merge Join (SMJ) and Hash Join (HJ), and GPGPU equipment uses TITAN Xp, GTX 1080 Ti and GTX 1080. We measure and compare the performance of join operations based on CPU and GPGPU. We compare this performance with the performance of the previous study on the join operation based on GPGPU. The results of experiment show that the performance based on GPGPU is 6~328 times faster than the one based on CPU.

Grid Computing Application to a three dimensional comprehensive air pollution model (3차원 대기오염 모형에 그리드 컴퓨팅 기술 적응)

조석연;조금원
- Proceedings of the Korea Air Pollution Research Association Conference
- /
- 2002.11a
- /
- pp.101-102
- /
- 2002
과거 50여년간 전산기는 성능과 효율면에서 획기적인 발전을 거듭하여 왔다. 더욱이 최근 20여년간 microcomputer와 PC 분야의 발전으로 저렴한 가격으로 전산자원을 활용할 수 있게 되었다. 그러나, 1개의 CPU로 도달할 수 있는 속도의 한계에 근접함에 따라서, CPU 성능의 향상 속도가 느려지고 있다. 현재 초고성능 전산기로도 많은 시간과 비용이 소요되는 문제가 과학 및 공학 분야에 아직 많이 있는 상황에서 이러한 전산속도 발전의 둔화는 심각한 문제가 되고 있다. (중략)
PDF

PDA-based Text Localization System Using Client/Server Architecture (Client/Server 구조를 이용한 PDA기반의 문자 추출 시스템)

박안진;정기철
- Proceedings of the Korean Information Science Society Conference
- /
- 2004.04b
- /
- pp.751-753
- /
- 2004
PDA에서 사용하는 대부분의 CPU는 실수 연산 구성요소(float computation component)가 없는 정수(integer) CPU를 사용한다. 인공 신경망(neural network)과 같은 실수 연산이 많은 알고리즘은 PDA에서 많은 수행시간을 가진다. 본 논문에서는 이런 단점을 해결하기 위해 무선 랜(LAN)으로 연결된 Client(PDA)/Server(PC) 구조를 이용한 효과적인 문자 추출 시스템을 제안한다. Client(PDA)는 대략적인 문자 추출 결과를 JPEG으로 압축하여 전송속도를 최소화한다. Server(PC)는 Client(PDA)의 결과를 바탕으로 정밀한 문자 영역 추출을 위해, 텍스춰 분류 방법과 연결 성분 분석 방법을 이용한다. 실험에서 제안한 방법은 속도뿐만 아니라 문자 추출에서도 효과적이었다.
PDF

A Study on Time Synchronization Method for Analyzing the Network Performance of Remote Control System (원격운용 시스템의 네트워크 성능분석을 위한 시간동기화 방안에 관한 연구)

Yang, DongWon;Kim, Namgon;Kim, Dojong
- Journal of IKEEE
- /
- v.26 no.2
- /
- pp.141-149
- /
- 2022
With the development of artificial intelligence and unmanned technologies, the remote surveillance/autonomous driving systems have been actively researched. For an effective performance analysis of the developed remote control system, it is important to record the data of it in real time. In addition, in order to analyze the performance between the control system and the remote system, the recorded data from them should be synchronized with time. In this paper we proposed a novel time synchronization method for the remote control system. The proposed remote control system satisfies the time difference of the recorded data within 1 ms, and we can reduce the time difference by using a CPU shielding and affinity setting. The performance of the proposed method was proved through various network data storage experiments. And the experiments confirmed that the proposed method can be applied to recording devices of unmanned ground vehicles and control vehicles. The proposed method will be used as a method for analyzing network data of UGV-R (Unmanned Ground Vehicle - Reconnaissance).
https://doi.org/10.7471/ikeee.2022.26.2.141 인용 PDF KSCI

Improving 3D Measurement Speed using CUDA (CUDA를 이용한 3D 측정 속도 향상)

Kim, Ho-Joong;Cho, Tai-Hoon
- Proceedings of the Korean Institute of Information and Commucation Sciences Conference
- /
- 2017.05a
- /
- pp.331-334
- /
- 2017
Recently, a method using a fringe pattern is widely used for 3D measurements. This is a method of measuring by using a phase value obtained by projecting a pattern to an object to be measured. This method requires many operations such as calculating the phase value and calculating the height. It takes a lot of time depending on the amount of computation. In this paper, we present a method using NVIDIA's CUDA to reduce this time. And we introduce the method of calculating phase value and height. It also shows the exact time difference between the CPU version and the CUDA version. This method is very effective because it can process the same operation in a shorter time.
PDF

Efficient Task Distribution of Pig Monitoring Application using OpenCL (OpenCL을 사용한 돈사 감시 응용의 효율적인 태스크 분배)

Kim, J.;Choi, Y.;Kim, J.;Chung, Y.;Chung, Y.;Park, D.;Kim, H.
- Proceedings of the Korea Information Processing Society Conference
- /
- 2017.04a
- /
- pp.54-57
- /
- 2017
돈사 감시 응용은 내재된 데이터 병렬성을 활용하고 성능가속기를 사용하여 병렬처리가 가능하다. 본 논문에서는 multicore-CPU와 manycore-GPU로 구성된 이기종 컴퓨팅 환경에서 돈사 감시 응용 수행 시 태스크 분배 방법을 제안한다. 즉, 각 태스크별로 OpenCL로 작성된 병렬 프로그램을 deviceCPU와 deviceGPU 각각에서 수행시켜 측정된 수행시간을 기준으로 가장 적합한 처리기를 결정한다. 제안 방법은 간단하지만 매우 효과적이고, CPU와 GPU로 구성된 이기종 컴퓨팅 플랫폼에서 다른 응용을 병렬화하는데에도 적용될 수 있다. 실험 결과, 상이한 이기종 컴퓨팅 플랫폼에서 최적의 태스크 분배로 수행 한 경우가 전체 태스크들을 deviceGPU에서 수행한 방법에 비교하여 각각 2배, 11배 성능 개선이 되었음을 확인하였다.
https://doi.org/10.3745/PKIPS.y2017m04a.54 인용 PDF

Parallel Processing Method on CPU for Image Processing on Mobile Heterogeneous Computing System (모바일 이기종 컴퓨팅 시스템에서 영상처리 고속화를 위한 CPU측 병렬처리 방법)

Beak, Aram;Choi, Haechul
- Proceedings of the Korean Society of Broadcast Engineers Conference
- /
- 2015.07a
- /
- pp.181-182
- /
- 2015
모바일 기기의 보급률과 성능이 급속도로 성장하면서 모바일 기기에서의 비디오 소비 또한 크게 증가하였다. 하지만, 전력과 공간을 줄이기 위해 설계된 모바일 플랫폼은 데스크톱 플랫폼과 비교하여 성능의 한계가 존재한다. 따라서 대용량 비디오 처리를 위해 SIMD 아키텍쳐를 이용하는 임베디드 GPU를 활용하여 이와 같은 한계를 극복하기 위한 고속화 연구가 많이 진행되고 있다. 저장된 데이터를 활용하는 영상처리는 GPU 뿐만 아니라 CPU가 반드시 함께 이용되어야 하며, 모바일 환경에서의 이기종 컴퓨팅 시스템은 프로세서 사이의 낮은 전송속도와 이로 인한 대기시간, 모바일 운영체제가 지원하는 데이터 형태의 필수적인 사용 등의 구조적 단점이 존재한다. 본 논문에서는 임베디드 GPU를 활용한 영상처리 고속화를 위해 임베디드 CPU측에서 병렬처리를 이용하여 앞서 설명한 단점들을 극복하고 실험결과로 모바일 이기종 컴퓨팅 구조에서 임베디드 CPU 활용이 전체적인 연산 효율을 증가시키는 결과를 보였다.
PDF

Exploring GEMM Optimization Techniques for PIM Architecture: A Case Study on UPMEM (PIM 아키텍처를 위한 GEMM 최적화 기법 탐구: UPMEM 사례 연구)

Chan Lee;Heelim Choi;Hanjun Kim
- Proceedings of the Korea Information Processing Society Conference
- /
- 2024.05a
- /
- pp.65-68
- /
- 2024
이 연구는 PIM(Processing-in-Memory) 아키텍처를 활용하여 General Matrix Multiplication(GEMM)의 최적화 기법을 UPMEM PIM 을 통해 탐구한다. 본 연구는 CPU 에서 경험하는 메모리 대역폭의 제한을 극복하고 병렬 처리 구조를 활용함으로써 GEMM 연산에서 PIM 의 잠재적 이점을 확인한다. 또한 연속된 세 개의 행렬 곱셈에 대한 효율성을 평가하고, 데이터 전송 시간이 성능 최적화의 주요병목 현상으로 작용하는 것을 확인한다. CPU 에서 UPMEM 커널로 전송되는 데이터의 양을 한 번에 늘리면서 전송 횟수를 줄이는 방법을 사용하여 CPU 에 비해 성능을 최대 6.57 배 향상시켰다.
https://doi.org/10.3745/PKIPS.y2024m05a.65 인용 PDF

Search Result 518, Processing Time 0.038 seconds

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)