Browse > Article
http://dx.doi.org/10.3745/KTCCS.2020.9.9.189

GPGPU Task Management Technique to Mitigate Performance Degradation of Virtual Machines due to GPU Operation in Cloud Environments  

Kang, Jihun (고려대학교 정보창의교육연구소)
Gil, Joon-Min (대구가톨릭대학교 컴퓨터소프트웨어학부)
Publication Information
KIPS Transactions on Computer and Communication Systems / v.9, no.9, 2020 , pp. 189-196 More about this Journal
Abstract
Recently, GPU cloud computing technology applying GPU(Graphics Processing Unit) devices to virtual machines is widely used in the cloud environment. In a cloud environment, GPU devices assigned to virtual machines can perform operations faster than CPUs through massively parallel processing, which can provide many benefits when operating high-performance computing services in a variety of fields in a cloud environment. In a cloud environment, a GPU device can help improve the performance of a virtual machine, but the virtual machine scheduler, which is based on the CPU usage time of a virtual machine, does not take into account GPU device usage time, affecting the performance of other virtual machines. In this paper, we test and analyze the performance degradation of other virtual machines due to the virtual machine that performs GPGPU(General-Purpose computing on Graphics Processing Units) task in the direct path based GPU virtualization environment, which is often used when assigning GPUs to virtual machines in cloud environments. Then to solve this problem, we propose a GPGPU task management method for a virtual machine.
Keywords
GPU Virtualization; Performance Isolation; Scheduling; Cloud; GPGPU Task Management;
Citations & Related Records
연도 인용수 순위
  • Reference
1 Amazon, Amazon EC2 Instance Types [Internet]. https://aws.amazon.com/ec2/instanc e-types/?nc1=f_ls.
2 Alibaba Cloud, Elastic GPU Service [Internet], https://hpc.aliyun.com/product/gpu_bare_metal.
3 P. Barham, B. Dragovic, K. Fraser, S. Hand, T. Harris, A. Ho, R. Neugebauer, I. Pratt and A. Warfield, "Xen and the art of virtualization," In Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles, SOSP '03. ACM: New York, NY, USA, 2003, pp.164-177.
4 Xen Project, Credit Scheduler [Internet], https://wiki.xen.org/wiki/Credit_Scheduler.
5 AMD, OpenCL: Open Computing Language [Internet], https://www.khron os.org/opencl/.
6 nVidia, CUDA: Compute Unified Device Architecture [Internet], http://www.nvidia.com/object/cuda_home_new. html.
7 Xen Project, VGA Passthrough [Internet], https://wiki.xen.org/wiki/Xen_VGA_ Passthrough.
8 D. Abramson, J. Jackson, S. Muthrasanallur, G. Neiger,G. Regnier, R. Sankaran, I. Schoinas, R. Uhlig, B. Vembu, and J. Wiegert, "Intel virtualization technology for directed I/O," Intel Technology Journal, 2006.
9 W. Hwu and D. Kirk, "Programming Massively Parallel Processors: A Hands-On Approach," Morgan Kaufmann, 2010, pp.20-24.
10 L. Shi, H. Chen, J. Sun, and K. Li, "vCUDA: GPU-accelerated high-performance computing in virtual machines," IEEE Transactions on Computers, Vol.61, No.6, pp.804-816, 2012.   DOI
11 Y. Suzuki, S. Kato, H. Yamada, and K. Kono, "Gpuvm: Gpu virtualization at the hypervisor," IEEE Transactions on Computers, Vol.65, No.9, pp.2752-2766, 2016.   DOI
12 K. Tian, Y. Dong, and D. Cowperthwaite, "A Full GPU Virtualization Solution with Mediated Pass-Through," USENIX Annual Technical Conference, pp.121-132, 2014.
13 A. A. Sani, K. Boos, S. Qin and L. Zhong, "I/o paravirtualization at the device file boundary," In ACM SIGPLAN Notices, Vol.49, No.4, pp.319-332, 2014.   DOI
14 Y. Zhang, P. Qu, J. Cihang, and W. Zheng, "A cloud gaming system based on user-level virtualization and its resource scheduling," IEEE Transactions on Parallel and Distributed Systems, Vol.27, No.5, pp.1239-1252, 2016.   DOI