Transfer Learning Technique for Accelerating Learning of Reinforcement Learning-Based Horizontal Pod Autoscaling Policy

Jang, Yonghyeon;Yu, Heonchang;Kim, SungSuk;

doi:10.3745/KTCCS.2022.11.4.105

KIPS Transactions on Computer and Communication Systems (정보처리학회논문지:컴퓨터 및 통신 시스템)

Volume 11 Issue 4
/
Pages.105-112
/
2022
/
2287-5891(pISSN)
/
2734-049X(eISSN)

Korea Information Processing Society (한국정보처리학회)

DOI QR Code

Transfer Learning Technique for Accelerating Learning of Reinforcement Learning-Based Horizontal Pod Autoscaling Policy

강화학습 기반 수평적 파드 오토스케일링 정책의 학습 가속화를 위한 전이학습 기법

장용현 (고려대학교 컴퓨터학과) ;
유헌창 (고려대학교 컴퓨터학과) ;
김성석 (서경대학교 소프트웨어학과)

Received : 2021.11.05
Accepted : 2021.11.22
Published : 2022.04.30

https://doi.org/10.3745/KTCCS.2022.11.4.105 Citation PDF KSCI

Download PDF

⟨ Previous Next ⟩

Abstract

Recently, many studies using reinforcement learning-based autoscaling have been performed to make autoscaling policies that are adaptive to changes in the environment and meet specific purposes. However, training the reinforcement learning-based Horizontal Pod Autoscaler(HPA) policy in a real environment requires a lot of money and time. And it is not practical to retrain the reinforcement learning-based HPA policy from scratch every time in a real environment. In this paper, we implement a reinforcement learning-based HPA in Kubernetes, and propose a transfer leanring technique using a queuing model-based simulation to accelerate the training of a reinforcement learning-based HPA policy. Pre-training using simulation enabled training the policy through simulation experience without consuming time and resources in the real environment, and by using the transfer learning technique, the cost was reduced by about 42.6% compared to the case without transfer learning technique.

최근 환경의 변화에 적응적이고 특정 목적에 부합하는 오토스케일링 정책을 만들기 위해 강화학습 기반 오토스케일링을 사용하는 연구가 많이 이루어지고 있다. 하지만 실제 환경에서 강화학습 기반 수평적 파드 오토스케일러(HPA, Horizontal Pod Autoscaler)의 정책을 학습하기 위해서는 많은 비용과 시간이 요구되며, 서비스를 배포할 때마다 실제 환경에서 강화학습 기반 HPA 정책을 처음부터 다시 학습하는 것은 실용적이지 않다. 본 논문에서는 쿠버네티스에서 강화학습 기반 HPA를 구현하고, 강화학습 기반 HPA 정책에 대한 학습을 가속화하기 위해 대기행렬 모델 기반 시뮬레이션을 활용한 전이 학습 기법을 제안한다. 시뮬레이션을 활용한 사전 학습을 수행함으로써 실제 환경에서 시간과 자원을 소모하며 학습을 수행하지 않아도 시뮬레이션 경험을 통해 정책 학습이 이루어질 수 있도록 하였고, 전이 학습 기법을 사용함으로써 전이 학습 기법을 사용하지 않았을 때보다 약 42.6%의 비용을 절감할 수 있었다.

Keywords

Acknowledgement

이 논문은 정부(과학기술정보통신부, 교육부)의 재원으로 한국연구재단의 지원을 받아 수행된 연구임(No. 2019M3E7A1113102).

References

Horizontal Pod Autoscaler - Kubernetes [Internet], https://kubernetes.io/ko/docs/tasks/run-application/horizontal-pod-autoscale/
S. Horovitz and Y. Arian, "Efficient cloud auto-scaling with SLA objective using Q-learning," In 2018 IEEE 6th International Conference on Future Internet of Things and Cloud (FiCloud), pp.85-92, 2018.
F. Rossi, M. Nardelli, and V. Cardellini, "Horizontal and vertical scaling of container-based applications using reinforcement learning," In 2019 IEEE 12th International Conference on Cloud Computing (CLOUD), pp.329-338, Jul. 2019.
H. Qiu, S. S. Banerjee, S. Jha, Z. T. Kalbarczyk, and R. K. Iyer, "FIRM: An intelligent fine-grained resource management framework for SLO-Oriented microservices," In 14th USENIX Symposium on Operating Systems Design and Implementation (OSDI 20), pp.805-825, 2020.
Y. Gari, D. A. Monge, E. Pacini, C. Mateos, and C. G. Garino, "Reinforcement learning-based application autoscaling in the cloud: A survey," Engineering Applications of Artificial Intelligence, Vol.102, 2021.
R. S. Sutton and A. G. Barto, "Reinforcement learning: An introduction," 2nd ed., Cambridge: MIT press, 2018.
L. Toka, G. Dobreff, B. Fodor, and B. Sonkoly, "Machine learning-based scaling management for kubernetes edge clusters," IEEE Transactions on Network and Service Management, Vol.18, No.1, pp.958-972, 2021. https://doi.org/10.1109/TNSM.2021.3052837
kOps: Kubernetes Operations [Internet], https://kops.sigs.k8s.io/
Kopf: Kubernetes Operators Framework [Internet], https://kopf.readthedocs.io/en/stable/
NASA-HTTP Logs, [Internet], ftp://ita.ee.lbl.gov/html/contrib/NASA-HTTP.html
Gatling [Internet], https://gatling.io/

KIPS Transactions on Computer and Communication Systems (정보처리학회논문지:컴퓨터 및 통신 시스템)

Transfer Learning Technique for Accelerating Learning of Reinforcement Learning-Based Horizontal Pod Autoscaling Policy

강화학습 기반 수평적 파드 오토스케일링 정책의 학습 가속화를 위한 전이학습 기법

Abstract

Keywords

Acknowledgement

References

이메일무단수집거부

이용약관

제 1 장 총칙

제 2 장 이용계약의 체결

제 3 장 계약 당사자의 의무

제 4 장 서비스의 이용

제 5 장 계약 해지 및 이용 제한

제 6 장 손해배상 및 기타사항

Detail Search

Image Search (β)