• Title/Summary/Keyword: 자원 스케일링

Search Result 29, Processing Time 0.044 seconds

SLA-based Auto-Scaling for Cloud Collaboration Platform (클라우드 협업 플랫폼을 위한 SLA기반 오토-스케일링 기법)

  • Kim, Ki-Hyun;Jung, In-Yong;Han, Byung-John;Jeong, Chang-Sung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2013.11a
    • /
    • pp.98-99
    • /
    • 2013
  • 가상화 기술의 등장으로 서버를 가상화하여 컴퓨팅 자원을 동적으로 사용할 수 있게 되면서 탄력적인 프로비져닝이 가능해져 기업은 좀 더 유연하게 대처할 수 있게 되었다. 클라우드 컴퓨팅 환경을 이용하는 데 있어 가장 중요한 이슈 중 하나는 자원을 사용함에 있어 실제 필요한 양만큼 할당하여 사용하는 것이다. 현대 응용의 가변적인 작업 부하에 따라 필요한 만큼만 자원을 제공하기 위해 SLA 기반 오토-스케일링 기법을 이용하여 자원 할당의 자동화를 연구하고 있다. 하지만 대부분의 클라우드 협업 플랫폼은 오토-스케일링 기술을 지원하지 않는다. 또한, 대부분의 SLA 기반 오토-스케일링 기법은 웹 응용에서 중요한 평균 응답 시간은 보장하지 않는다. 본 논문에서는 클라우드 환경에서 가변적인 사용자 요청에 따라 플랫폼은 평균 응답 시간을 보장하고 SLA 위반 여부에 따라 추가 자원을 할당하는 오토-스케일링 기법을 제안한다.

Dynamic SLA based Auto-Scaling Engine (동적 SLA 기반 오토-스케일링 엔진)

  • Kim, Ki-Hyun;Kim, Ho-Seung;Son, Moo-yeol;Jeong, Chang-Sung
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2015.04a
    • /
    • pp.69-72
    • /
    • 2015
  • 계산 과학 분야에서 자원을 필요한 만큼 빌려 쓸 수 있는 클라우드 기술을 적용하여 과학 클라우드(Science Cloud)를 구축하는 연구가 활발해지고 있다. 특히 의료 분야에서는 3D 볼륨 렌더링과 갈은 고성능의 자원을 활용한 대규모 작업 계산 응용이 있다. 이러한 응용의 성공적인 작업 수행과 실시간으로 변화하는 자원 수요에 대처하여 클라우드 자원을 효율적으로 관리하기 위한 오토 스케일링 엔진 개발이 필요하다. 그러나 대부분의 오토 스케일링 엔진은 단순한 하드웨어의 성능을 기반으로 제공되고 있어 클라이언트에 따라 부하를 고려해야한다. 본 논문에는 클라이언트에 따라 가중치를 적용한 동적인 SLA 기반으로 자원 수요를 예측하고 클라우드 자원을 효율적으로 관리하는 오토 스케일링 엔진을 제안한다.

Fine Grained Resource Scaling Approach for Virtualized Environment (가상화 환경에서 세밀한 자원 활용률 적용을 위한 스케일 기법)

  • Lee, Donhyuck;Oh, Sangyoon
    • Journal of the Korea Society of Computer and Information
    • /
    • v.18 no.7
    • /
    • pp.11-21
    • /
    • 2013
  • Recently operating a large scale computing resource like a data center becomes easier because of the virtualization technology that virtualize servers and enable flexible resource provision. The most of public cloud services provides automatic scaling in the form of scale-in or scale-out and these scaling approaches works well to satisfy the service level agreement (SLA) of users. However, a novel scaling approach is required to operate private clouds that has smaller amount of computing resources than vast resources of public clouds. In this paper, we propose a hybrid server scaling architecture and related algorithms using both scale-in and scale-out to achieve higher resource utilization rate for private clouds. We uses dynamic resource allocation and live migration to run our proposed algorithm. Our propose system aims to provide a fine-grain resource scaling by steps. Thus private cloud systems are able to keep stable service and to reduce server management cost by optimizing server utilization. The experiment results show that our proposed approach performs better in resource utilization than the scale-out approach based on the number of users.

A Study on the Change of Dental Scaling Experience in Some Areas after Applying Scaling Insurance (스케일링 보험적용에 따른 일부지역의 스케일링 경험 변화 연구)

  • Park, Il-Soon
    • Journal of Digital Convergence
    • /
    • v.15 no.10
    • /
    • pp.387-397
    • /
    • 2017
  • The purpose of this study is to investigate the regional change of dental scaling experience by scaling insurance coverage in July, 2013. The data were used in the "Community Health Survey" of the 2012 and 2014. The results of the study are as follows; 1) The subjective oral health status and brushing of lunch was highest in Gangnam-gu in both 2012 and 2014(p<0.001). 2) Regular dental check-up was high in Gangnam-gu in both 2012 and 2014(p<0.001). 3) The Scaling experience rate increased in all three regions(p<0.001). 4) The socio-demographic characteristics and scaling experience were higher in 2012 and 2014(p<0.001). The scaling experience was higher when there were office workers and spouses(p<0.001). From the policy perspective, it seems necessary to take measures to reduce the gap in scaling experience rate due to differences in income and unequal medical environment.

An Improvement of Kubernetes Auto-Scaling Based on Multivariate Time Series Analysis (다변량 시계열 분석에 기반한 쿠버네티스 오토-스케일링 개선)

  • Kim, Yong Hae;Kim, Young Han
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.11 no.3
    • /
    • pp.73-82
    • /
    • 2022
  • Auto-scaling is one of the most important functions for cloud computing technology. Even if the number of users or service requests is explosively increased or decreased, system resources and service instances can be appropriately expanded or reduced to provide services suitable for the situation and it can improves stability and cost-effectiveness. However, since the policy is performed based on a single metric data at the time of monitoring a specific system resource, there is a problem that the service is already affected or the service instance that is actually needed cannot be managed in detail. To solve this problem, in this paper, we propose a method to predict system resource and service response time using a multivariate time series analysis model and establish an auto-scaling policy based on this. To verify this, implement it as a custom scheduler in the Kubernetes environment and compare it with the Kubernetes default auto-scaling method through experiments. The proposed method utilizes predictive data based on the impact between system resources and response time to preemptively execute auto-scaling for expected situations, thereby securing system stability and providing as much as necessary within the scope of not degrading service quality. It shows results that allow you to manage instances in detail.

Performance Analysis of Container based Autoscaling System (컨테이너 기반 오토스케일링 환경의 성능 분석)

  • Heo, June;Yu, Heonchang
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2018.10a
    • /
    • pp.63-66
    • /
    • 2018
  • 컨테이너 기술은 운영체제 수준 가상화 기술 중 하나로 하드웨어 레벨 가상화 기술에 비해 인스턴스의 빠른 생성 및 종료시킬 수 있는 특성이 있다. 이러한 특성은 직업 부하에 따라 인스턴스의 빠른 생성 및 종료시킬 수 있는 특성이 있다. 이러한 특성은 작업 부하에 따라 인스턴스의 수량을 동적으로 조정하는 오토스케일링 상황에서 유리하게 작용할 수 있다. 본 논문에서는 다수의 노드를 기반으로 구성된 컨테이너 기반의 오토스케일링 환경과 가상머신 기반의 오토스케일링 환경을 성능 측면에서 비교하고 컨테이너 기반 환경에서 자원 할당의 변화가 성능에 주는 영향을 측정 및 분석한다.

Design of the intranet data center system using auto-scaling method based on virtualization environment (오토스케일링 방법을 적용한 가상화 기반 인트라넷 데이터센터 설계)

  • Keum, Nam-Ho
    • Proceedings of the Korea Information Processing Society Conference
    • /
    • 2016.04a
    • /
    • pp.38-41
    • /
    • 2016
  • 최근 IT기술 및 웹기술의 발달로 웹서비스 소요의 폭발적인 증가는 신규 사업 및 사업 확장에 초기 투자비용 증가를 가지고 왔다. 이에 가상화 기술을 바탕으로 초기 투자비용 없이 사용한 만큼만 인프라를 빌려오는 AWS같은 IaaS가 제안되었지만 이는 인터넷이 연결되지 않은 인트라넷이나 이미 구축되어 있는 기존자원 활용이 불가능하다는 점, 그리고 사용량을 미리, 정확하게 예측해야 예산낭비를 줄일 수 있다는 점에서 제한사항이 있다. 본 논문에서는 인터넷이 연결되어 있지 않은 인트라넷에서 기존의 장비들을 모아 가상화 데이터 센터를 구축을 제안 및 오토 스케일링 기법을 적용한 데이터센터 운용 방안을 제시함으로 써 현재 자원의 효율 극대화 방안에 대해 논의한다.

Autoscaling Mechanism based on Execution-times for VNFM in NFV Platforms (NFV 플랫폼에서 VNFM의 실행 시간에 기반한 자동 자원 조정 메커니즘)

  • Mehmood, Asif;Diaz Rivera, Javier;Khan, Talha Ahmed;Song, Wang-Cheol
    • KNOM Review
    • /
    • v.22 no.1
    • /
    • pp.1-10
    • /
    • 2019
  • The process to determine the required number of resources depends on the factors being considered. Autoscaling is one such mechanism that uses a wide range of factors to decide and is a critical process in NFV. As the networks are being shifted onto the cloud after the invention of SDN, we require better resource managers in the future. To solve this problem, we propose a solution that allows the VNFMs to autoscale the system resources depending on the factors such as overhead of hyperthreading, number of requests, execution-times for the virtual network functions. It is a known fact that the hyperthreaded virtual-cores are not fully capable of performing like the physical cores. Also, as there are different types of core having different frequencies so the process to calculate the number of cores needs to be measured accurately and precisely. The platform independency is achieved by proposing another solution in the form of a monitoring microservice, which communicates through APIs. Hence, by the use of our autoscaling application and a monitoring microservice, we enhance the resource provisioning process to meet the criteria of future networks.

A study on Deep Q-Networks based Auto-scaling in NFV Environment (NFV 환경에서의 Deep Q-Networks 기반 오토 스케일링 기술 연구)

  • Lee, Do-Young;Yoo, Jae-Hyoung;Hong, James Won-Ki
    • KNOM Review
    • /
    • v.23 no.2
    • /
    • pp.1-10
    • /
    • 2020
  • Network Function Virtualization (NFV) is a key technology of 5G networks that has the advantage of enabling building and operating networks flexibly. However, NFV can complicate network management because it creates numerous virtual resources that should be managed. In NFV environments, service function chaining (SFC) composed of virtual network functions (VNFs) is widely used to apply a series of network functions to traffic. Therefore, it is required to dynamically allocate the right amount of computing resources or instances to SFC for meeting service requirements. In this paper, we propose Deep Q-Networks (DQN)-based auto-scaling to operate the appropriate number of VNF instances in SFC. The proposed approach not only resizes the number of VNF instances in SFC composed of multi-tier architecture but also selects a tier to be scaled in response to dynamic traffic forwarding through SFC.

Transfer Learning Technique for Accelerating Learning of Reinforcement Learning-Based Horizontal Pod Autoscaling Policy (강화학습 기반 수평적 파드 오토스케일링 정책의 학습 가속화를 위한 전이학습 기법)

  • Jang, Yonghyeon;Yu, Heonchang;Kim, SungSuk
    • KIPS Transactions on Computer and Communication Systems
    • /
    • v.11 no.4
    • /
    • pp.105-112
    • /
    • 2022
  • Recently, many studies using reinforcement learning-based autoscaling have been performed to make autoscaling policies that are adaptive to changes in the environment and meet specific purposes. However, training the reinforcement learning-based Horizontal Pod Autoscaler(HPA) policy in a real environment requires a lot of money and time. And it is not practical to retrain the reinforcement learning-based HPA policy from scratch every time in a real environment. In this paper, we implement a reinforcement learning-based HPA in Kubernetes, and propose a transfer leanring technique using a queuing model-based simulation to accelerate the training of a reinforcement learning-based HPA policy. Pre-training using simulation enabled training the policy through simulation experience without consuming time and resources in the real environment, and by using the transfer learning technique, the cost was reduced by about 42.6% compared to the case without transfer learning technique.