Proactive Resource Autoscaling Scheme Based on SCINet for High-Performance Cloud Computing
Citations
Web of Science: 24
Scopus: 26

Abstract

Container resource autoscaling provides scalability to cloud services built on a microservice architecture in cloud-native computing environments. However, because dynamic loads arrive in varied workload patterns, scaling is often delayed, which reduces service efficiency. Furthermore, estimating an efficient resource size for a workload is difficult, which leads to both resource waste and overload. This study therefore proposes high-performance resource management (HiPerRM), which manages container resources stably and elastically to preserve service scalability and efficiency even under rapidly changing dynamic loads. HiPerRM forecasts future workloads using a sample convolution and interaction network (SCINet) model combined with reversible instance normalization (RevIN). Based on the forecasted CPU and memory usage, HiPerRM generates an elastically sized resource request and then adjusts the pod's resource request and the number of replicas via HiPerRM's VPA (Hi-VPA) and HiPerRM's HPA (Hi-HPA). In the evaluation, HiPerRM improved average resource utilization by approximately 3.96–34.06% compared with conventional autoscaling techniques, even when the resource size was estimated incorrectly for various workloads, and it incurred relatively fewer overloads.
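
The pipeline described in the abstract (normalize a usage window, forecast, denormalize, then size the request and replica count) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The forecaster is a hypothetical stand-in for SCINet, the normalization omits the learnable affine parameters that RevIN usually includes, and the `headroom` and `max_request_per_pod` parameters and the sizing rule in `plan_resources` are assumptions made only for illustration.

```python
import math
from statistics import fmean, pstdev

EPS = 1e-5  # small variance floor so a constant window does not divide by zero


def revin_normalize(window):
    """Normalize one input window by its own mean and standard deviation."""
    mean = fmean(window)
    std = math.sqrt(pstdev(window) ** 2 + EPS)
    return [(x - mean) / std for x in window], mean, std


def revin_denormalize(values, mean, std):
    """Map forecaster outputs back to the original scale of the window."""
    return [v * std + mean for v in values]


def placeholder_forecast(normalized, horizon):
    """Hypothetical stand-in for SCINet: repeat the mean of the last three points."""
    level = fmean(normalized[-3:])
    return [level] * horizon


def plan_resources(cpu_window, horizon=3, headroom=0.2, max_request_per_pod=500.0):
    """Return (cpu_request_millicores, replicas) sized for the forecast peak.

    The sizing rule is an assumption: add headroom to the peak, then split it
    across as few pods as the per-pod cap allows.
    """
    normalized, mean, std = revin_normalize(cpu_window)
    forecast = revin_denormalize(placeholder_forecast(normalized, horizon), mean, std)
    peak = max(forecast) * (1.0 + headroom)
    replicas = max(1, math.ceil(peak / max_request_per_pod))  # horizontal step
    request = math.ceil(peak / replicas)                      # vertical step
    return request, replicas


if __name__ == "__main__":
    usage_millicores = [300, 400, 500, 600, 700, 800]
    print(plan_resources(usage_millicores))
```

In this sketch the replica count plays the role the abstract assigns to Hi-HPA and the per-pod request plays the role of Hi-VPA. The actual HiPerRM sizing rules are not given in the abstract and may differ.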

Keywords

Cloud computing; Container resource autoscaling; Containers; Measurement; Microservice architectures; Predictive models; Resource management; Scalability; Time-series forecasting
Title
Proactive Resource Autoscaling Scheme Based on SCINet for High-Performance Cloud Computing
Authors
Jeong, Byeonghui; Jeon, Jueun; Jeong, Young-Sik
DOI
10.1109/TCC.2023.3292378
Publication date
2023-10
Type
Article
Journal
IEEE Transactions on Cloud Computing, vol. 11, no. 4
Pages
3497–3509