상세 보기
- Jeong, Byeonghui;
- Baek, Seungyeon;
- Park, Sihyun;
- Jeon, Jueun;
- Jeong, Young-Sik
WEB OF SCIENCE
32SCOPUS
46초록
Resource management autoscaling in a cloud computing service guarantees the high availability and extensibility of applications and services. Horizontal pod autoscaling (HPA) does not affect the executed tasks but also has the disadvantage that it cannot provide immediate scaling. Furthermore, scale down is not possible if excess resources are allocated, because it is difficult to identify the amount of resources required for applications and services; thus resources are wasted. Therefore, this study proposes Proactive Hybrid Pod Autoscaling (ProHPA), which immediately responds to irregular workloads and reduces resource overallocation. ProHPA uses a bidirectional long short-term memory (Bi-LSTM) model applied with an attention mechanism for forecasting future CPU and memory usage that has similar or different patterns. Reducing excessive resource usage with vertical pod autoscaling (ReVPA) adjusts the overallocation of resources within a pod by forecasted resource usage. Lastly, prevention overload with HPA (PoHPA) immediately performs resource scaling by using forecasted resource usage and pod information. When the performance of ProHPA was evaluated, CPU and memory average utilization were improved by 23.39% and 42.52%, respectively, compared with conventional HPA when initial resources were overallocated. In addition, ProHPA did not exhibit overload compared to conventional HPA when resources are insufficiently allocated.(c) 2022 Elsevier B.V. All rights reserved.
키워드
- 제목
- Stable and efficient resource management using deep neural network on cloud computing
- 저자
- Jeong, Byeonghui; Baek, Seungyeon; Park, Sihyun; Jeon, Jueun; Jeong, Young-Sik
- 발행일
- 2023-02
- 유형
- Article
- 저널명
- Neurocomputing
- 권
- 521
- 페이지
- 99 ~ 112