Auto-Scaling Techniques for Container Workloads in Kubernetes Clusters

Megha Aggarwal

Keywords:

Kubernetes, autoscaling, containerization, HPA, VPA, Cluster Autoscaler, KEDA, Karpenter, predictive scaling, resource management

Abstract

This paper presents a comparative analysis of techniques for automatically scaling containerized applications in Kubernetes clusters. The methodology is a systematic review and analytical processing of current scientific publications in the field. The work examines the architectural principles, key configuration parameters, and built-in limitations of the traditional tools: the Horizontal Pod Autoscaler (HPA), Vertical Pod Autoscaler (VPA), and Cluster Autoscaler. Particular attention is devoted to advanced solutions designed to make scaling more adaptive and predictable: event-driven scaling with KEDA, efficient node management with Karpenter, and predictive strategies based on machine learning models. The scientific novelty of the study lies in a comparative classification model of autoscaling techniques, which supports clear recommendations for selecting a strategy according to workload type: microservice web applications, big data processing pipelines, or resource-intensive machine learning tasks. The analysis suggests that achieving high performance and resilience calls for combining horizontal, vertical, and cluster scaling, supplemented by heuristic or predictive methods. The findings will be valuable to DevOps engineers, cloud system architects, and researchers focused on optimizing operational performance and resource management in modern distributed environments.

Author Biography

Megha Aggarwal

Software Development Engineer, Amazon AWS, Seattle, WA, USA
