AUTOMATIC SCALATION METHODS IN CLOUD ENVIRONMENTS
DOI:
https://doi.org/10.28925/2663-4023.2024.26.715Keywords:
Cloud environments, аutoscaling, machine learningAbstract
This article examines the main approaches to resource scaling in the cloud, as well as the possibilities for automation and integration with machine learning. It describes three types of scaling: vertical, horizontal, and automatic (which combines the first two). Special attention is paid to automatic scaling, which allows the system to dynamically respond in real time. By using pre-configured triggers, you can add or remove resources “on the fly,” ensuring stable and cost-effective application performance. However, incorrectly set triggers or poor monitoring can lead to either an excess or a shortage of resources. The main section focuses on the use of machine learning for load forecasting, such as LSTM models, which can “learn” from historical data, identify long-term patterns, and recognize them. This approach allows you to respond in advance by increasing or decreasing resources before they become excessive or insufficient. In the practical part of the article, using Azure as an example, it shows how to integrate a machine learning model with cloud autoscaling tools to improve resource management and reduce downtime and costs. The conclusion explains that each type of scaling has its pros and cons. Vertical scaling may be best for stable loads and monolithic applications. Horizontal scaling works better for distributed systems with a large number of users. Implementing automatic scaling with load forecasting based on machine learning opens up the possibility of more accurate load predictions and cost-effective use of cloud resources. It requires in-depth knowledge, careful configuration, and continuous data collection, but it enables companies to build flexible, resilient, and economically viable cloud systems.
Downloads
References
Humeniuk, O. V., & Zakharchenko, S. M. (2021). Algorithm for scaling cloud computing resources using thresholds. Vinnytsia National Technical University.
Savchuk, T. O., & Kozachuk, A. V. (2013). Automated decision-making on cloud application scaling. Vinnytsia National Technical University.
Beshlei, G., Seluchenko, M. O., Bodnar, S., Beshlei, M., & Klimash, M. (2024). Development of a platform for researching automatic container scaling and load balancing in distributed systems. Infocommunication technologies and electronic engineering, 4(2), 38–48. https://doi.org/10.23939/ictee2024.02.038
Chieu, T. C., Mohindra, A., Karve, A. A., & Segal, A. (2009). Dynamic Scaling of Web Applications in a Virtualized Cloud Computing Environment. 2009 IEEE International Conference on e-Business Engineering. Доступно: https://ela.kpi.ua/items/35381e27-a6cf-4b32-9a6f-49ede98532d8
Mao, M., Li, J., & Humphrey, M. (2010). Cloud Auto-scaling with Deadline and Budget Constraints. 11th ACM/IEEE International Conference on Grid Computing, 41–48.
Li, K., & Khan, S. U. (2020). Performance-centric resource management in cloud computing. Future Generation Computer Systems, 82, 80–90.
Chaisiri, S., Lee, B. S., & Niyato, D. (2021). Optimization Techniques for Resource Allocation in Cloud Computing. ACM Computing Surveys, 45(4), 57–72.
Best Practices for Scaling Applications in Google Cloud. (б. д.). Google Cloud Documentation. https://cloud.google.com/solutions/best-practices-for-scaling-applications
Autoscale Overview. (б. д.). Microsoft Azure Documentation. https://learn.microsoft.com/en-us/azure/autoscale/autoscale-overview
Kumar, P., & Singhal, M. (2021). Dynamic Scaling in Cloud Environments Using Predictive Analytics. Journal of Cloud Computing, 9(1), 10–17.
Garg, S. K., Versteeg, S., & Buyya, R. (2013). A framework for ranking of cloud computing services. Future Generation Computer Systems, 29(4), 1012–1023.
Lorido-Botran, T., Miguel-Alonso, J., & Lozano, J. A. (2014). A Review of Auto-scaling Techniques for Elastic Applications in Cloud Environments. Journal of Grid Computing, 12, 559–592.
Karpathy, A., Johnson, J., & Fei-Fei, L. (2015). Visualizing and Understanding Recurrent Networks. arXiv preprint arXiv:1506.02078.
The NIST Definition of Cloud Computing. (б. д.). https://csrc.nist.gov/publications/detail/sp/800-145/final
Published
How to Cite
Issue
Section
License
Copyright (c) 2024 Андрій Пазинін

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.