Predictive Model for Cloud Resource Scaling Using Machine Learning Techniques

Kabir Sholagberu Ahmed; Olushola Damilare Odejobi; Theophilus Onyekachukwu Oshoba

doi:10.54660/.IJFMR.2020.1.1.173-183

Predictive Model for Cloud Resource Scaling Using Machine Learning Techniques

Author(s): Kabir Sholagberu Ahmed, Olushola Damilare Odejobi, Theophilus Onyekachukwu Oshoba

Published: 2020

Volume: 1 | Issue: 1 | Pages: 173-183

Subject: Artificial Intelligence

Country: Nigeria

DOI: https://doi.org/10.54660/.IJFMR.2020.1.1.173-183

License: CC BY 4.0

Full Text (PDF)

Open Access - Free to Download

Download Full Article (PDF)

Abstract

The rapid growth of cloud computing has made efficient resource management essential to ensure performance, scalability, and cost-effectiveness. Traditional approaches to cloud resource scaling, such as manual provisioning and rule-based or reactive autoscaling, often fail to meet the dynamic requirements of modern applications. These methods either over-provision resources, resulting in unnecessary costs, or under-provision them, leading to performance degradation and service-level agreement (SLA) violations. To overcome these limitations, this study proposes a Predictive Model for Cloud Resource Scaling Using Machine Learning Techniques that leverages workload forecasting and intelligent decision-making to achieve proactive, adaptive resource management. The proposed model employs a multi-layered architecture. A data collection layer gathers system metrics such as CPU utilization, memory consumption, network throughput, and application latency. These inputs are processed through feature engineering and time-series analysis, enabling the identification of workload patterns and trends. At the core, a machine learning prediction engine—utilizing algorithms such as Long Short-Term Memory (LSTM), Random Forests, and Reinforcement Learning—forecasts resource demand over short- and long-term horizons. Predictions are fed into a decision engine, which applies optimization strategies to determine scaling actions that balance performance, cost efficiency, and SLA compliance. Finally, an execution layer integrates with cloud orchestration tools (e.g., Kubernetes Horizontal Pod Autoscaler, AWS Auto Scaling) to enforce scaling decisions in real time. Key benefits of the model include improved SLA compliance, reduced operational costs, enhanced system responsiveness, and support for multi-resource scaling across hybrid and multi-cloud environments. However, challenges such as prediction accuracy, cold-start scenarios, and integration complexity remain. By shifting cloud resource management from reactive to predictive, this model offers a pathway toward autonomous, intelligent, and sustainable cloud ecosystems capable of adapting to evolving application and workload demands.

How to Cite This Article

Kabir Sholagberu Ahmed, Olushola Damilare Odejobi, Theophilus Onyekachukwu Oshoba (2020). Predictive Model for Cloud Resource Scaling Using Machine Learning Techniques . Journal of Frontiers in Multidisciplinary Research (JFMR), 1(1), 173-183. DOI: https://doi.org/10.54660/.IJFMR.2020.1.1.173-183

Publication Information

Journal: Journal of Frontiers in Multidisciplinary Research (JFMR)

Publisher: Anfo Publication House

ISSN: 3050-9718 (Print), 3050-9726 (Online)

Frequency: Half Yearly

Language: English

Open Access: Yes - This article is distributed under the terms of the Creative Commons Attribution 4.0 International License

Journal of Frontiers in Multidisciplinary Research

Predictive Model for Cloud Resource Scaling Using Machine Learning Techniques

Full Text (PDF)

Abstract

How to Cite This Article

Publication Information

Share This Article:

Company

Useful Links

Follow Us