**Peer Review Journal ** DOI on demand of Author (Charges Apply) ** Fast Review and Publicaton Process ** Free E-Certificate to Each Author

Current Issues
     2026:7/1

Journal of Frontiers in Multidisciplinary Research

ISSN: 3050-9718 (Print) | 3050-9726 (Online) | Impact Factor: 8.10 | Open Access

Predictive Model for Cloud Resource Scaling Using Machine Learning Techniques

Full Text (PDF)

Open Access - Free to Download

Download Full Article (PDF)

Abstract

The rapid growth of cloud computing has made efficient resource management essential to ensure performance, scalability, and cost-effectiveness. Traditional approaches to cloud resource scaling, such as manual provisioning and rule-based or reactive autoscaling, often fail to meet the dynamic requirements of modern applications. These methods either over-provision resources, resulting in unnecessary costs, or under-provision them, leading to performance degradation and service-level agreement (SLA) violations. To overcome these limitations, this study proposes a Predictive Model for Cloud Resource Scaling Using Machine Learning Techniques that leverages workload forecasting and intelligent decision-making to achieve proactive, adaptive resource management. The proposed model employs a multi-layered architecture. A data collection layer gathers system metrics such as CPU utilization, memory consumption, network throughput, and application latency. These inputs are processed through feature engineering and time-series analysis, enabling the identification of workload patterns and trends. At the core, a machine learning prediction engine—utilizing algorithms such as Long Short-Term Memory (LSTM), Random Forests, and Reinforcement Learning—forecasts resource demand over short- and long-term horizons. Predictions are fed into a decision engine, which applies optimization strategies to determine scaling actions that balance performance, cost efficiency, and SLA compliance. Finally, an execution layer integrates with cloud orchestration tools (e.g., Kubernetes Horizontal Pod Autoscaler, AWS Auto Scaling) to enforce scaling decisions in real time. Key benefits of the model include improved SLA compliance, reduced operational costs, enhanced system responsiveness, and support for multi-resource scaling across hybrid and multi-cloud environments. However, challenges such as prediction accuracy, cold-start scenarios, and integration complexity remain. By shifting cloud resource management from reactive to predictive, this model offers a pathway toward autonomous, intelligent, and sustainable cloud ecosystems capable of adapting to evolving application and workload demands.

How to Cite This Article

Kabir Sholagberu Ahmed, Olushola Damilare Odejobi, Theophilus Onyekachukwu Oshoba (2020). Predictive Model for Cloud Resource Scaling Using Machine Learning Techniques . Journal of Frontiers in Multidisciplinary Research (JFMR), 1(1), 173-183. DOI: https://doi.org/10.54660/.IJFMR.2020.1.1.173-183

Share This Article: