Score: 0

A Hybrid Reactive-Proactive Auto-scaling Algorithm for SLA-Constrained Edge Computing

Published: December 16, 2025 | arXiv ID: 2512.14290v1

By: Suhrid Gupta, Muhammed Tawfiqul Islam, Rajkumar Buyya

Potential Business Impact:

Keeps apps running smoothly, even with lots of users.

Business Areas:

Cloud Computing Internet Services, Software

Edge computing decentralizes computing resources, allowing for novel applications in domains such as the Internet of Things (IoT) in healthcare and agriculture by reducing latency and improving performance. This decentralization is achieved through the implementation of microservice architectures, which require low latencies to meet stringent service level agreements (SLA) such as performance, reliability, and availability metrics. While cloud computing offers the large data storage and computation resources necessary to handle peak demands, a hybrid cloud and edge environment is required to ensure SLA compliance. This is achieved by sophisticated orchestration strategies such as Kubernetes, which help facilitate resource management. The orchestration strategies alone do not guarantee SLA adherence due to the inherent delay of scaling resources. Existing auto-scaling algorithms have been proposed to address these challenges, but they suffer from performance issues and configuration complexity. In this paper, a novel auto-scaling algorithm is proposed for SLA-constrained edge computing applications. This approach combines a Machine Learning (ML) based proactive auto-scaling algorithm, capable of predicting incoming resource requests to forecast demand, with a reactive autoscaler which considers current resource utilization and SLA constraints for immediate adjustments. The algorithm is integrated into Kubernetes as an extension, and its performance is evaluated through extensive experiments in an edge environment with real applications. The results demonstrate that existing solutions have an SLA violation rate of up to 23%, whereas the proposed hybrid solution outperforms the baselines with an SLA violation rate of only 6%, ensuring stable SLA compliance across various applications.

Proactive and Reactive Autoscaling Techniques for Edge Computing

Distributed, Parallel, and Cluster Computing

Makes apps work fast everywhere, even without internet.

11 Oct 2025 0

89%

Multi-dimensional Autoscaling of Processing Services: A Comparison of Agent-based Methods

Artificial Intelligence

Makes computers work better with less power.

12 Jun 2025 0

88%

Data Scheduling Algorithm for Scalable and Efficient IoT Sensing in Cloud Computing

Distributed, Parallel, and Cluster Computing

Makes smart devices send data faster, cheaper.

6 Aug 2025 0

View PDF Login to Bookmark

Country of Origin

🇦🇺 Australia

Page Count

10 pages

A Hybrid Reactive-Proactive Auto-scaling Algorithm for SLA-Constrained Edge Computing

Keeps apps running smoothly, even with lots of users.

Technical Abstract

Proactive and Reactive Autoscaling Techniques for Edge Computing

Multi-dimensional Autoscaling of Processing Services: A Comparison of Agent-based Methods

Data Scheduling Algorithm for Scalable and Efficient IoT Sensing in Cloud Computing