AAPA: An Archetype-Aware Predictive Autoscaler with Uncertainty Quantification for Serverless Workloads on Kubernetes

Published: July 8, 2025 | arXiv ID: 2507.05653v3

By: Guilin Zhang, Srinivas Vippagunta, Raghavendra Nandagopal and more

Potential Business Impact:

Helps computers automatically adjust for changing tasks.

Business Areas:

PaaS Software

Serverless platforms such as Kubernetes are increasingly adopted in high-performance computing, yet autoscaling remains challenging under highly dynamic and heterogeneous workloads. Existing approaches often rely on uniform reactive policies or unconditioned predictive models, ignoring both workload semantics and prediction uncertainty. We present AAPA, an archetype-aware predictive autoscaler that classifies workloads into four behavioral patterns -- SPIKE, PERIODIC, RAMP, and STATIONARY -- and applies tailored scaling strategies with confidence-based adjustments. To support reproducible evaluation, we release AAPAset, a weakly labeled dataset of 300,000 Azure Functions workload windows spanning diverse patterns. AAPA reduces SLO violations by up to 50% and lowers latency by 40% compared to Kubernetes HPA, albeit at 2-8x higher resource usage under spike-dominated conditions. To assess trade-offs, we propose the Resource Efficiency Index (REI), a unified metric balancing performance, cost, and scaling smoothness. Our results demonstrate the importance of modeling workload heterogeneity and uncertainty in autoscaling design.
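
The abstract mentions archetype-specific scaling strategies with confidence-based adjustments but does not spell out the rules. As a rough sketch of how such a policy could be structured, the Python below takes an already-classified archetype and a classifier confidence and turns a load forecast into a replica count. The `HEADROOM` multipliers, the `target_replicas` function, its parameters, and the linear blend toward the most conservative multiplier are all illustrative assumptions, not AAPA's published method.

```python
import math
from enum import Enum


class Archetype(Enum):
    """The four behavioral patterns named in the abstract."""
    SPIKE = "SPIKE"
    PERIODIC = "PERIODIC"
    RAMP = "RAMP"
    STATIONARY = "STATIONARY"


# Hypothetical over-provisioning multipliers per archetype.
# These values are assumptions for illustration, not figures from the paper.
HEADROOM = {
    Archetype.SPIKE: 2.0,
    Archetype.RAMP: 1.5,
    Archetype.PERIODIC: 1.25,
    Archetype.STATIONARY: 1.0,
}


def target_replicas(predicted_rps, per_replica_rps, archetype, confidence,
                    min_replicas=1, max_replicas=100):
    """Return a replica count for the forecast load.

    The archetype selects a headroom multiplier. When the classifier is
    unsure (low confidence), the multiplier is blended toward the largest
    one, so uncertainty leads to more conservative provisioning.
    """
    if not 0.0 <= confidence <= 1.0:
        raise ValueError("confidence must be in [0, 1]")
    if per_replica_rps <= 0:
        raise ValueError("per_replica_rps must be positive")

    base = predicted_rps / per_replica_rps
    worst_case = max(HEADROOM.values())
    headroom = confidence * HEADROOM[archetype] + (1.0 - confidence) * worst_case
    wanted = math.ceil(base * headroom)
    # Clamp to the allowed replica range.
    return max(min_replicas, min(max_replicas, wanted))


if __name__ == "__main__":
    for conf in (1.0, 0.5, 0.0):
        n = target_replicas(100, 10, Archetype.STATIONARY, conf)
        print(f"STATIONARY, confidence={conf}: {n} replicas")
```

Blending toward the largest multiplier is one simple way to make low confidence cost resources rather than risk SLO violations, which matches the trade-off the abstract reports (fewer violations at higher resource usage), but other adjustment schemes are equally plausible.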

Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework

Multiagent Systems

Keeps computer systems running even when attacked.

26 May 2025 2

87%

Resilient Auto-Scaling of Microservice Architectures with Efficient Resource Management

Distributed, Parallel, and Cluster Computing

Keeps apps running smoothly during computer problems.

6 Jun 2025 1

86%

An SLO Driven and Cost-Aware Autoscaling Framework for Kubernetes

Software Engineering

Makes computer programs run better and cheaper.

29 Dec 2025 0

Repos / Data Links

github.com

Page Count

7 pages
