
A Reinforcement Learning-Driven Task Scheduling Algorithm for Multi-Tenant Distributed Systems

Published: August 11, 2025 | arXiv ID: 2508.08525v1

By: Xiaopei Zhang, Xingang Wang, Xin Wang

Potential Business Impact:

Makes computers share resources fairly and fast.

This paper addresses key challenges in task scheduling for multi-tenant distributed systems, including dynamic resource variation, heterogeneous tenant demands, and fairness assurance. An adaptive scheduling method based on reinforcement learning is proposed. By modeling the scheduling process as a Markov decision process, the study defines the state space, action space, and reward function. A scheduling policy learning framework is designed using Proximal Policy Optimization (PPO) as the core algorithm. This enables dynamic perception of complex system states and real-time decision-making. Under a multi-objective reward mechanism, the scheduler jointly optimizes task latency, resource utilization, and tenant fairness. The coordination between the policy network and the value network continuously refines the scheduling strategy. This enhances overall system performance. To validate the effectiveness of the proposed method, a series of experiments were conducted in multi-scenario environments built using a real-world public dataset. The experiments evaluated task latency control, resource efficiency, policy stability, and fairness. The results show that the proposed method outperforms existing scheduling approaches across multiple evaluation metrics. It demonstrates strong stability and generalization ability. The proposed scheduling framework provides practical and engineering value in policy design, dynamic resource modeling, and multi-tenant service assurance. It effectively improves scheduling efficiency and resource management in distributed systems under complex conditions.
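The abstract names a multi-objective reward (task latency, resource utilization, tenant fairness) and PPO as the learning algorithm, but it gives no formulas. The sketch below is one plausible reading, not the authors' implementation. The weighted-sum form, the `RewardWeights` values, the `latency_budget` normalization, and the use of Jain's fairness index are all assumptions made for illustration. Only the clipped surrogate term follows the standard PPO definition.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class RewardWeights:
    # Hypothetical trade-off coefficients; the abstract does not give values.
    latency: float = 1.0
    utilization: float = 1.0
    fairness: float = 1.0


def jain_fairness(allocations: Sequence[float]) -> float:
    """Jain's index: 1/n when one tenant holds everything, 1.0 when all are equal.

    Assumption: the paper's fairness measure is not stated; this is a common stand-in.
    """
    n = len(allocations)
    sum_sq = sum(a * a for a in allocations)
    if n == 0 or sum_sq == 0.0:
        return 1.0  # nothing allocated: treat as trivially fair
    total = sum(allocations)
    return (total * total) / (n * sum_sq)


def scheduling_reward(
    mean_latency: float,
    latency_budget: float,
    utilization: float,
    tenant_allocations: Sequence[float],
    weights: RewardWeights = RewardWeights(),
) -> float:
    """Scalar reward: penalize latency, reward utilization and fairness."""
    # Normalize latency against an assumed budget and cap the penalty at 1.
    latency_penalty = min(mean_latency / latency_budget, 1.0)
    return (
        -weights.latency * latency_penalty
        + weights.utilization * utilization
        + weights.fairness * jain_fairness(tenant_allocations)
    )


def ppo_clip_term(ratio: float, advantage: float, eps: float = 0.2) -> float:
    """Standard PPO clipped surrogate for one sample.

    ratio is pi_new(a|s) / pi_old(a|s); eps = 0.2 is a common default, assumed here.
    """
    clipped_ratio = max(1.0 - eps, min(ratio, 1.0 + eps))
    return min(ratio * advantage, clipped_ratio * advantage)
```

In a scheduler of this kind, `scheduling_reward` would be computed once per decision step from observed cluster metrics, and `ppo_clip_term` would be averaged over a batch of steps to form the policy loss that the policy and value networks are trained against.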

Decentralized Distributed Proximal Policy Optimization (DD-PPO) for High Performance Computing Scheduling on Multi-User Systems

Distributed, Parallel, and Cluster Computing

Makes supercomputers run jobs faster and better.

6 May 2025

90%

Multi-Agent Reinforcement Learning for Adaptive Resource Orchestration in Cloud-Native Clusters

Machine Learning (CS)

Makes computer databases run faster and smoother.

14 Aug 2025

89%

Multi-Agent Reinforcement Learning Scheduling to Support Low Latency in Teleoperated Driving

Networking and Internet Architecture

Makes self-driving cars react faster to avoid crashes.

6 May 2025


Page Count: 5 pages
