Score: 0

Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty

Published: December 4, 2025 | arXiv ID: 2512.04918v1

By: Kailiang Liu , Ying Chen , Ralf Borndörfer and more

Potential Business Impact:

Makes hospital surgeries run faster and smoother.

Business Areas:

Scheduling Information Technology, Software

Intraday surgical scheduling is a multi-objective decision problem under uncertainty-balancing elective throughput, urgent and emergency demand, delays, sequence-dependent setups, and overtime. We formulate the problem as a cooperative Markov game and propose a multi-agent reinforcement learning (MARL) framework in which each operating room (OR) is an agent trained with centralized training and decentralized execution. All agents share a policy trained via Proximal Policy Optimization (PPO), which maps rich system states to actions, while a within-epoch sequential assignment protocol constructs conflict-free joint schedules across ORs. A mixed-integer pre-schedule provides reference starting times for electives; we impose type-specific quadratic delay penalties relative to these references and a terminal overtime penalty, yielding a single reward that captures throughput, timeliness, and staff workload. In simulations reflecting a realistic hospital mix (six ORs, eight surgery types, random urgent and emergency arrivals), the learned policy outperforms six rule-based heuristics across seven metrics and three evaluation subsets, and, relative to an ex post MIP oracle, quantifies optimality gaps. Policy analytics reveal interpretable behavior-prioritizing emergencies, batching similar cases to reduce setups, and deferring lower-value electives. We also derive a suboptimality bound for the sequential decomposition under simplifying assumptions. We discuss limitations-including OR homogeneity and the omission of explicit staffing constraints-and outline extensions. Overall, the approach offers a practical, interpretable, and tunable data-driven complement to optimization for real-time OR scheduling.

A Bilevel Approach to Integrated Surgeon Scheduling and Surgery Planning solved via Branch-and-Price

CS and Game Theory

Smarter surgery scheduling saves time and resources.

29 Sep 2025 1

88%

NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment

Machine Learning (CS)

Helps hospitals assign nurses to patients better.

10 Sep 2025 1

88%

A Negotiation-Based Multi-Agent Reinforcement Learning Approach for Dynamic Scheduling of Reconfigurable Manufacturing Systems

Multiagent Systems

Helps factories change what they make super fast.

11 Nov 2025 0

View PDF Login to Bookmark

Country of Origin

🇸🇬 Singapore

Page Count

33 pages

Multi-Agent Reinforcement Learning for Intraday Operating Rooms Scheduling under Uncertainty

Makes hospital surgeries run faster and smoother.

Technical Abstract

A Bilevel Approach to Integrated Surgeon Scheduling and Surgery Planning solved via Branch-and-Price

NurseSchedRL: Attention-Guided Reinforcement Learning for Nurse-Patient Assignment

A Negotiation-Based Multi-Agent Reinforcement Learning Approach for Dynamic Scheduling of Reconfigurable Manufacturing Systems