Score: 0

Multi-Agent Conditional Diffusion Model with Mean Field Communication as Wireless Resource Allocation Planner

Published: October 27, 2025 | arXiv ID: 2510.22969v1

By: Kechen Meng , Sinuo Zhang , Rongpeng Li and more

Potential Business Impact:

Helps phones share internet better and faster.

Business Areas:

Multi-level Marketing Sales and Marketing

In wireless communication systems, efficient and adaptive resource allocation plays a crucial role in enhancing overall Quality of Service (QoS). While centralized Multi-Agent Reinforcement Learning (MARL) frameworks rely on a central coordinator for policy training and resource scheduling, they suffer from scalability issues and privacy risks. In contrast, the Distributed Training with Decentralized Execution (DTDE) paradigm enables distributed learning and decision-making, but it struggles with non-stationarity and limited inter-agent cooperation, which can severely degrade system performance. To overcome these challenges, we propose the Multi-Agent Conditional Diffusion Model Planner (MA-CDMP) for decentralized communication resource management. Built upon the Model-Based Reinforcement Learning (MBRL) paradigm, MA-CDMP employs Diffusion Models (DMs) to capture environment dynamics and plan future trajectories, while an inverse dynamics model guides action generation, thereby alleviating the sample inefficiency and slow convergence of conventional DTDE methods. Moreover, to approximate large-scale agent interactions, a Mean-Field (MF) mechanism is introduced as an assistance to the classifier in DMs. This design mitigates inter-agent non-stationarity and enhances cooperation with minimal communication overhead in distributed settings. We further theoretically establish an upper bound on the distributional approximation error introduced by the MF-based diffusion generation, guaranteeing convergence stability and reliable modeling of multi-agent stochastic dynamics. Extensive experiments demonstrate that MA-CDMP consistently outperforms existing MARL baselines in terms of average reward and QoS metrics, showcasing its scalability and practicality for real-world wireless network optimization.

Conditional Diffusion Model with OOD Mitigation as High-Dimensional Offline Resource Allocation Planner in Clustered Ad Hoc Networks

Networking and Internet Architecture

Helps computers share internet better and faster.

22 Mar 2025 0

89%

Robust Multi-agent Communication Based on Decentralization-Oriented Adversarial Training

Multiagent Systems

Makes AI teams share information better, even if some signals fail.

30 Apr 2025 1

89%

Multi-Agent Reinforcement Learning for Task Offloading in Wireless Edge Networks

Machine Learning (CS)

Helps robots share resources without talking much.

1 Sep 2025 0

View PDF Login to Bookmark

Country of Origin

🇨🇳 China

Page Count

13 pages

Multi-Agent Conditional Diffusion Model with Mean Field Communication as Wireless Resource Allocation Planner

Helps phones share internet better and faster.

Technical Abstract

Conditional Diffusion Model with OOD Mitigation as High-Dimensional Offline Resource Allocation Planner in Clustered Ad Hoc Networks

Robust Multi-agent Communication Based on Decentralization-Oriented Adversarial Training

Multi-Agent Reinforcement Learning for Task Offloading in Wireless Edge Networks