Bayesian Ego-graph inference for Networked Multi-Agent Reinforcement Learning
By: Wei Duan, Jie Lu, Junyu Xuan
Potential Business Impact:
Helps large groups of robots or traffic signals learn to coordinate better under limited communication.
In networked multi-agent reinforcement learning (Networked-MARL), decentralized agents must act under local observability and constrained communication over fixed physical graphs. Existing methods often assume static neighborhoods, limiting adaptability to dynamic or heterogeneous environments. While centralized frameworks can learn dynamic graphs, their reliance on global state access and centralized infrastructure is impractical in real-world decentralized systems. We propose a stochastic graph-based policy for Networked-MARL, where each agent conditions its decision on a sampled subgraph over its local physical neighborhood. Building on this formulation, we introduce BayesG, a decentralized actor framework that learns sparse, context-aware interaction structures via Bayesian variational inference. Each agent operates over an ego-graph and samples a latent communication mask to guide message passing and policy computation. The variational distribution is trained end-to-end alongside the policy using an evidence lower bound (ELBO) objective, enabling agents to jointly learn both the interaction topology and decision-making strategies. BayesG outperforms strong MARL baselines on large-scale traffic control tasks with up to 167 agents, demonstrating superior scalability, efficiency, and performance.
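To make the mechanism described in the abstract concrete, below is a minimal sketch of one agent's step: score each edge of its ego-graph, sample a relaxed-Bernoulli communication mask from a variational distribution, aggregate messages only over the sampled subgraph, and return a KL term toward a sparse prior that would enter an ELBO-style objective. The abstract does not specify BayesG's architecture, so the module names, dimensions, Gumbel-style relaxation, and prior are illustrative assumptions, not the authors' implementation.

```python
# Illustrative sketch only: module names, dimensions, the RelaxedBernoulli
# relaxation, and the sparse Bernoulli prior are assumptions for demonstration,
# not BayesG's actual architecture.
import torch
import torch.nn as nn


class EgoGraphMaskPolicy(nn.Module):
    """One agent's policy: sample a latent communication mask over its
    ego-graph, aggregate masked neighbor messages, then compute action logits."""

    def __init__(self, obs_dim, hidden_dim, n_actions, prior_keep_prob=0.2):
        super().__init__()
        self.encoder = nn.Linear(obs_dim, hidden_dim)
        # Variational distribution q(z_ij | o_i, o_j): one Bernoulli logit per edge.
        self.edge_scorer = nn.Linear(2 * hidden_dim, 1)
        self.policy_head = nn.Linear(2 * hidden_dim, n_actions)
        self.prior_keep_prob = prior_keep_prob  # sparse prior p(z_ij)

    def forward(self, own_obs, neighbor_obs, temperature=0.5):
        # own_obs: (obs_dim,), neighbor_obs: (num_neighbors, obs_dim)
        h_i = torch.relu(self.encoder(own_obs))
        h_j = torch.relu(self.encoder(neighbor_obs))
        pair = torch.cat([h_i.expand_as(h_j), h_j], dim=-1)
        logits = self.edge_scorer(pair).squeeze(-1)  # (num_neighbors,)

        # Differentiable (relaxed) Bernoulli sample of the latent mask z,
        # so the sampling step can be trained end-to-end with the policy.
        z = torch.distributions.RelaxedBernoulli(temperature, logits=logits).rsample()

        # Message passing restricted to the sampled subgraph.
        msg = (z.unsqueeze(-1) * h_j).sum(dim=0)
        action_logits = self.policy_head(torch.cat([h_i, msg], dim=-1))

        # KL(q(z) || p(z)) between the hard Bernoulli distributions, used here
        # as the regularization term of an ELBO-style objective; the return /
        # policy-gradient term would come from the surrounding MARL training loop.
        q = torch.distributions.Bernoulli(logits=logits)
        p = torch.distributions.Bernoulli(probs=torch.full_like(logits, self.prior_keep_prob))
        kl = torch.distributions.kl_divergence(q, p).sum()
        return action_logits, kl


# Usage: one agent with 4 physical neighbors samples action logits and a KL penalty.
agent = EgoGraphMaskPolicy(obs_dim=8, hidden_dim=32, n_actions=5)
action_logits, kl = agent(torch.randn(8), torch.randn(4, 8))
```

A low prior keep-probability pushes the learned masks toward sparse subgraphs, which matches the paper's goal of context-aware but communication-efficient interaction structures.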
Similar Papers
Structured Cooperative Multi-Agent Reinforcement Learning: a Bayesian Network Perspective
Multiagent Systems
Helps many robots learn to work together better.
Q-MARL: A quantum-inspired algorithm using neural message passing for large-scale multi-agent reinforcement learning
Machine Learning (CS)
Lets many robots learn to work together.
Local-Canonicalization Equivariant Graph Neural Networks for Sample-Efficient and Generalizable Swarm Robot Control
Robotics
Helps robot teams work together better, even when things change.