Empowering Multi-Robot Cooperation via Sequential World Models
By: Zijie Zhao, Honglei Guo, Shengqian Chen and more
Potential Business Impact:
Robots learn to work together by sharing plans.
Model-based reinforcement learning (MBRL) has shown significant potential in robotics due to its high sample efficiency and planning capability. However, extending MBRL to multi-robot cooperation remains challenging due to the complexity of joint dynamics. To address this, we propose the Sequential World Model (SeqWM), a novel framework that integrates the sequential paradigm into model-based multi-agent reinforcement learning. SeqWM employs independent, sequentially structured agent-wise world models to decompose complex joint dynamics. Latent rollouts and decision-making are performed through sequential communication, where each agent generates its future trajectory and plans its actions based on the predictions of its predecessors. This design enables explicit intention sharing, which enhances cooperative performance, and reduces communication overhead to linear complexity. Results in challenging simulated environments (Bi-DexHands and Multi-Quad) show that SeqWM outperforms existing state-of-the-art model-free and model-based baselines in both overall performance and sample efficiency, while exhibiting advanced cooperative behaviors such as predictive adaptation and role division. Furthermore, SeqWM has been successfully deployed on physical quadruped robots, demonstrating its effectiveness in real-world multi-robot systems. Demos and code are available at: https://github.com/zhaozijie2022/seqwm-marl
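The sequential planning idea described above can be illustrated with a small toy sketch. This is not the SeqWM implementation: the class `ToyAgent`, the function `sequential_plan`, the 1-D dynamics, the candidate action set, and the collision penalty are all hypothetical stand-ins chosen for illustration. It only shows the ordering, in which each agent imagines its own future and plans against the predicted trajectories of the agents before it, and one possible reading of the linear-communication claim (a single hand-off per agent in the sequence).

```python
from typing import List, Tuple

Trajectory = List[float]


class ToyAgent:
    """Hypothetical agent with a private 1-D world model: next = state + action."""

    def __init__(self, start: float, goal: float, horizon: int = 3) -> None:
        self.start = start
        self.goal = goal
        self.horizon = horizon

    def rollout(self, action: float) -> Trajectory:
        # Imagine the future under a constant action
        # (a toy stand-in for a learned latent rollout).
        state, traj = self.start, []
        for _ in range(self.horizon):
            state += action
            traj.append(state)
        return traj

    def plan(self, predecessor_trajs: List[Trajectory]) -> Tuple[float, Trajectory]:
        # Score each candidate action: distance from the goal at the end of
        # the horizon, plus a penalty (assumed here) for ending within 0.5
        # of a predecessor's predicted final position.
        best = None
        for action in (-1.0, 0.0, 1.0):
            traj = self.rollout(action)
            cost = abs(traj[-1] - self.goal)
            cost += sum(10.0 for p in predecessor_trajs
                        if abs(p[-1] - traj[-1]) < 0.5)
            if best is None or cost < best[0]:
                best = (cost, action, traj)
        return best[1], best[2]


def sequential_plan(agents: List[ToyAgent]) -> Tuple[List[float], int]:
    """Plan agents in order; each one sees only its predecessors' predictions."""
    shared: List[Trajectory] = []
    actions: List[float] = []
    messages = 0
    for i, agent in enumerate(agents):
        action, traj = agent.plan(shared)
        actions.append(action)
        shared.append(traj)
        if i < len(agents) - 1:
            messages += 1  # one hand-off to the next agent in the sequence
    return actions, messages


if __name__ == "__main__":
    team = [ToyAgent(start=0.0, goal=3.0), ToyAgent(start=0.0, goal=3.0)]
    print(sequential_plan(team))
```

In this toy setup both agents want the same goal position. The first agent plans freely and moves toward it; the second sees that prediction and holds back instead of colliding, so the call returns the actions `[1.0, 0.0]` with a single hand-off. With N agents the loop performs N - 1 hand-offs, which is the sense in which the sketch's communication grows linearly.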
Similar Papers
Offline Robotic World Model: Learning Robotic Policies without a Physics Simulator
Robotics
Teaches robots to learn safely from old data.
From Pixels to Cooperation: Multi Agent Reinforcement Learning based on Multimodal World Models
Multiagent Systems
Teaches robots to work together using sight and sound.
MrCoM: A Meta-Regularized World-Model Generalizing Across Multi-Scenarios
Machine Learning (CS)
Teaches robots to learn many tasks quickly.