CoRL-MPPI: Enhancing MPPI With Learnable Behaviours For Efficient And Provably-Safe Multi-Robot Collision Avoidance
By: Stepan Dergachev, Artem Pshenitsyn, Aleksandr Panov, and more
Potential Business Impact:
Robots learn to avoid each other safely and quickly.
Decentralized collision avoidance remains a core challenge for scalable multi-robot systems. One promising approach to this problem is Model Predictive Path Integral (MPPI) control, a framework that naturally accommodates arbitrary robot motion models and provides strong theoretical guarantees. In practice, however, an MPPI-based controller may produce suboptimal trajectories, since its performance relies heavily on uninformed random sampling. In this work, we introduce CoRL-MPPI, a novel fusion of Cooperative Reinforcement Learning and MPPI that addresses this limitation. We train an action policy (approximated by a deep neural network) in simulation that learns local cooperative collision-avoidance behaviors. This learned policy is then embedded into the MPPI framework to guide its sampling distribution, biasing it toward more intelligent and cooperative actions. Notably, CoRL-MPPI preserves all the theoretical guarantees of regular MPPI. We evaluate our approach in dense, dynamic simulated environments against state-of-the-art baselines, including ORCA, BVC, and a multi-agent MPPI implementation. Our results demonstrate that CoRL-MPPI significantly improves navigation efficiency (measured by success rate and makespan) and safety, enabling agile and robust multi-robot navigation.
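To illustrate the idea of policy-guided sampling described in the abstract, below is a minimal sketch of one MPPI update in which the sampling distribution is centered on a learned policy's suggested action rather than on the nominal control alone. This is not the authors' implementation; the names `policy`, `dynamics`, and `cost` are hypothetical placeholders for a trained network, the robot motion model, and the running cost, and all hyperparameters are illustrative.

```python
import numpy as np

def policy_guided_mppi_step(x0, U_nom, policy, dynamics, cost,
                            horizon=20, n_samples=256, sigma=0.3, lam=1.0):
    """One policy-guided MPPI update (illustrative sketch only).

    x0       : current state vector
    U_nom    : nominal control sequence, shape (horizon, u_dim)
    policy   : learned network mapping a state to a suggested control (hypothetical)
    dynamics : state transition x_{t+1} = dynamics(x_t, u_t) (hypothetical)
    cost     : running cost c(x_t, u_t) (hypothetical)
    """
    u_dim = U_nom.shape[1]
    noise = sigma * np.random.randn(n_samples, horizon, u_dim)
    costs = np.zeros(n_samples)
    sampled = np.zeros_like(noise)

    for k in range(n_samples):
        x = x0
        for t in range(horizon):
            # Bias sampling toward the learned cooperative action:
            # perturb the policy output instead of an uninformed mean.
            u = policy(x) + noise[k, t]
            sampled[k, t] = u
            x = dynamics(x, u)
            costs[k] += cost(x, u)

    # Standard MPPI importance weights: softmax of negative trajectory cost.
    beta = costs.min()
    weights = np.exp(-(costs - beta) / lam)
    weights /= weights.sum()

    # Weighted average of the sampled control sequences becomes the new plan.
    U_new = np.einsum('k,ktd->td', weights, sampled)
    return U_new
```

Because the update still reduces to an importance-weighted average over sampled controls, the usual MPPI weighting and convergence arguments carry over; only the mean of the sampling distribution changes.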
Similar Papers
Control of Legged Robots using Model Predictive Optimized Path Integral
Robotics
Robots walk better and faster over rough ground.
PA-MPPI: Perception-Aware Model Predictive Path Integral Control for Quadrotor Navigation in Unknown Environments
Robotics
Helps drones find paths in new places.
Real-Time Gait Adaptation for Quadrupeds using Model Predictive Control and Reinforcement Learning
Robotics
Robots walk better and use less energy.