Curriculum Imitation Learning of Distributed Multi-Robot Policies
By: Jesús Roche, Eduardo Sebastián, Eduardo Montijano
Potential Business Impact:
Teaches robots to work together better.
Learning control policies for multi-robot systems (MRS) remains a major challenge because of the need for long-term coordination and the difficulty of obtaining realistic training data. In this work, we address both challenges within an imitation learning framework. First, we shift the typical role of Curriculum Learning in MRS from scaling with the number of robots to improving long-term coordination. We propose a curriculum strategy that gradually increases the length of expert trajectories during training, stabilizing learning and enhancing the accuracy of long-term behaviors. Second, we introduce a method to approximate the egocentric perception of each robot using only third-person global state demonstrations. Our approach transforms idealized trajectories into locally available observations by filtering neighbors, converting reference frames, and simulating onboard sensor variability. Both contributions are integrated into a physics-informed technique to produce scalable, distributed policies from observations. We conduct experiments across two tasks with varying team sizes and noise levels. Results show that our curriculum improves long-term accuracy, while our perceptual estimation method yields policies that are robust to realistic uncertainty. Together, these strategies enable the learning of robust, distributed controllers from global demonstrations, even in the absence of expert actions or onboard measurements.
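The two ideas in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The function names (`curriculum_horizon`, `egocentric_observation`), the linear length schedule, the planar (2D) state, the fixed sensing radius, and the additive Gaussian noise model are all assumptions made for illustration; the abstract does not specify any of them.

```python
import numpy as np

def curriculum_horizon(epoch, num_epochs, min_len, max_len):
    """Trajectory length to train on at a given epoch.

    Assumed schedule: grows linearly from min_len to max_len and
    stays at max_len once the last scheduled epoch is reached.
    """
    frac = min(1.0, epoch / max(1, num_epochs - 1))
    return int(round(min_len + frac * (max_len - min_len)))

def egocentric_observation(positions, headings, i, radius,
                           noise_std=0.0, rng=None):
    """Approximate what robot i would perceive, given global states.

    positions: (N, 2) global positions; headings: (N,) angles in radians.
    Returns the indices of visible neighbors and their noisy positions
    expressed in robot i's body frame.
    """
    rng = np.random.default_rng() if rng is None else rng

    # 1) Filter neighbors: keep robots within the sensing radius.
    rel = positions - positions[i]
    dist = np.linalg.norm(rel, axis=1)
    mask = dist <= radius
    mask[i] = False  # a robot does not observe itself

    # 2) Convert reference frames: rotate world offsets by -heading.
    c, s = np.cos(headings[i]), np.sin(headings[i])
    world_to_body = np.array([[c, s], [-s, c]])
    local = rel[mask] @ world_to_body.T

    # 3) Simulate sensor variability (assumed zero-mean Gaussian noise).
    local = local + rng.normal(0.0, noise_std, size=local.shape)
    return np.flatnonzero(mask), local
```

In a training loop, each global demonstration would be truncated to `curriculum_horizon(epoch, ...)` steps and every state converted with `egocentric_observation` before being passed to the policy, so that the policy only ever sees inputs it could obtain onboard.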
Similar Papers
Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning
Robotics
Drones carry heavy things without talking to each other.
Learning to Ball: Composing Policies for Long-Horizon Basketball Moves
Graphics
Teaches robots to do many complex actions.
Simultaneous learning of state-to-state minimum-time planning and control
Robotics
Drones fly themselves to any spot fast.