Decentralized Real-Time Planning for Multi-UAV Cooperative Manipulation via Imitation Learning
By: Shantnav Agarwal, Javier Alonso-Mora, Sihao Sun
Potential Business Impact:
Drones carry heavy things without talking to each other.
Existing approaches for transporting and manipulating cable-suspended loads using multiple UAVs along reference trajectories typically rely on either centralized control architectures or reliable inter-agent communication. In this work, we propose a novel machine learning based method for decentralized kinodynamic planning that operates effectively under partial observability and without inter-agent communication. Our method leverages imitation learning to train a decentralized student policy for each UAV by imitating a centralized kinodynamic motion planner with access to privileged global observations. The student policy generates smooth trajectories using physics-informed neural networks that respect the derivative relationships in motion. During training, the student policies utilize the full trajectory generated by the teacher policy, leading to improved sample efficiency. Moreover, each student policy can be trained in under two hours on a standard laptop. We validate our method in both simulation and real-world environments to follow an agile reference trajectory, demonstrating performance comparable to that of centralized approaches.
Similar Papers
Decentralized Aerial Manipulation of a Cable-Suspended Load using Multi-Agent Reinforcement Learning
Robotics
Drones work together to move heavy things.
Simultaneous learning of state-to-state minimum-time planning and control
Robotics
Drones fly themselves to any spot fast.
Energy Efficient Task Offloading in UAV-Enabled MEC Using a Fully Decentralized Deep Reinforcement Learning Approach
Multiagent Systems
Drones fly smarter by talking to neighbors.