Graph Attention-based Decentralized Actor-Critic for Dual-Objective Control of Multi-UAV Swarms
By: Haoran Peng, Ying-Jun Angela Zhang
Potential Business Impact:
Drones cover more ground, last longer.
This research optimizes multi-UAV systems under dual objectives: maximizing service coverage as the primary goal and extending battery lifetime as the secondary goal. We propose a Graph Attention-based Decentralized Actor-Critic (GADC) method to optimize both objectives. The approach uses a graph attention network to process each UAV's limited local observations and to reduce the dimensionality of the environment state. An actor-double-critic network then manages dual policies for joint objective optimization. GADC uses a Kullback-Leibler (KL) divergence factor to balance the tradeoff between coverage performance and battery lifetime in the multi-UAV system. We assess the scalability and efficiency of GADC by benchmarking it against state-of-the-art methods, from both theoretical and experimental perspectives. Extensive testing in both idealized settings and NVIDIA Sionna's realistic ray-tracing environment demonstrates GADC's superior performance.
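To make two of the ingredients named in the abstract more concrete, the following is a minimal illustrative sketch in plain Python. It is not the authors' implementation. It assumes a single-head scaled dot-product attention with no learned weights for aggregating neighbor observations, discrete action distributions, and a hypothetical penalty form in which the secondary (battery) objective is reduced by `beta` times the KL divergence from the primary (coverage) policy. The function names, the `beta` parameter, and the penalty form are all assumptions introduced for illustration; the abstract does not specify them.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_aggregate(own_obs, neighbor_obs):
    """Aggregate a UAV's own observation with its neighbors' observations.

    Assumption: unparameterized scaled dot-product attention. The output
    has the same length as own_obs regardless of how many neighbors are
    visible, which is how attention yields a fixed-size state summary.
    """
    candidates = [own_obs] + list(neighbor_obs)
    scale = math.sqrt(len(own_obs))
    scores = [sum(a * b for a, b in zip(own_obs, c)) / scale for c in candidates]
    weights = softmax(scores)
    return [
        sum(w * c[i] for w, c in zip(weights, candidates))
        for i in range(len(own_obs))
    ]


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions over the same actions."""
    return sum(
        pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q) if pi > 0
    )


def penalized_secondary_objective(battery_value, p_battery, p_coverage, beta):
    """Hypothetical tradeoff: penalize the battery objective when the
    battery policy drifts away from the coverage policy."""
    return battery_value - beta * kl_divergence(p_battery, p_coverage)


# Example: one UAV with two visible neighbors and a 3-action policy pair.
state = attention_aggregate([1.0, 0.0], [[0.5, 0.5], [0.0, 1.0]])
p_cov = [0.7, 0.2, 0.1]
p_bat = [0.4, 0.4, 0.2]
score = penalized_secondary_objective(1.0, p_bat, p_cov, beta=0.5)
```

In this sketch a larger `beta` keeps the battery-saving policy closer to the coverage policy, so coverage stays the primary goal; a smaller `beta` lets the two policies diverge more freely.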