FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone Navigation
By: Jiaping Xiao, Cheng Wen Tsao, Yuhang Zhang and more
Potential Business Impact:
Drones fly themselves using smart computer brains.
Path planning is a critical component in autonomous drone operations, enabling safe and efficient navigation through complex environments. Recent advances in foundation models, particularly large language models (LLMs) and vision-language models (VLMs), have opened new opportunities for enhanced perception and intelligent decision-making in robotics. However, their practical applicability and effectiveness in global path planning remain relatively unexplored. This paper proposes foundation model-guided path planners (FM-Planner) and presents a comprehensive benchmarking study and practical validation for drone path planning. Specifically, we first systematically evaluate eight representative LLM and VLM approaches using standardized simulation scenarios. To enable effective real-time navigation, we then design an integrated LLM-Vision planner that combines semantic reasoning with visual perception. Furthermore, we deploy and validate the proposed path planner through real-world experiments under multiple configurations. Our findings provide valuable insights into the strengths, limitations, and feasibility of deploying foundation models in real-world drone applications, and they provide practical implementations for autonomous flight. Project site: https://github.com/NTU-ICG/FM-Planner.
Similar Papers
Foundation Model Driven Robotics: A Comprehensive Review
Robotics
Robots understand and do tasks better with smart AI.
Deploying Foundation Model-Enabled Air and Ground Robots in the Field: Challenges and Opportunities
Robotics
Robots explore big, messy places using smart language.
Foundation Models for Autonomous Driving System: An Initial Roadmap
Software Engineering
Helps self-driving cars understand the world better.