SIGN: Safety-Aware Image-Goal Navigation for Autonomous Drones via Reinforcement Learning
By: Zichen Yan, Rui Huang, Lei He and more
Potential Business Impact:
A drone finds a location using only a picture of it.
Image-goal navigation (ImageNav) tasks a robot with autonomously exploring an unknown environment and reaching a location that visually matches a given target image. While prior work primarily studies ImageNav for ground robots, enabling this capability for autonomous drones is substantially more challenging because stable flight requires high-frequency feedback control and global localization. In this paper, we propose a novel sim-to-real framework that leverages visual reinforcement learning (RL) to achieve ImageNav for drones. To strengthen the visual representation, our approach trains the vision backbone with auxiliary tasks, including image perturbations and future transition prediction, which results in more effective policy training. The proposed algorithm enables end-to-end ImageNav with direct velocity control, eliminating the need for external localization. Furthermore, we integrate a depth-based safety module for real-time obstacle avoidance, allowing the drone to navigate safely in cluttered environments. Unlike most existing drone navigation methods, which focus solely on reference tracking or obstacle avoidance, our framework supports comprehensive navigation behaviors (autonomous exploration, obstacle avoidance, and image-goal seeking) without requiring explicit global mapping. Code and model checkpoints will be released upon acceptance.
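The abstract does not describe how the depth-based safety module works internally, so the following is only an illustrative sketch of one common way such a filter can be built, not the authors' method. It assumes a forward-facing depth image in metres and a scalar forward-velocity command from the policy. The function names, the central-window heuristic, and the `stop_dist` and `slow_dist` thresholds are all hypothetical.

```python
import math


def min_depth_in_center(depth, frac=0.5):
    """Smallest valid depth (metres) inside the central window of a depth image.

    `depth` is a list of rows; zero, negative, NaN, and infinite readings are
    treated as invalid. If the window holds no valid reading, return 0.0 so the
    caller behaves conservatively (assumption: unknown means blocked).
    """
    rows, cols = len(depth), len(depth[0])
    r0, r1 = int(rows * (1 - frac) / 2), math.ceil(rows * (1 + frac) / 2)
    c0, c1 = int(cols * (1 - frac) / 2), math.ceil(cols * (1 + frac) / 2)
    valid = [
        d
        for row in depth[r0:r1]
        for d in row[c0:c1]
        if math.isfinite(d) and d > 0
    ]
    return min(valid) if valid else 0.0


def filter_forward_velocity(v_cmd, depth, stop_dist=0.5, slow_dist=2.0):
    """Scale a forward-velocity command according to the nearest obstacle ahead.

    Hypothetical thresholds: full speed beyond `slow_dist`, zero speed at or
    inside `stop_dist`, linear ramp in between. Backward commands pass through.
    """
    if v_cmd <= 0:
        return v_cmd
    nearest = min_depth_in_center(depth)
    scale = (nearest - stop_dist) / (slow_dist - stop_dist)
    scale = max(0.0, min(1.0, scale))
    return v_cmd * scale


if __name__ == "__main__":
    open_space = [[10.0] * 4 for _ in range(4)]
    print(filter_forward_velocity(1.0, open_space))
```

A filter of this kind sits between the learned policy and the flight controller, so it can override the policy's command without retraining. A real system would likely also constrain lateral and vertical velocity.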
Similar Papers
What Matters in RL-Based Methods for Object-Goal Navigation? An Empirical Study and A Unified Framework
Robotics
Robots find objects in new places better.
IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation
CV and Pattern Recognition
Helps robots find things using just a picture.
LEARN: Learning End-to-End Aerial Resource-Constrained Multi-Robot Navigation
Robotics
Tiny drones fly safely through tight spaces.