SLAM-Free Visual Navigation with Hierarchical Vision-Language Perception and Coarse-to-Fine Semantic Topological Planning
By: Guoyang Zhao, Yudong Li, Weiqing Qi, and more
Potential Business Impact:
Robots learn to explore using words and pictures.
Conventional SLAM pipelines for legged-robot navigation are fragile under rapid motion, demand careful calibration, and suffer from sensor drift, while offering limited semantic reasoning for task-driven exploration. To address these issues, we propose a vision-only, SLAM-free navigation framework that replaces dense geometry with semantic reasoning and lightweight topological representations. A hierarchical vision-language perception module fuses scene-level context with object-level cues for robust semantic inference, and a semantic-probabilistic topological map supports coarse-to-fine planning: LLM-based global reasoning selects subgoals, while vision-based local planning handles obstacle avoidance. Integrated with reinforcement-learning locomotion controllers, the framework is deployable across diverse legged-robot platforms. Experiments in simulation and the real world demonstrate consistent improvements in semantic accuracy, planning quality, and navigation success, and ablation studies confirm that both hierarchical perception and fine-grained local planning are necessary. This work introduces a new paradigm for SLAM-free, vision-language-driven navigation, shifting robotic exploration from geometry-centric mapping to semantics-driven decision making.
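To make the coarse-to-fine idea concrete, here is a minimal Python sketch of what a semantic-probabilistic topological map with subgoal selection might look like. This is not code from the paper: the class names (`SemanticTopoMap`, `TopoNode`), the scoring rule, and the distance-penalty weight are illustrative assumptions. In the actual system, the per-label relevance scores would come from LLM reasoning over the task instruction, and the node labels and confidences from the hierarchical vision-language perception module.

```python
import math
from dataclasses import dataclass, field

@dataclass
class TopoNode:
    """A node in a semantic-probabilistic topological map (hypothetical layout)."""
    node_id: int
    position: tuple            # coarse 2D position estimate (x, y), no dense geometry
    semantic_label: str        # e.g. "kitchen", produced by the VLM perception module
    label_prob: float          # confidence of the semantic label in [0, 1]
    visited: bool = False
    neighbors: set = field(default_factory=set)

class SemanticTopoMap:
    """Lightweight topological map: nodes plus adjacency, no metric occupancy grid."""

    def __init__(self):
        self.nodes = {}

    def add_node(self, node: TopoNode):
        self.nodes[node.node_id] = node

    def connect(self, a: int, b: int):
        # Undirected traversability edge between two nodes.
        self.nodes[a].neighbors.add(b)
        self.nodes[b].neighbors.add(a)

    def select_subgoal(self, current_id: int, relevance: dict) -> int:
        """Coarse planning step: pick the unvisited node that best trades off
        task relevance (an LLM-provided score per semantic label) weighted by
        label confidence, against travel distance."""
        cur = self.nodes[current_id]
        best_id, best_score = current_id, -math.inf
        for node in self.nodes.values():
            if node.visited or node.node_id == current_id:
                continue
            dist = math.dist(cur.position, node.position)
            # The 0.1 distance-penalty weight is a made-up constant for illustration.
            score = relevance.get(node.semantic_label, 0.0) * node.label_prob - 0.1 * dist
            if score > best_score:
                best_id, best_score = node.node_id, score
        return best_id

# Example: for an instruction like "find a coffee mug", the LLM might rank
# "kitchen" as highly relevant; the fine-grained vision-based local planner
# (not sketched here) would then drive toward the selected node while
# avoiding obstacles.
m = SemanticTopoMap()
m.add_node(TopoNode(0, (0.0, 0.0), "hallway", 0.9, visited=True))
m.add_node(TopoNode(1, (2.0, 1.0), "kitchen", 0.8))
m.add_node(TopoNode(2, (1.0, 3.0), "office", 0.7))
m.connect(0, 1)
m.connect(0, 2)
subgoal = m.select_subgoal(0, {"kitchen": 1.0, "office": 0.3})
print(subgoal)  # -> 1 (the kitchen node wins on relevance despite its distance)
```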
Similar Papers
Hierarchical Language Models for Semantic Navigation and Manipulation in an Aerial-Ground Robotic System
Robotics
Robots work together better using AI to move things.
Vision-Aided Online A* Path Planning for Efficient and Safe Navigation of Service Robots
Robotics
Robot sees important things, not just obstacles.
Efficient Navigation in Unknown Indoor Environments with Vision-Language Models
Robotics
Helps robots find the shortest path in new places.