NavSpace: How Navigation Agents Follow Spatial Intelligence Instructions
By: Haolin Yang , Yuxing Long , Zhuoyuan Yu and more
Potential Business Impact:
Helps robots learn to walk and find places.
Instruction-following navigation is a key step toward embodied intelligence. Prior benchmarks mainly focus on semantic understanding but overlook systematically evaluating navigation agents' spatial perception and reasoning capabilities. In this work, we introduce the NavSpace benchmark, which contains six task categories and 1,228 trajectory-instruction pairs designed to probe the spatial intelligence of navigation agents. On this benchmark, we comprehensively evaluate 22 navigation agents, including state-of-the-art navigation models and multimodal large language models. The evaluation results lift the veil on spatial intelligence in embodied navigation. Furthermore, we propose SNav, a new spatially intelligent navigation model. SNav outperforms existing navigation agents on NavSpace and real robot tests, establishing a strong baseline for future work.
Similar Papers
From reactive to cognitive: brain-inspired spatial intelligence for embodied agents
Artificial Intelligence
Helps robots learn and remember places like humans.
IndustryNav: Exploring Spatial Reasoning of Embodied Agents in Dynamic Industrial Navigation
Robotics
Helps robots safely navigate busy factories.
Endowing Embodied Agents with Spatial Reasoning Capabilities for Vision-and-Language Navigation
Artificial Intelligence
Helps robots see and move without getting lost.