PanoNav: Mapless Zero-Shot Object Navigation with Panoramic Scene Parsing and Dynamic Memory
By: Qunchao Jin, Yilin Wu, Changhao Chen
Potential Business Impact:
A robot finds its way in new places without maps.
Zero-shot object navigation (ZSON) in unseen environments remains a challenging problem for household robots, requiring strong perceptual understanding and decision-making capabilities. While recent methods leverage metric maps and Large Language Models (LLMs), they often depend on depth sensors or prebuilt maps, leaving the spatial reasoning ability of Multimodal Large Language Models (MLLMs) underexploited. Mapless ZSON approaches have emerged to address this, but they typically make short-sighted decisions and can fall into local deadlocks because they lack historical context. We propose PanoNav, a fully RGB-only, mapless ZSON framework that integrates a Panoramic Scene Parsing module, which unlocks the spatial parsing potential of MLLMs from panoramic RGB inputs, and a Memory-guided Decision-Making mechanism enhanced by a Dynamic Bounded Memory Queue, which incorporates exploration history to avoid local deadlocks. Experiments on a public navigation benchmark show that PanoNav significantly outperforms representative baselines in both success rate (SR) and success weighted by path length (SPL).
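The abstract does not spell out how the Dynamic Bounded Memory Queue is implemented, so the following is only a minimal Python sketch of the general idea: a fixed-capacity FIFO of recent exploration steps that is serialized into the MLLM decision prompt so the agent can avoid revisiting explored areas. All names here (MemoryEntry, push, to_prompt, the entry fields) are hypothetical illustrations, not the paper's API.

```python
from collections import deque
from dataclasses import dataclass


@dataclass
class MemoryEntry:
    step: int       # decision step at which the observation was taken (assumed field)
    direction: str  # heading chosen at that step, e.g. "front-left" (assumed field)
    summary: str    # short scene description parsed from the panorama (assumed field)


class DynamicBoundedMemoryQueue:
    """Fixed-capacity FIFO of recent exploration history.

    When the queue is full, the oldest entry is evicted automatically
    (via deque's maxlen), so the rendered prompt stays bounded while
    still covering the most recent steps.
    """

    def __init__(self, capacity: int = 8):
        self._entries: deque = deque(maxlen=capacity)

    def push(self, step: int, direction: str, summary: str) -> None:
        """Record one exploration step; evicts the oldest entry when full."""
        self._entries.append(MemoryEntry(step, direction, summary))

    def to_prompt(self) -> str:
        """Render the history as text to prepend to the MLLM decision prompt."""
        if not self._entries:
            return "No exploration history yet."
        lines = [
            f"Step {e.step}: moved {e.direction}; saw {e.summary}"
            for e in self._entries
        ]
        return "Recent exploration history:\n" + "\n".join(lines)


# Usage example: the rendered history gives the MLLM context for the next
# direction choice, discouraging it from re-entering already-explored regions.
memory = DynamicBoundedMemoryQueue(capacity=4)
memory.push(1, "front", "a hallway with closed doors")
memory.push(2, "left", "a kitchen; no target object visible")
print(memory.to_prompt())
```

Bounding the queue is the key design choice this sketch illustrates: it keeps the prompt length (and thus inference cost) constant while still supplying enough recent context to break out of local deadlocks.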
Similar Papers
BeliefMapNav: 3D Voxel-Based Belief Map for Zero-Shot Object Navigation
Robotics
Robots find things in new places using words.
Object Navigation with Structure-Semantic Reasoning-Based Multi-level Map and Multimodal Decision-Making LLM
Robotics
Helps robots find hidden objects in new places.
Fast-SmartWay: Panoramic-Free End-to-End Zero-Shot Vision-and-Language Navigation
Robotics
Helps robots navigate using only a few pictures.