
VL-LN Bench: Towards Long-horizon Goal-oriented Navigation with Active Dialogs

Published: December 26, 2025 | arXiv ID: 2512.22342v1

By: Wensi Huang, Shaohao Zhu, Meng Wei and more

In most existing embodied navigation tasks, such as instruction following and object searching, instructions are well-defined and unambiguous. Under this idealized setting, agents need only produce effective navigation outputs conditioned on vision and language inputs. However, real-world navigation instructions are often vague and ambiguous, requiring the agent to resolve uncertainty and infer user intent through active dialog. To address this gap, we propose Interactive Instance Object Navigation (IION), a task that requires agents not only to generate navigation actions but also to produce language outputs via active dialog, thereby aligning more closely with practical settings. IION extends Instance Object Navigation (ION) by allowing agents to freely consult an oracle in natural language while navigating. Building on this task, we present the Vision Language-Language Navigation (VL-LN) benchmark, which provides a large-scale, automatically generated dataset and a comprehensive evaluation protocol for training and assessing dialog-enabled navigation models. VL-LN comprises over 41k long-horizon, dialog-augmented trajectories for training and an automatic evaluation protocol with an oracle capable of responding to agent queries. Using this benchmark, we train a navigation model equipped with dialog capabilities and show that it achieves significant improvements over the baselines. Extensive experiments and analyses further demonstrate the effectiveness and reliability of VL-LN for advancing research on dialog-enabled embodied navigation. Code and dataset: https://0309hws.github.io/VL-LN.github.io/

A Navigation Framework Utilizing Vision-Language Models

Robotics

Helps robots follow spoken directions in new places.

11 Jun 2025 0

91%

ActiveVLN: Towards Active Exploration via Multi-Turn RL in Vision-and-Language Navigation

Robotics

Teaches robots to explore new places by themselves.

16 Sep 2025 1

91%

Breaking Down and Building Up: Mixture of Skill-Based Vision-and-Language Navigation Agents

Artificial Intelligence

Helps robots follow directions in new places.

11 Aug 2025 2

