Score: 1

ArtiBench and ArtiBrain: Benchmarking Generalizable Vision-Language Articulated Object Manipulation

Published: November 25, 2025 | arXiv ID: 2511.20330v2

By: Yuhan Wu, Tiantian Wei, Shuo Wang, and more

Potential Business Impact:

Robots can learn to manipulate unfamiliar appliances, tools, and other articulated objects, generalizing across parts, instances, and categories without task-specific retraining.

Business Areas:
Artificial Intelligence, Data and Analytics, Science and Engineering, Software

Interactive articulated manipulation requires long-horizon, multi-step interactions with appliances while maintaining physical consistency. Existing vision-language and diffusion-based policies struggle to generalize across parts, instances, and categories. We first introduce ArtiBench, a five-level benchmark covering kitchen, storage, office, and tool environments. ArtiBench enables structured evaluation from cross-part and cross-instance variation to long-horizon multi-object tasks, revealing the core generalization challenges of articulated object manipulation. Building on this benchmark, we propose ArtiBrain, a modular framework that unifies high-level reasoning with adaptive low-level control. ArtiBrain uses a VLM-based Task Reasoner (GPT-4.1) to decompose and validate subgoals, and employs a Hybrid Controller that combines geometry-aware keyframe execution with affordance-guided diffusion for precise and interpretable manipulation. An Affordance Memory Bank continually accumulates successful execution episodes and propagates part-level actionable affordances to unseen articulated parts and configurations. Extensive experiments on ArtiBench show that ArtiBrain significantly outperforms state-of-the-art multimodal and diffusion-based methods in robustness and generalization. Code and dataset will be released upon acceptance.
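The abstract describes a three-part architecture: a VLM-based Task Reasoner that decomposes and validates subgoals, a Hybrid Controller for low-level execution, and an Affordance Memory Bank that reuses part-level affordances from successful episodes. The Python sketch below illustrates how such a loop could be wired together; it is a minimal illustration only, and all class, method, and field names (TaskReasoner, HybridController, AffordanceMemoryBank, run_episode, etc.) are assumptions for exposition, not the authors' released code or API.

```python
from dataclasses import dataclass, field

# Hypothetical sketch of the ArtiBrain-style control loop described in the
# abstract. Names and signatures are illustrative assumptions.

@dataclass
class Subgoal:
    description: str   # e.g. "open the top drawer"
    target_part: str   # articulated part the subgoal acts on


@dataclass
class AffordanceMemoryBank:
    """Accumulates part-level affordances from successful episodes and
    reuses them for unseen articulated parts and configurations."""
    episodes: dict = field(default_factory=dict)

    def retrieve(self, part: str):
        # Returns None when no similar part has been seen yet.
        return self.episodes.get(part)

    def store(self, part: str, affordance) -> None:
        self.episodes[part] = affordance


class TaskReasoner:
    """VLM-based reasoner (GPT-4.1 in the paper) that decomposes a task
    into subgoals and validates each one."""
    def plan(self, instruction: str, observation) -> list[Subgoal]:
        # Placeholder decomposition; the real system queries a VLM.
        return [Subgoal(description=instruction, target_part="handle")]

    def validate(self, subgoal: Subgoal, observation) -> bool:
        return True  # placeholder: VLM checks subgoal completion


class HybridController:
    """Combines geometry-aware keyframe execution with affordance-guided
    diffusion for low-level control (both stubbed out here)."""
    def execute(self, subgoal: Subgoal, affordance, observation) -> bool:
        # A real controller would condition the diffusion policy on the
        # retrieved affordance and execute geometry-aware keyframes.
        _ = (subgoal, affordance, observation)
        return True  # placeholder success signal


def run_episode(instruction: str, observation=None) -> bool:
    reasoner, controller, memory = TaskReasoner(), HybridController(), AffordanceMemoryBank()
    for subgoal in reasoner.plan(instruction, observation):
        affordance = memory.retrieve(subgoal.target_part)
        success = controller.execute(subgoal, affordance, observation)
        if not (success and reasoner.validate(subgoal, observation)):
            return False
        # Successful executions are written back so part-level affordances
        # can propagate to unseen parts and configurations.
        memory.store(subgoal.target_part, affordance or subgoal.description)
    return True


if __name__ == "__main__":
    print(run_episode("open the microwave door"))
```

The key design point this sketch tries to capture is the feedback path: subgoals are validated by the reasoner after execution, and only successful rollouts are written into the memory bank for later reuse.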

Page Count
11 pages

Category
Computer Science:
Robotics