Score: 2

SliceLens: Fine-Grained and Grounded Error Slice Discovery for Multi-Instance Vision Tasks

Published: December 31, 2025 | arXiv ID: 2512.24592v1

By: Wei Zhang, Chaoqun Wang, Zixuan Guan and more

BigTech Affiliations: Apple

Potential Business Impact:

Finds computer vision mistakes in tricky pictures.

Business Areas:

Image Recognition, Data and Analytics, Software

Systematic failures of computer vision models on subsets with coherent visual patterns, known as error slices, pose a critical challenge for robust model evaluation. Existing slice discovery methods are primarily developed for image classification, limiting their applicability to multi-instance tasks such as detection, segmentation, and pose estimation. In real-world scenarios, error slices often arise from corner cases involving complex visual relationships, where existing instance-level approaches lacking fine-grained reasoning struggle to yield meaningful insights. Moreover, current benchmarks are typically tailored to specific algorithms or biased toward image classification, with artificial ground truth that fails to reflect real model failures. To address these limitations, we propose SliceLens, a hypothesis-driven framework that leverages LLMs and VLMs to generate and verify diverse failure hypotheses through grounded visual reasoning, enabling reliable identification of fine-grained and interpretable error slices. We further introduce FeSD (Fine-grained Slice Discovery), the first benchmark specifically designed for evaluating fine-grained error slice discovery across instance-level vision tasks, featuring expert-annotated and carefully refined ground-truth slices with precise grounding to local error regions. Extensive experiments on both existing benchmarks and FeSD demonstrate that SliceLens achieves state-of-the-art performance, improving Precision@10 by 0.42 (0.73 vs. 0.31) on FeSD, and identifies interpretable slices that facilitate actionable model improvements, as validated through model repair experiments.
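The headline result above is reported as Precision@10. The abstract does not spell out how the metric is computed, so the following is only a minimal sketch assuming the common ranked-retrieval definition: the fraction of the top-k samples returned for a discovered slice that actually belong to the ground-truth slice. The function name, argument names, and sample IDs are hypothetical and are not taken from the paper.

```python
from typing import Hashable, Sequence, Set


def precision_at_k(
    ranked_ids: Sequence[Hashable],
    true_slice_ids: Set[Hashable],
    k: int = 10,
) -> float:
    """Fraction of the top-k ranked samples that lie in the ground-truth slice.

    Assumption: standard Precision@k, dividing by k even when fewer than
    k samples are returned. The paper's exact protocol may differ.
    """
    if k <= 0:
        raise ValueError("k must be positive")
    top_k = ranked_ids[:k]
    hits = sum(1 for sample_id in top_k if sample_id in true_slice_ids)
    return hits / k


# Hypothetical example: 10 ranked samples, 7 of which are true slice members.
ranked = [f"img_{i:02d}" for i in range(10)]
true_slice = {"img_00", "img_01", "img_02", "img_04", "img_05", "img_07", "img_09"}
score = precision_at_k(ranked, true_slice, k=10)
print(f"Precision@10 = {score:.2f}")
```

Under this reading, a score of 0.73 would mean that on average roughly 7 of the 10 top-ranked samples per slice are genuine members of the annotated error slice, compared with roughly 3 of 10 at 0.31.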

HiBug2: Efficient and Interpretable Error Slice Discovery for Comprehensive Model Debugging

CV and Pattern Recognition

Finds and fixes computer vision mistakes.

28 Jan 2025 1

Detecting Systematic Weaknesses in Vision Models along Predefined Human-Understandable Dimensions

CV and Pattern Recognition

Finds image problems computers miss.

17 Feb 2025 2

FeatureLens: A Highly Generalizable and Interpretable Framework for Detecting Adversarial Examples Based on Image Features

CV and Pattern Recognition

Finds fake images that trick computers.

3 Dec 2025 1

Country of Origin

🇺🇸 United States

Page Count

22 pages
