Training Feature Attribution for Vision Models
By: Aziz Bacha, Thomas George
Potential Business Impact:
Shows how flawed training images can mislead a model's predictions.
Deep neural networks are often considered opaque systems, prompting the need for explainability methods to improve trust and accountability. Existing approaches typically attribute test-time predictions either to input features (e.g., pixels in an image) or to influential training examples. We argue that both perspectives should be studied jointly. This work explores *training feature attribution*, which links test predictions to specific regions of specific training images and thereby provides new insights into the inner workings of deep models. Our experiments on vision datasets show that training feature attribution yields fine-grained, test-specific explanations: it identifies harmful examples that drive misclassifications and reveals spurious correlations, such as patch-based shortcuts, that conventional attribution methods fail to expose.
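To make the idea concrete, here is a minimal sketch of one way to link a test prediction to regions of a single training image: a TracIn-style gradient dot product scores the training example's influence on the test loss, and differentiating that score with respect to the training image's pixels localises where in the image the influence comes from. This is an illustrative assumption, not the authors' exact method, and all function and argument names below are hypothetical.

```python
import torch

def training_feature_attribution(model, loss_fn, test_x, test_y, train_x, train_y):
    """Sketch: influence score of one training example on one test example,
    plus a pixel-level heatmap over the training image.

    Assumes test_x / train_x are (C, H, W) tensors and test_y / train_y are
    scalar label tensors; loss_fn is e.g. torch.nn.functional.cross_entropy.
    """
    params = [p for p in model.parameters() if p.requires_grad]

    # Gradient of the test loss w.r.t. model parameters.
    test_loss = loss_fn(model(test_x.unsqueeze(0)), test_y.unsqueeze(0))
    g_test = torch.autograd.grad(test_loss, params)

    # Gradient of the training loss w.r.t. parameters, kept differentiable
    # w.r.t. the training image so we can attribute to its pixels.
    train_x = train_x.clone().requires_grad_(True)
    train_loss = loss_fn(model(train_x.unsqueeze(0)), train_y.unsqueeze(0))
    g_train = torch.autograd.grad(train_loss, params, create_graph=True)

    # TracIn-style influence: dot product of the two parameter gradients.
    influence = sum((gt.detach() * gr).sum() for gt, gr in zip(g_test, g_train))

    # Pixel-level attribution: gradient of the influence score w.r.t. the
    # training image, aggregated over channels into an H x W saliency map.
    heatmap, = torch.autograd.grad(influence, train_x)
    return influence.detach(), heatmap.abs().sum(dim=0)
```

Ranking training examples by the influence score surfaces candidate "harmful" examples for a given misclassified test input, while the per-pixel heatmap indicates whether the influence is carried by the object itself or by a spurious region such as a background patch.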
Similar Papers
Attribution Explanations for Deep Neural Networks: A Theoretical Perspective
Machine Learning (CS)
Makes AI decisions easier to understand.
Distribution-Based Feature Attribution for Explaining the Predictions of Any Classifier
Machine Learning (CS)
Explains AI decisions using data patterns.
Higher-Order Feature Attribution: Bridging Statistics, Explainable AI, and Topological Signal Processing
Machine Learning (CS)
Explains how computer decisions are made.