Score: 2

CausalFSFG: Rethinking Few-Shot Fine-Grained Visual Categorization from Causal Perspective

Published: December 25, 2025 | arXiv ID: 2512.21617v1

By: Zhiwen Yang, Jinglin Xu, Yuxin Peng

Potential Business Impact:

Teaches computers to tell apart similar things with few examples.

Business Areas:

Image Recognition, Data and Analytics, Software

Few-shot fine-grained visual categorization (FS-FGVC) focuses on identifying subcategories within a common superclass given just one or a few support examples. Most existing methods aim to boost classification accuracy by enriching the extracted features with discriminative part-level details. However, they often overlook the fact that the set of support samples acts as a confounding variable, which hampers FS-FGVC performance by introducing a biased data distribution and misguiding the extraction of discriminative features. To address this issue, we propose CausalFSFG, a causal FS-FGVC approach inspired by causal inference that corrects biased data distributions through causal intervention. Specifically, based on the structural causal model (SCM), we argue that FS-FGVC infers the subcategories (i.e., the effect) from the inputs (i.e., the cause), whereas both the disturbance of the few-shot condition and the inherent fine-grained nature of the task (i.e., large intra-class variance and small inter-class variance) give rise to unobservable variables that introduce spurious correlations, compromising the final classification performance. To eliminate these spurious correlations, our CausalFSFG approach incorporates two key components: (1) an interventional multi-scale encoder (IMSE), which conducts sample-level interventions, and (2) interventional masked feature reconstruction (IMFR), which conducts feature-level interventions. Together, they reveal the true causal relationships from inputs to subcategories. Extensive experiments and thorough analyses on widely used public datasets, including CUB-200-2011, Stanford Dogs, and Stanford Cars, demonstrate that CausalFSFG achieves new state-of-the-art performance. The code is available at https://github.com/PKU-ICST-MIPL/CausalFSFG_TMM.
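
The abstract's central claim is that a confounder can make the observed association between inputs and labels differ from the causal effect. The toy sketch below illustrates that gap with the textbook backdoor adjustment for a single observed binary confounder. It is not the authors' IMSE or IMFR method, and the abstract does not say which adjustment they use. The variable Z, all probability values, and the function names are hypothetical, chosen only for illustration.

```python
# Toy illustration only. Z is a hypothetical observed binary confounder that
# influences both the input X and the label Y. All numbers are made up.
P_Z = {0: 0.5, 1: 0.5}                      # P(Z = z)
P_X1_GIVEN_Z = {0: 0.2, 1: 0.8}             # P(X = 1 | Z = z)
P_Y1_GIVEN_XZ = {                           # P(Y = 1 | X = x, Z = z), keyed by (x, z)
    (0, 0): 0.1, (1, 0): 0.3,
    (0, 1): 0.6, (1, 1): 0.8,
}


def p_x_given_z(x, z):
    """Return P(X = x | Z = z) for binary x."""
    p1 = P_X1_GIVEN_Z[z]
    return p1 if x == 1 else 1.0 - p1


def observational(x):
    """P(Y = 1 | X = x): weights each z by the posterior P(z | x)."""
    joint = {z: p_x_given_z(x, z) * P_Z[z] for z in P_Z}
    norm = sum(joint.values())
    return sum(P_Y1_GIVEN_XZ[(x, z)] * joint[z] / norm for z in P_Z)


def interventional(x):
    """P(Y = 1 | do(X = x)) via backdoor adjustment: weights each z by the prior P(z)."""
    return sum(P_Y1_GIVEN_XZ[(x, z)] * P_Z[z] for z in P_Z)


if __name__ == "__main__":
    obs_gap = observational(1) - observational(0)
    causal_gap = interventional(1) - interventional(0)
    print(f"observed difference: {obs_gap:.2f}")
    print(f"causal difference:   {causal_gap:.2f}")
```

In this toy setting the observed difference in P(Y = 1) between X = 1 and X = 0 is larger than the difference under intervention, because Z raises the probability of both X = 1 and Y = 1. That inflation is the kind of spurious correlation the abstract attributes to the support set and the fine-grained setting.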

UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval

CV and Pattern Recognition

Teaches computers to tell similar things apart with few examples.

6 Aug 2025 0

89%

Saccadic Vision for Fine-Grained Visual Classification

CV and Pattern Recognition

Helps computers tell apart very similar things.

19 Sep 2025 0

88%

FGDCC: Fine-Grained Deep Cluster Categorization -- A Framework for Intra-Class Variability Problems in Plant Classification

Artificial Intelligence

Helps computers tell similar things apart better.

23 Dec 2025 2


Country of Origin

🇨🇳 China

Repos / Data Links

github.com

Page Count

12 pages
