Cross-Hierarchical Bidirectional Consistency Learning for Fine-Grained Visual Classification
By: Pengxiang Gao, Yihao Liang, Yanzhi Song and more
Potential Business Impact:
Teaches computers to tell very similar things apart.
Fine-Grained Visual Classification (FGVC) aims to categorize closely related subclasses, a task complicated by minimal inter-class differences and significant intra-class variance. Existing methods often rely on additional annotations for image classification, overlooking the valuable information embedded in tree hierarchies, which describe how labels at different levels relate to one another. To leverage this knowledge to improve classification accuracy and consistency, we propose a novel Cross-Hierarchical Bidirectional Consistency Learning (CHBC) framework. The CHBC framework extracts discriminative features across the hierarchy levels using a specially designed module that decomposes and enhances attention masks and features. We employ a bidirectional consistency loss to regulate the classification outcomes across different hierarchy levels, ensuring label prediction consistency and reducing misclassification. Experiments on three widely used FGVC datasets validate the effectiveness of the CHBC framework. Ablation studies further investigate the application strategies of feature enhancement and consistency constraints, underscoring the significant contributions of the proposed modules.
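The abstract does not give the form of the bidirectional consistency loss, so the following is only an illustrative sketch of how a consistency penalty between two hierarchy levels might look, not the authors' formulation. It assumes a two-level label tree given as a hypothetical `parent` list (fine class index to coarse class index), sums the fine-level probabilities under each parent to get an implied coarse distribution, and penalizes disagreement with the coarse head using a symmetric KL divergence. All function names here are made up for illustration.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]

def aggregate_to_coarse(fine_probs, parent, num_coarse):
    # Sum the probability of each fine class into its parent coarse class.
    coarse = [0.0] * num_coarse
    for fine_idx, p in enumerate(fine_probs):
        coarse[parent[fine_idx]] += p
    return coarse

def kl_divergence(p, q, eps=1e-12):
    # KL(p || q), with a small epsilon to avoid log(0).
    return sum(pi * math.log((pi + eps) / (qi + eps)) for pi, qi in zip(p, q))

def bidirectional_consistency(fine_logits, coarse_logits, parent):
    # Assumption: "bidirectional" is modeled here as a symmetric KL term
    # between the coarse head and the coarse distribution implied by the
    # fine head. The paper's actual loss may differ.
    fine_probs = softmax(fine_logits)
    coarse_probs = softmax(coarse_logits)
    implied = aggregate_to_coarse(fine_probs, parent, len(coarse_logits))
    return kl_divergence(coarse_probs, implied) + kl_divergence(implied, coarse_probs)

# Hypothetical tree: fine classes 0 and 1 belong to coarse class 0,
# fine classes 2 and 3 belong to coarse class 1.
parent = [0, 0, 1, 1]

# The two heads agree: both are uniform at the coarse level.
loss_consistent = bidirectional_consistency([0.0, 0.0, 0.0, 0.0], [0.0, 0.0], parent)

# The two heads disagree: fine head favors coarse class 0, coarse head favors class 1.
loss_conflicting = bidirectional_consistency([5.0, 5.0, 0.0, 0.0], [0.0, 5.0], parent)
```

In this sketch the penalty is zero when the coarse prediction matches what the fine prediction implies through the tree, and grows as the two levels contradict each other, which is the kind of cross-level agreement the abstract describes.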
Similar Papers
Saccadic Vision for Fine-Grained Visual Classification
CV and Pattern Recognition
Helps computers tell apart very similar things.
H3Former: Hypergraph-based Semantic-Aware Aggregation via Hyperbolic Hierarchical Contrastive Loss for Fine-Grained Visual Classification
CV and Pattern Recognition
Helps computers tell apart very similar things.
UniFGVC: Universal Training-Free Few-Shot Fine-Grained Vision Classification via Attribute-Aware Multimodal Retrieval
CV and Pattern Recognition
Teaches computers to tell similar things apart with few examples.