Improving Group Robustness on Spurious Correlation via Evidential Alignment
By: Wenqian Ye, Guangtao Zheng, Aidong Zhang
Potential Business Impact:
Teaches computers to see things correctly, not by guessing.
Deep neural networks often learn and rely on spurious correlations, i.e., superficial associations between non-causal features and the targets. For instance, an image classifier may identify camels based on desert backgrounds. While this reliance can yield high overall accuracy during training, it degrades generalization in more diverse scenarios where such correlations do not hold. This problem poses significant challenges for out-of-distribution robustness and trustworthiness. Existing methods typically mitigate this issue by using external group annotations or auxiliary deterministic models to learn unbiased representations. However, group annotations are costly to obtain, and deterministic models may fail to capture the full spectrum of biases learned by the models. To address these limitations, we propose Evidential Alignment, a novel framework that leverages uncertainty quantification to understand the behavior of biased models without requiring group annotations. By quantifying the evidence behind model predictions with second-order risk minimization and calibrating the biased models with the proposed evidential calibration technique, Evidential Alignment identifies and suppresses spurious correlations while preserving core features. We theoretically justify the effectiveness of our method, showing that it is capable of learning the patterns of biased models and debiasing the model without requiring any spurious correlation annotations. Empirical results demonstrate that our method significantly improves group robustness across diverse architectures and data modalities, providing a scalable and principled solution to spurious correlations.
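The abstract does not state the formulas behind "quantifying the evidence of model prediction". As a rough illustration only, the sketch below uses a formulation common in evidential deep learning, which is an assumption here and not necessarily the authors' method: non-negative class scores are treated as evidence for a Dirichlet distribution, and the total evidence determines an uncertainty score. The function name `dirichlet_uncertainty` and the example scores are hypothetical.

```python
from typing import List, Tuple


def dirichlet_uncertainty(logits: List[float]) -> Tuple[List[float], float]:
    """Map raw class scores to expected probabilities and an uncertainty score.

    Assumed formulation (not taken from the abstract):
    evidence = ReLU(logit), Dirichlet parameters alpha = evidence + 1.
    """
    evidence = [max(0.0, z) for z in logits]   # non-negative evidence per class
    alpha = [e + 1.0 for e in evidence]        # Dirichlet concentration parameters
    strength = sum(alpha)                      # total Dirichlet strength S
    probs = [a / strength for a in alpha]      # expected class probabilities
    uncertainty = len(logits) / strength       # u = K / S, lies in (0, 1]
    return probs, uncertainty


# Hypothetical scores for a two-class problem (e.g. camel vs. cow).
confident = dirichlet_uncertainty([18.0, 0.0])  # strong evidence for class 0
unsure = dirichlet_uncertainty([0.0, 0.0])      # no evidence for either class
print(confident)
print(unsure)
```

Under this assumed formulation, a prediction backed by little evidence receives an uncertainty close to 1 even when its top class probability looks reasonable. That kind of second-order signal is unavailable to a deterministic softmax model.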