Imbalanced Classification through the Lens of Spurious Correlations
By: Jakob Hackstein, Sidney Bender
Potential Business Impact:
Fixes computer learning when data is uneven.
Class imbalance poses a fundamental challenge in machine learning, frequently leading to unreliable classification performance. While prior methods focus on data- or loss-reweighting schemes, we view imbalance as a data condition that amplifies Clever Hans (CH) effects by underspecification of minority classes. In a counterfactual explanations-based approach, we propose to leverage Explainable AI to jointly identify and eliminate CH effects emerging under imbalance. Our method achieves competitive classification performance on three datasets and demonstrates how CH effects emerge under imbalance, a perspective largely overlooked by existing approaches.
Similar Papers
Mitigating Clever Hans Strategies in Image Classifiers through Generating Counterexamples
Machine Learning (CS)
Teaches computers to learn better, not just guess.
Class-Conditional Distribution Balancing for Group Robust Classification
Machine Learning (CS)
Fixes computer guesses that are wrong for bad reasons.
When Are Learning Biases Equivalent? A Unifying Framework for Fairness, Robustness, and Distribution Shift
Machine Learning (CS)
Fixes computer mistakes for fairness and accuracy.