OxEnsemble: Fair Ensembles for Low-Data Classification
By: Jonathan Rystrøm, Zihao Fu, Chris Russell
Potential Business Impact:
Helps doctors find diseases better with less data.
We address the problem of fair classification in settings where data is scarce and unbalanced across demographic groups. Such low-data regimes are common in domains like medical imaging, where false negatives can have fatal consequences. We propose a novel approach \emph{OxEnsemble} for efficiently training ensembles and enforcing fairness in these low-data regimes. Unlike other approaches, we aggregate predictions across ensemble members, each trained to satisfy fairness constraints. By construction, \emph{OxEnsemble} is both data-efficient, carefully reusing held-out data to enforce fairness reliably, and compute-efficient, requiring little more compute than used to fine-tune or evaluate an existing model. We validate this approach with new theoretical guarantees. Experimentally, our approach yields more consistent outcomes and stronger fairness-accuracy trade-offs than existing methods across multiple challenging medical imaging classification datasets.
Similar Papers
How Ensemble Learning Balances Accuracy and Overfitting: A Bias-Variance Perspective on Tabular Data
Machine Learning (CS)
Makes computer predictions more accurate without mistakes.
Decoupling Bias, Aligning Distributions: Synergistic Fairness Optimization for Deepfake Detection
CV and Pattern Recognition
Makes fake video checkers fair for everyone.
FairFedMed: Benchmarking Group Fairness in Federated Medical Imaging with FairLoRA
Computers and Society
Makes AI treat all patients fairly in hospitals.