AI Alignment in Medical Imaging: Unveiling Hidden Biases Through Counterfactual Analysis
By: Haroui Ma, Francesco Quinzan, Theresa Willem and more
Potential Business Impact:
Checks if AI doctors are fair to everyone.
Machine learning (ML) systems for medical imaging have demonstrated remarkable diagnostic capabilities, but their susceptibility to biases poses significant risks, since biases may negatively impact generalization performance. In this paper, we introduce a novel statistical framework to evaluate the dependency of medical imaging ML models on sensitive attributes, such as demographics. Our method leverages the concept of counterfactual invariance, measuring the extent to which a model's predictions remain unchanged under hypothetical changes to sensitive attributes. We present a practical algorithm that combines conditional latent diffusion models with statistical hypothesis testing to identify and quantify such biases without requiring direct access to counterfactual data. Through experiments on synthetic datasets and large-scale real-world medical imaging datasets, including CheXpert and MIMIC-CXR, we demonstrate that our approach aligns closely with counterfactual fairness principles and outperforms standard baselines. This work provides a robust tool to ensure that ML diagnostic systems generalize well, e.g., across demographic groups, offering a critical step towards AI safety in healthcare. Code: https://github.com/Neferpitou3871/AI-Alignment-Medical-Imaging
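The idea of testing whether predictions stay unchanged under a hypothetical change to a sensitive attribute can be illustrated with a small sketch. This is not the authors' algorithm: it assumes the counterfactual images have already been produced by some generative model and scored by the classifier, and it swaps in a simple paired sign-flip permutation test on the prediction differences. The function name `sign_flip_test` and the synthetic scores are hypothetical, and the test only checks whether the mean prediction shift is zero, which is a weaker condition than full counterfactual invariance.

```python
import numpy as np


def sign_flip_test(pred_factual, pred_counterfactual, n_permutations=2000, seed=0):
    """Paired sign-flip permutation test (illustrative, not the paper's test).

    H0: the differences between factual and counterfactual predictions are
    symmetric around zero, so the sensitive attribute does not shift the
    model's output on average.
    Returns the observed |mean difference| and a permutation p-value.
    """
    rng = np.random.default_rng(seed)
    diffs = (np.asarray(pred_factual, dtype=float)
             - np.asarray(pred_counterfactual, dtype=float))
    observed = abs(diffs.mean())

    # Under H0 each paired difference is equally likely to have either sign,
    # so randomly flipping signs gives draws from the null distribution.
    signs = rng.choice([-1.0, 1.0], size=(n_permutations, diffs.size))
    null_stats = np.abs((signs * diffs).mean(axis=1))

    # Add-one correction keeps the p-value strictly positive.
    p_value = (1 + np.sum(null_stats >= observed)) / (1 + n_permutations)
    return observed, p_value


if __name__ == "__main__":
    rng = np.random.default_rng(42)
    n = 200

    # Synthetic stand-ins for classifier scores on the original images.
    factual = rng.uniform(0.0, 1.0, size=n)

    # Hypothetical model A: scores change only by symmetric noise
    # when the sensitive attribute is altered.
    cf_invariant = factual + rng.normal(0.0, 0.05, size=n)

    # Hypothetical model B: scores shift systematically by 0.1.
    cf_biased = factual + 0.1 + rng.normal(0.0, 0.05, size=n)

    for name, cf in [("invariant", cf_invariant), ("biased", cf_biased)]:
        stat, p = sign_flip_test(factual, cf)
        print(f"{name}: |mean diff| = {stat:.4f}, p-value = {p:.4f}")
```

A small p-value suggests that the model's output depends on the altered attribute. In practice the conclusion is only as reliable as the counterfactual generator, since artifacts introduced by the generator can also shift predictions.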
Similar Papers
Behind the Screens: Uncovering Bias in AI-Driven Video Interview Assessments Using Counterfactuals
Human-Computer Interaction
Checks AI hiring tools for unfairness.
On the Interplay of Human-AI Alignment, Fairness, and Performance Trade-offs in Medical Imaging
CV and Pattern Recognition
Helps AI see patients fairly, not just some.
Understanding and evaluating computer vision models through the lens of counterfactuals
CV and Pattern Recognition
Makes AI fair by testing "what if" scenarios.