State of Abdominal CT Datasets: A Critical Review of Bias, Clinical Relevance, and Real-world Applicability
By: Saeide Danaei, Zahra Dehghanian, Elahe Meftah and more
Potential Business Impact:
Makes AI better at reading body scans.
This systematic review critically evaluates publicly available abdominal CT datasets and their suitability for artificial intelligence (AI) applications in clinical settings. We examined 46 such datasets (50,256 studies). Across all 46 datasets, we found substantial redundancy (59.1% case reuse) and a Western/geographic skew (75.3% from North America and Europe). A bias assessment was performed on the 19 datasets with >=100 cases; within this subset, the most prevalent high-risk categories were domain shift (63%) and selection bias (57%), both of which may undermine model generalizability across diverse healthcare environments, particularly in resource-limited settings. To address these challenges, we propose targeted strategies for dataset improvement, including multi-institutional collaboration, adoption of standardized protocols, and deliberate inclusion of diverse patient populations and imaging technologies. These efforts are crucial to supporting the development of more equitable and clinically robust AI models for abdominal imaging.
Similar Papers
Scaling Tumor Segmentation: Best Lessons from Real and Synthetic Data
CV and Pattern Recognition
Makes AI find tumors with fewer real scans.
AI-Driven Automated Tool for Abdominal CT Body Composition Analysis in Gastrointestinal Cancer Management
Image and Video Processing
Helps doctors quickly measure belly fat in cancer patients.
Limitations of Public Chest Radiography Datasets for Artificial Intelligence: Label Quality, Domain Shift, Bias and Evaluation Challenges
Machine Learning (CS)
Makes AI better at reading X-rays for doctors.