nnMIL: A generalizable multiple instance learning framework for computational pathology
By: Xiangde Luo , Jinxi Xiang , Yuanfeng Ji and more
Potential Business Impact:
Makes AI better at finding diseases in slides.
Computational pathology holds substantial promise for improving diagnosis and guiding treatment decisions. Recent pathology foundation models enable the extraction of rich patch-level representations from large-scale whole-slide images (WSIs), but current approaches for aggregating these features into slide-level predictions remain constrained by design limitations that hinder generalizability and reliability. Here, we developed nnMIL, a simple yet broadly applicable multiple-instance learning framework that connects patch-level foundation models to robust slide-level clinical inference. nnMIL introduces random sampling at both the patch and feature levels, enabling large-batch optimization, task-aware sampling strategies, and efficient and scalable training across datasets and model architectures. A lightweight aggregator performs sliding-window inference to generate ensemble slide-level predictions and supports principled uncertainty estimation. Across 40,000 WSIs encompassing 35 clinical tasks and four pathology foundation models, nnMIL consistently outperformed existing MIL methods for disease diagnosis, histologic subtyping, molecular biomarker detection, and pan- cancer prognosis prediction. It further demonstrated strong cross-model generalization, reliable uncertainty quantification, and robust survival stratification in multiple external cohorts. In conclusion, nnMIL offers a practical and generalizable solution for translating pathology foundation models into clinically meaningful predictions, advancing the development and deployment of reliable AI systems in real-world settings.
Similar Papers
Do Multiple Instance Learning Models Transfer?
CV and Pattern Recognition
Makes cancer diagnosis faster and more accurate.
A Spatially-Aware Multiple Instance Learning Framework for Digital Pathology
Image and Video Processing
Helps doctors find cancer faster and better.
CAPRMIL: Context-Aware Patch Representations for Multiple Instance Learning
CV and Pattern Recognition
Makes cancer diagnosis faster and cheaper.