Do Multiple Instance Learning Models Transfer?
By: Daniel Shao, Richard J. Chen, Andrew H. Song, and more
Potential Business Impact:
Makes cancer diagnosis faster and more accurate.
Multiple Instance Learning (MIL) is a cornerstone approach in computational pathology (CPath) for generating clinically meaningful slide-level embeddings from gigapixel tissue images. However, MIL often struggles with small, weakly supervised clinical datasets. In contrast to fields such as NLP and conventional computer vision, where transfer learning is widely used to address data scarcity, the transferability of MIL models remains poorly understood. In this study, we systematically evaluate the transfer learning capabilities of pretrained MIL models by assessing 11 models across 21 pretraining tasks for morphological and molecular subtype prediction. Our results show that pretrained MIL models, even when trained on different organs than the target task, consistently outperform models trained from scratch. Moreover, pretraining on pancancer datasets enables strong generalization across organs and tasks, outperforming slide foundation models while using substantially less pretraining data. These findings highlight the robust adaptability of MIL models and demonstrate the benefits of leveraging transfer learning to boost performance in CPath. Lastly, we provide a resource that standardizes the implementation of MIL models and collects pretrained model weights for popular CPath tasks, available at https://github.com/mahmoodlab/MIL-Lab
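To make the transfer setup concrete, here is a minimal sketch of one common way to reuse a pretrained MIL model: keep the aggregator that pools patch embeddings into a slide-level embedding, and re-initialize the classification head for the target task. This is an illustration under stated assumptions, not the paper's method or the MIL-Lab API. The simplified linear-score attention pooling and all names (`attention_pool`, `init_model`, `transfer`, `predict_logits`) are hypothetical, and the "pretrained" weights are random stand-ins.

```python
import math
import random


def attention_pool(bag, w_attn):
    """Pool a bag of patch embeddings into one slide-level embedding.

    Simplified attention pooling (an assumption for illustration): each
    patch gets a linear score, scores are softmax-normalized, and the
    slide embedding is the attention-weighted mean of the patches.
    """
    scores = [sum(w * x for w, x in zip(w_attn, patch)) for patch in bag]
    top = max(scores)  # subtract the max for a numerically stable softmax
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    weights = [e / total for e in exps]
    dim = len(bag[0])
    return [sum(a * patch[d] for a, patch in zip(weights, bag)) for d in range(dim)]


def init_model(dim, n_classes, rng):
    """Randomly initialize an aggregator and a linear classification head."""
    return {
        "attn": [rng.gauss(0, 0.1) for _ in range(dim)],
        "head": [[rng.gauss(0, 0.1) for _ in range(dim)] for _ in range(n_classes)],
    }


def transfer(pretrained, n_target_classes, rng):
    """Copy the pretrained aggregator; start a fresh head for the target task."""
    dim = len(pretrained["attn"])
    return {
        "attn": list(pretrained["attn"]),  # reused from the source task
        "head": [[rng.gauss(0, 0.1) for _ in range(dim)] for _ in range(n_target_classes)],
    }


def predict_logits(model, bag):
    """Slide-level logits: pool the bag, then apply the linear head."""
    slide_embedding = attention_pool(bag, model["attn"])
    return [sum(w * x for w, x in zip(row, slide_embedding)) for row in model["head"]]


rng = random.Random(0)
# Stand-in for a model pretrained on a 5-class source task (random weights here).
source_model = init_model(dim=4, n_classes=5, rng=rng)
# Target task with 2 classes: the aggregator is transferred, the head is new.
target_model = transfer(source_model, n_target_classes=2, rng=rng)
# One slide represented as a bag of 6 patch embeddings of dimension 4.
bag = [[rng.gauss(0, 1) for _ in range(4)] for _ in range(6)]
logits = predict_logits(target_model, bag)
```

In practice the transferred model would then be fine-tuned on the target dataset. The sketch covers only the weight-initialization step that distinguishes transfer from training from scratch.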
Similar Papers
A Spatially-Aware Multiple Instance Learning Framework for Digital Pathology
Image and Video Processing
Helps doctors find cancer faster and better.
nnMIL: A generalizable multiple instance learning framework for computational pathology
CV and Pattern Recognition
Makes AI better at finding diseases in slides.
Prototype-Based Multiple Instance Learning for Gigapixel Whole Slide Image Classification
CV and Pattern Recognition
Shows doctors why a computer thinks a picture is sick.