Intuitions of Machine Learning Researchers about Transfer Learning for Medical Image Classification
By: Yucheng Lu, Hubert Dariusz Zając, Veronika Cheplygina and more
Potential Business Impact:
Helps AI learn faster from medical pictures.
Transfer learning is crucial for medical imaging, yet the selection of source datasets, which can affect the generalizability of algorithms and thus patient outcomes, often relies on researchers' intuition rather than systematic principles. This study investigates these decisions through a task-based survey with machine learning practitioners. Unlike prior work that benchmarks models and experimental setups, we take a human-centered HCI perspective on how practitioners select source datasets. Our findings indicate that choices are task-dependent and influenced by community practices, dataset properties, and similarity, whether computational (based on data embeddings) or perceived (visual or semantic). However, similarity ratings and expected performance are not always aligned, challenging the traditional "more similar is better" view. Participants often used ambiguous terminology, which suggests a need for clearer definitions and for HCI tools that make those definitions explicit and usable. By clarifying these heuristics, this work provides practical insights for more systematic source selection in transfer learning.
Similar Papers
Quantifying Dataset Similarity to Guide Transfer Learning
Machine Learning (Stat)
Tells computers if learning from old data helps.
Source-Optimal Training is Transfer-Suboptimal
Machine Learning (Stat)
Improves computer learning by finding the best way to teach it.
Robust Data Fusion via Subsampling
Machine Learning (Stat)
Improves learning from different, messy data.