EvalBlocks: A Modular Pipeline for Rapidly Evaluating Foundation Models in Medical Imaging
By: Jan Tagscherer, Sarah de Boer, Lena Philipp and more
Potential Business Impact:
Helps doctors find diseases faster with better AI.
Developing foundation models in medical imaging requires continuous monitoring of downstream performance. Researchers are burdened with tracking numerous experiments, design choices, and their effects on performance, often relying on ad-hoc, manual workflows that are inherently slow and error-prone. We introduce EvalBlocks, a modular, plug-and-play framework for efficient evaluation of foundation models during development. Built on Snakemake, EvalBlocks supports seamless integration of new datasets, foundation models, aggregation methods, and evaluation strategies. All experiments and results are tracked centrally and are reproducible with a single command, while efficient caching and parallel execution enable scalable use on shared compute infrastructure. Demonstrated on five state-of-the-art foundation models and three medical imaging classification tasks, EvalBlocks streamlines model evaluation, enabling researchers to iterate faster and focus on model innovation rather than evaluation logistics. The framework is released as open source software at https://github.com/DIAGNijmegen/eval-blocks.
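The plug-and-play idea in the abstract (swappable datasets, models, and aggregation methods, with cached and parallel runs) can be illustrated with a minimal Python sketch. This is not EvalBlocks' actual API and it does not use Snakemake. The registries, toy "models", and the names `run_experiment` and `run_grid` are all hypothetical, chosen only to show how a grid of component combinations might be evaluated once and then reused from a cache.

```python
import hashlib
import json
from concurrent.futures import ThreadPoolExecutor
from itertools import product

# Hypothetical plug-in registries; all names and toy functions are illustrative
# assumptions, not components of EvalBlocks.
DATASETS = {"toy_task": [1.0, 2.0, 3.0]}
MODELS = {
    "model_a": lambda xs: [2.0 * x for x in xs],
    "model_b": lambda xs: [x + 1.0 for x in xs],
}
AGGREGATORS = {
    "mean": lambda feats: sum(feats) / len(feats),
    "max": max,
}

CACHE = {}      # config hash -> result
COMPUTED = []   # hashes that were actually computed (not served from cache)


def config_key(config):
    """Return a stable hash of a configuration dict, used as the cache key."""
    payload = json.dumps(config, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


def run_experiment(config):
    """Evaluate one dataset/model/aggregator combination, reusing cached results."""
    key = config_key(config)
    if key in CACHE:
        return CACHE[key]
    features = MODELS[config["model"]](DATASETS[config["dataset"]])
    score = AGGREGATORS[config["aggregator"]](features)
    result = {**config, "score": score}
    CACHE[key] = result
    COMPUTED.append(key)
    return result


def run_grid(max_workers=4):
    """Run every registered combination in parallel and return results in grid order."""
    configs = [
        {"dataset": d, "model": m, "aggregator": a}
        for d, m, a in product(sorted(DATASETS), sorted(MODELS), sorted(AGGREGATORS))
    ]
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return list(pool.map(run_experiment, configs))


if __name__ == "__main__":
    for row in run_grid():
        print(row)
```

In this sketch, adding a new model or aggregation method means adding one registry entry, and a second call to `run_grid()` returns the cached results without recomputation. A workflow engine such as Snakemake would typically track results as files on disk rather than in an in-memory dictionary.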
Similar Papers
A Deep Learning Framework for Real-Time Image Processing in Medical Diagnostics: Enhancing Accuracy and Speed in Clinical Applications
CV and Pattern Recognition
Helps doctors find sickness faster with AI.
Accelerating Data Processing and Benchmarking of AI Models for Pathology
CV and Pattern Recognition
Helps doctors find diseases faster with smart computer tools.
Atlas 2 -- Foundation models for clinical deployment
CV and Pattern Recognition
Helps doctors see diseases better in tissue pictures.