SLIM-Brain: A Data- and Training-Efficient Foundation Model for fMRI Data Analysis
By: Mo Wang , Junfeng Xia , Wenhao Ye and more
Potential Business Impact:
Reads brain scans faster, using less computer power.
Foundation models are emerging as a powerful paradigm for fMRI analysis, but current approaches face a dual bottleneck of data- and training-efficiency. Atlas-based methods aggregate voxel signals into fixed regions of interest, reducing data dimensionality but discarding fine-grained spatial details, and requiring extremely large cohorts to train effectively as general-purpose foundation models. Atlas-free methods, on the other hand, operate directly on voxel-level information - preserving spatial fidelity but are prohibitively memory- and compute-intensive, making large-scale pre-training infeasible. We introduce SLIM-Brain (Sample-efficient, Low-memory fMRI Foundation Model for Human Brain), a new atlas-free foundation model that simultaneously improves both data- and training-efficiency. SLIM-Brain adopts a two-stage adaptive design: (i) a lightweight temporal extractor captures global context across full sequences and ranks data windows by saliency, and (ii) a 4D hierarchical encoder (Hiera-JEPA) learns fine-grained voxel-level representations only from the top-$k$ selected windows, while deleting about 70% masked patches. Extensive experiments across seven public benchmarks show that SLIM-Brain establishes new state-of-the-art performance on diverse tasks, while requiring only 4 thousand pre-training sessions and approximately 30% of GPU memory comparing to traditional voxel-level methods.
Similar Papers
Towards Generalisable Foundation Models for 3D Brain MRI
CV and Pattern Recognition
Helps doctors find brain problems from scans.
A Modality-agnostic Multi-task Foundation Model for Human Brain Imaging
CV and Pattern Recognition
Makes brain scans work better, no matter how they're taken.
Bridging Foundation Models and Efficient Architectures: A Modular Brain Imaging Framework with Local Masking and Pretrained Representation Learning
Neurons and Cognition
Predicts age and smarts from brain scans.