fMRI-LM: Towards a Universal Foundation Model for Language-Aligned fMRI Understanding
By: Yuxiang Wei , Yanteng Zhang , Xi Xiao and more
Potential Business Impact:
Reads thoughts from brain scans using language.
Recent advances in multimodal large language models (LLMs) have enabled unified reasoning across images, audio, and video, but extending such capability to brain imaging remains largely unexplored. Bridging this gap is essential to link neural activity with semantic cognition and to develop cross-modal brain representations. To this end, we present fMRI-LM, a foundational model that bridges functional MRI (fMRI) and language through a three-stage framework. In Stage 1, we learn a neural tokenizer that maps fMRI into discrete tokens embedded in a language-consistent space. In Stage 2, a pretrained LLM is adapted to jointly model fMRI tokens and text, treating brain activity as a sequence that can be temporally predicted and linguistically described. To overcome the lack of natural fMRI-text pairs, we construct a large descriptive corpus that translates diverse imaging-based features into structured textual descriptors, capturing the low-level organization of fMRI signals. In Stage 3, we perform multi-task, multi-paradigm instruction tuning to endow fMRI-LM with high-level semantic understanding, supporting diverse downstream applications. Across various benchmarks, fMRI-LM achieves strong zero-shot and few-shot performance, and adapts efficiently with parameter-efficient tuning (LoRA), establishing a scalable pathway toward a language-aligned, universal model for structural and semantic understanding of fMRI.
Similar Papers
Language models align with brain regions that represent concepts across modalities
Computation and Language
Computers understand ideas, not just words.
Leveraging LLMs for Early Alzheimer's Prediction
Computation and Language
Finds Alzheimer's early using brain scans.
Scaling and context steer LLMs along the same computational path as the human brain
Machine Learning (CS)
Brain and AI process information in a similar order.