A Survey on fMRI-based Brain Decoding for Reconstructing Multimodal Stimuli
By: Pengyu Liu, Guohua Dong, Dan Guo and more
Potential Business Impact:
Lets computers see what you see from brain scans.
In daily life, we encounter diverse external stimuli, such as images, sounds, and videos. As research in multimodal stimuli and neuroscience advances, fMRI-based brain decoding has become a key tool for understanding brain perception and its complex cognitive processes. Decoding brain signals to reconstruct stimuli not only reveals intricate neural mechanisms but also drives progress in AI, disease treatment, and brain-computer interfaces. Recent advancements in neuroimaging and image generation models have significantly improved fMRI-based decoding. While fMRI offers high spatial resolution for precise brain activity mapping, its low temporal resolution and signal noise pose challenges. Meanwhile, techniques like GANs, VAEs, and diffusion models have enhanced reconstructed image quality, and multimodal pre-trained models have boosted cross-modal decoding tasks. This survey systematically reviews recent progress in fMRI-based brain decoding, focusing on stimulus reconstruction from passive brain signals. It summarizes datasets and relevant brain regions, and categorizes existing methods by model structure. It also evaluates the performance of these models and discusses their effectiveness. Finally, it identifies key challenges and proposes future research directions. For more information and resources related to this survey, visit https://github.com/LpyNow/BrainDecodingImage.
Similar Papers
Seeing Through the Brain: New Insights from Decoding Visual Stimuli with fMRI
CV and Pattern Recognition
Lets computers see what you see from brain scans.
Deep Neural Encoder-Decoder Model to Relate fMRI Brain Activity with Naturalistic Stimuli
CV and Pattern Recognition
Reconstructs movies from brain scans.
Unified Multimodal Brain Decoding via Cross-Subject Soft-ROI Fusion
Machine Learning (CS)
Reads minds to describe what you see.