Surprisal from Larger Transformer-based Language Models Predicts fMRI Data More Poorly
By: Yi-Chien Lin, William Schuler
Potential Business Impact:
Brain scans show that bigger language models predict human reading more poorly.
As Transformers become more widely incorporated into natural language processing tasks, there has been considerable interest in using surprisal from these models as predictors of human sentence processing difficulty. Recent work has observed a positive relationship between Transformer-based models' perplexity and the predictive power of their surprisal estimates on reading times, showing that language models with more parameters and trained on more data are less predictive of human reading times. However, these studies focus on predicting latency-based measures (i.e., self-paced reading times and eye-gaze durations) with surprisal estimates from Transformer-based language models, and this trend has not been tested on brain imaging data. This study therefore evaluates the predictive power of surprisal estimates from 17 pre-trained Transformer-based models across three different model families on two functional magnetic resonance imaging datasets. Results show that the positive relationship between model perplexity and model fit still obtains, suggesting that this trend is not specific to latency-based measures and generalizes to neural measures.
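As a rough illustration of the kind of pipeline the abstract describes, the following is a minimal sketch, not the authors' code: it computes per-word surprisal from a small pre-trained Transformer and checks whether adding surprisal to a baseline regressor improves the fit to a neural signal. It assumes the Hugging Face transformers library; the "gpt2" checkpoint, the example sentence, and the simulated BOLD vector are all illustrative placeholders, since the actual evaluation regresses surprisal against measured fMRI time series from 17 models across families such as GPT-2, GPT-Neo, and OPT.

```python
# Minimal sketch (illustrative assumptions throughout, not the authors' pipeline):
# per-word surprisal from a pre-trained Transformer, then a toy regression fit.
import math

import numpy as np
import torch
from transformers import GPT2LMHeadModel, GPT2TokenizerFast


def word_surprisals(text, model, tokenizer):
    """Return (word, surprisal-in-bits) pairs, summing over sub-word tokens."""
    words = text.split()
    # Character span of each whitespace-delimited word, for token alignment.
    spans, pos = [], 0
    for w in words:
        start = text.index(w, pos)
        spans.append((start, start + len(w)))
        pos = start + len(w)

    enc = tokenizer(text, return_tensors="pt", return_offsets_mapping=True)
    offsets = enc.pop("offset_mapping")[0].tolist()
    with torch.no_grad():
        logits = model(**enc).logits  # (1, seq_len, vocab_size)
    log_probs = torch.log_softmax(logits, dim=-1)
    ids = enc["input_ids"][0]

    # Surprisal of token t is -log2 P(token_t | tokens_<t); the very first
    # token has no left context, so it is skipped here.
    word_surp = [0.0] * len(words)
    for t in range(1, len(ids)):
        s_t = -log_probs[0, t - 1, ids[t]].item() / math.log(2.0)
        last_char = offsets[t][1] - 1  # last character this token covers
        for w_i, (a, b) in enumerate(spans):
            if a <= last_char < b:
                word_surp[w_i] += s_t
                break
    return list(zip(words, word_surp))


if __name__ == "__main__":
    tok = GPT2TokenizerFast.from_pretrained("gpt2")
    lm = GPT2LMHeadModel.from_pretrained("gpt2").eval()
    pairs = word_surprisals("The old man the boats near the rocky shore.", lm, tok)
    surp = np.array([s for _, s in pairs])
    word_len = np.array([float(len(w)) for w, _ in pairs])

    # Simulated BOLD response (assumption for the sketch); real evaluations
    # regress surprisal against measured fMRI time series.
    rng = np.random.default_rng(0)
    bold = 0.4 * surp + 0.2 * word_len + rng.normal(size=len(surp))

    def r_squared(X, y):
        X = np.column_stack([np.ones(len(y)), X])
        beta, *_ = np.linalg.lstsq(X, y, rcond=None)
        return 1.0 - np.var(y - X @ beta) / np.var(y)

    base = r_squared(word_len[:, None], bold)
    full = r_squared(np.column_stack([word_len, surp]), bold)
    # Repeating this per model and correlating each model's perplexity with
    # its R^2 gain gives the perplexity-fit relationship the paper tests.
    print(f"baseline R^2 = {base:.3f}; with surprisal R^2 = {full:.3f}")
```

Summing sub-word surprisals up to the word level, as done above, is the standard convention when aligning language-model probabilities with word-level human measures.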
Similar Papers
The Inverse Scaling Effect of Pre-Trained Language Model Surprisal Is Not Due to Data Leakage
Computation and Language
Bigger models predict reading times worse, and not because they cheated.
Vectors from Larger Language Models Predict Human Reading Time and fMRI Data More Poorly when Dimensionality Expansion is Controlled
Computation and Language
Bigger models' word vectors predict human reading worse.
Modeling cognitive processes of natural reading with transformer-based Language Models
Computation and Language
Helps computers understand how people read.