Disaggregation Reveals Hidden Training Dynamics: The Case of Agreement Attraction
By: James A. Michaelov, Catherine Arnett
Potential Business Impact:
Reveals when language models shift from shortcut heuristics to genuine grammatical rules during training.
Language models generally produce grammatical text, but they are more likely to make errors in certain contexts. Drawing on paradigms from psycholinguistics, we carry out a fine-grained analysis of those errors in different syntactic contexts. We demonstrate that by disaggregating over the conditions of carefully constructed datasets and comparing model performance on each over the course of training, it is possible to better understand the intermediate stages of grammatical learning in language models. Specifically, we identify distinct phases of training where language model behavior aligns with specific heuristics such as word frequency and local context rather than generalized grammatical rules. We argue that taking this approach to analyzing language model behavior more generally can serve as a powerful tool for understanding the intermediate learning phases, overall training dynamics, and the specific generalizations learned by language models.
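To make the method concrete, here is a minimal sketch of what condition-disaggregated evaluation across training checkpoints could look like. It is not the authors' code: the Pythia model and step revisions are an assumption about how one might access intermediate checkpoints (Pythia exposes them via the Hugging Face `revision` argument), and the agreement-attraction stimuli, the `STIMULI` dictionary, and the helper functions `continuation_logprob` and `accuracy_by_condition` are illustrative stand-ins for the paper's actual materials.

```python
# A minimal sketch of condition-disaggregated evaluation over training.
# Assumptions (not from the paper): Pythia checkpoints accessed via the
# Hugging Face `revision` argument, and toy agreement-attraction stimuli.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Toy agreement-attraction items: each condition crosses subject number with
# the number of an intervening "attractor" noun. The grammatical continuation
# agrees with the subject, never the attractor.
STIMULI = {
    "sing_subj_sing_attractor": ("The key to the cabinet", " is", " are"),
    "sing_subj_plur_attractor": ("The key to the cabinets", " is", " are"),
    "plur_subj_sing_attractor": ("The keys to the cabinet", " are", " is"),
    "plur_subj_plur_attractor": ("The keys to the cabinets", " are", " is"),
}

def continuation_logprob(model, tokenizer, prefix, continuation):
    """Summed log-probability of `continuation` given `prefix`."""
    prefix_ids = tokenizer(prefix, return_tensors="pt").input_ids
    cont_ids = tokenizer(continuation, return_tensors="pt").input_ids
    input_ids = torch.cat([prefix_ids, cont_ids], dim=1)
    with torch.no_grad():
        logits = model(input_ids).logits
    log_probs = torch.log_softmax(logits, dim=-1)
    total = 0.0
    # Each continuation token is predicted from the preceding position.
    for i in range(cont_ids.shape[1]):
        pos = prefix_ids.shape[1] + i - 1
        total += log_probs[0, pos, cont_ids[0, i]].item()
    return total

def accuracy_by_condition(checkpoint, revision):
    """1.0 per condition if the grammatical verb is preferred (one toy item
    each here; a real evaluation would average over many items)."""
    tokenizer = AutoTokenizer.from_pretrained(checkpoint, revision=revision)
    model = AutoModelForCausalLM.from_pretrained(checkpoint, revision=revision)
    model.eval()
    results = {}
    for condition, (prefix, gram, ungram) in STIMULI.items():
        good = continuation_logprob(model, tokenizer, prefix, gram)
        bad = continuation_logprob(model, tokenizer, prefix, ungram)
        results[condition] = float(good > bad)
    return results

if __name__ == "__main__":
    # Report each condition separately at several training steps rather than
    # a single aggregate accuracy.
    for step in ["step1000", "step10000", "step143000"]:
        scores = accuracy_by_condition("EleutherAI/pythia-70m", revision=step)
        print(step, scores)
```

Plotting these per-condition scores across checkpoints is what makes intermediate phases visible: a model relying on the local-context heuristic the abstract describes would, for example, lag on the mismatch conditions (singular subject with plural attractor) while already succeeding on the matched ones.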
Similar Papers
Different types of syntactic agreement recruit the same units within large language models
Computation and Language
Shows that different kinds of syntactic agreement are handled by overlapping units inside large language models.
Language Model Behavioral Phases are Consistent Across Architecture, Training Data, and Scale
Computation and Language
Finds that language models pass through similar behavioral phases regardless of architecture, training data, or scale.
Beyond the Rosetta Stone: Unification Forces in Generalization Dynamics
Computation and Language
Examines the forces that push models to unify what they learn across languages as they generalize.