Understanding or Memorizing? A Case Study of German Definite Articles in Language Models
By: Jonathan Drechsel, Erisa Bytyqi, Steffen Herbold
Language models perform well on grammatical agreement, but it is unclear whether this reflects rule-based generalization or memorization. We study this question for German definite singular articles, whose forms depend on gender and case. Using GRADIEND, a gradient-based interpretability method, we learn parameter update directions for gender-case specific article transitions. We find that updates learned for a specific gender-case article transition frequently affect unrelated gender-case settings, with substantial overlap among the most affected neurons across settings. These results argue against a strictly rule-based encoding of German definite articles, indicating that models at least partly rely on memorized associations rather than abstract grammatical rules.
Similar Papers
Do Large Language Models Grasp The Grammar? Evidence from Grammar-Book-Guided Probing in Luxembourgish
Computation and Language
Tests if computers truly understand language rules.
Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models
Computation and Language
AI pictures change based on word gender.
Beyond Content: How Grammatical Gender Shapes Visual Representation in Text-to-Image Models
Computation and Language
AI pictures change based on word gender.