Language Models Model Language
By: Łukasz Borchmann
Potential Business Impact:
Makes AI understand language by counting word use.
Linguistic commentary on LLMs, heavily influenced by the theoretical frameworks of de Saussure and Chomsky, is often speculative and unproductive. Critics challenge whether LLMs can legitimately model language, citing the need for "deep structure" or "grounding" to achieve an idealized linguistic "competence." We argue for a radical shift in perspective towards the empiricist principles of Witold Mańczak, a prominent general and historical linguist. He defines language not as a "system of signs" or a "computational system of the brain" but as the totality of all that is said and written. Above all, he identifies frequency of use of particular language elements as language's primary governing principle. Using his framework, we challenge prior critiques of LLMs and provide a constructive guide for designing, evaluating, and interpreting language models.
Similar Papers
Under the Shadow of Babel: How Language Shapes Reasoning in LLMs
Computation and Language
Computers learn thinking habits from languages.
Pragmatics beyond humans: meaning, communication, and LLMs
Computation and Language
Helps computers understand how we really talk.
Integrating LLM in Agent-Based Social Simulation: Opportunities and Challenges
Artificial Intelligence
Lets computer characters act more like real people.