Dancing with Deer: A Constructional Perspective on MWEs in the Era of LLMs
By: Claire Bonial, Julia Bonn, Harish Tayyar Madabushi
Potential Business Impact:
Helps computers learn new phrases like people do.
In this chapter, we argue for the benefits of understanding multiword expressions from the perspective of usage-based, construction grammar approaches. We begin with a historical overview of how construction grammar was developed in order to account for idiomatic expressions using the same grammatical machinery as the non-idiomatic structures of language. We cover a comprehensive description of constructions, which are pairings of meaning with form of any size (morpheme, word, phrase), as well as how constructional approaches treat the acquisition and generalization of constructions. We describe a successful case study leveraging constructional templates for representing multiword expressions in English PropBank. Because constructions can be at any level or unit of form, we then illustrate the benefit of a constructional representation of multi-meaningful morphosyntactic unit constructions in Arapaho, a highly polysynthetic and agglutinating language. We include a second case study leveraging constructional templates for representing these multi-morphemic expressions in Uniform Meaning Representation. Finally, we demonstrate the similarities and differences between a usage-based explanation of a speaker learning a novel multiword expression, such as "dancing with deer," and that of a large language model. We present experiments showing that both models and speakers can generalize the meaning of novel multiword expressions based on a single exposure of usage. However, only speakers can reason over the combination of two such expressions, as this requires comparison of the novel forms to a speaker's lifetime of stored constructional exemplars, which are rich with cross-modal details.
Similar Papers
Evaluating the Impact of Verbal Multiword Expressions on Machine Translation
Computation and Language
Fixes computer translations of tricky phrases.
An Empirical Study on Chinese Character Decomposition in Multiword Expression-Aware Neural Machine Translation
Computation and Language
Helps computers understand Chinese word meanings better.
Do Construction Distributions Shape Formal Language Learning In German BabyLMs?
Computation and Language
Helps computers learn language like babies.