Score: 2

Quantifying Phonosemantic Iconicity Distributionally in 6 Languages

Published: October 15, 2025 | arXiv ID: 2510.14040v1

By: George Flint, Kaustubh Kislay

BigTech Affiliations: University of California, Berkeley

Potential Business Impact:

Words sound like what they mean.

Business Areas:

Semantic Web Internet Services

Language is, as commonly theorized, largely arbitrary. Yet, systematic relationships between phonetics and semantics have been observed in many specific cases. To what degree could those systematic relationships manifest themselves in large scale, quantitative investigations--both in previously identified and unidentified phenomena? This work undertakes a distributional approach to quantifying phonosemantic iconicity at scale across 6 diverse languages (English, Spanish, Hindi, Finnish, Turkish, and Tamil). In each language, we analyze the alignment of morphemes' phonetic and semantic similarity spaces with a suite of statistical measures, and discover an array of interpretable phonosemantic alignments not previously identified in the literature, along with crosslinguistic patterns. We also analyze 5 previously hypothesized phonosemantic alignments, finding support for some such alignments and mixed results for others.

Do Language Models Associate Sound with Meaning? A Multimodal Study of Sound Symbolism

Computation and Language

Computers learn how sounds match meanings like humans.

13 Nov 2025 1

87%

Most over-representation of phonological features in basic vocabulary disappears when controlling for spatial and phylogenetic effects

Computation and Language

Makes language sound patterns less common than thought.

8 Dec 2025 0

87%

A new kid on the block: Distributional semantics predicts the word-specific tone signatures of monosyllabic words in conversational Taiwan Mandarin

Computation and Language

Word meanings change how we say words.

21 Nov 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Repos / Data Links

github.com

Page Count

19 pages

Quantifying Phonosemantic Iconicity Distributionally in 6 Languages

Words sound like what they mean.

Technical Abstract

Do Language Models Associate Sound with Meaning? A Multimodal Study of Sound Symbolism

Most over-representation of phonological features in basic vocabulary disappears when controlling for spatial and phylogenetic effects

A new kid on the block: Distributional semantics predicts the word-specific tone signatures of monosyllabic words in conversational Taiwan Mandarin