Language-agnostic, automated assessment of listeners' speech recall using large language models
By: Björn Herrmann
Potential Business Impact:
Helps doctors test how well people understand stories.
Speech-comprehension difficulties are common among older people. Standard speech tests do not fully capture such difficulties because the tests poorly resemble the context-rich, story-like nature of ongoing conversation and are typically available only in a country's dominant/official language (e.g., English), leading to inaccurate scores for native speakers of other languages. Assessments for naturalistic, story speech in multiple languages require accurate, time-efficient scoring. The current research leverages modern large language models (LLMs) in native English speakers and native speakers of 10 other languages to automate the generation of high-quality, spoken stories and scoring of speech recall in different languages. Participants listened to and freely recalled short stories (in quiet/clear and in babble noise) in their native language. LLM text-embeddings and LLM prompt engineering with semantic similarity analyses to score speech recall revealed sensitivity to known effects of temporal order, primacy/recency, and background noise, and high similarity of recall scores across languages. The work overcomes limitations associated with simple speech materials and testing of closed native-speaker groups because recall data of varying length and details can be mapped across languages with high accuracy. The full automation of speech generation and recall scoring provides an important step towards comprehension assessments of naturalistic speech with clinical applicability.
Similar Papers
Evaluating Spoken Language as a Biomarker for Automated Screening of Cognitive Impairment
Machine Learning (CS)
Listens to voices to find memory problems early.
Audio Large Language Models Can Be Descriptive Speech Quality Evaluators
Sound
Helps computers understand if speech sounds good.
Speech-Based Cognitive Screening: A Systematic Evaluation of LLM Adaptation Strategies
Computation and Language
Helps find Alzheimer's by listening to speech.