Language-agnostic, automated assessment of listeners' speech recall using large language models

Published: March 2, 2025 | arXiv ID: 2503.01045v1

By: Björn Herrmann

Potential Business Impact:

Helps doctors test how well people understand stories.

Business Areas:

Natural Language Processing, Artificial Intelligence, Data and Analytics, Software

Speech-comprehension difficulties are common among older people. Standard speech tests do not fully capture such difficulties because the tests poorly resemble the context-rich, story-like nature of ongoing conversation and are typically available only in a country's dominant/official language (e.g., English), leading to inaccurate scores for native speakers of other languages. Assessments for naturalistic, story speech in multiple languages require accurate, time-efficient scoring. The current research leverages modern large language models (LLMs) in native English speakers and native speakers of 10 other languages to automate the generation of high-quality, spoken stories and scoring of speech recall in different languages. Participants listened to and freely recalled short stories (in quiet/clear and in babble noise) in their native language. LLM text-embeddings and LLM prompt engineering with semantic similarity analyses to score speech recall revealed sensitivity to known effects of temporal order, primacy/recency, and background noise, and high similarity of recall scores across languages. The work overcomes limitations associated with simple speech materials and testing of closed native-speaker groups because recall data of varying length and details can be mapped across languages with high accuracy. The full automation of speech generation and recall scoring provides an important step towards comprehension assessments of naturalistic speech with clinical applicability.
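The embedding-based scoring mentioned in the abstract can be illustrated with a minimal sketch. This is not the paper's implementation: `toy_embed` is a hypothetical hashed bag-of-words stand-in for an LLM text-embedding model, and `score_recall` and the segment-wise "best match" rule are assumptions made for illustration. A real cross-language setup would presumably swap in a multilingual embedding model through the `embed` parameter.

```python
import hashlib
import math
import re


def toy_embed(text, dim=64):
    """Hypothetical stand-in for an LLM embedding: a hashed bag-of-words vector."""
    vec = [0.0] * dim
    for token in re.findall(r"\w+", text.lower()):
        # Map each token to a fixed bucket so the result is deterministic.
        idx = int(hashlib.md5(token.encode("utf-8")).hexdigest(), 16) % dim
        vec[idx] += 1.0
    return vec


def cosine(a, b):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def score_recall(story_segments, recall_segments, embed=toy_embed):
    """For each story segment, return its highest similarity to any recall segment."""
    recall_vecs = [embed(r) for r in recall_segments]
    scores = []
    for segment in story_segments:
        seg_vec = embed(segment)
        best = max((cosine(seg_vec, r) for r in recall_vecs), default=0.0)
        scores.append(best)
    return scores


if __name__ == "__main__":
    story = ["The dog ran to the park.", "It rained all afternoon."]
    recall = ["the dog ran to the park"]
    print(score_recall(story, recall))
```

Scoring per story segment keeps the position of each segment available, which is what an analysis of temporal order or primacy/recency effects would need; a single whole-story similarity would discard that information.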

Evaluating Spoken Language as a Biomarker for Automated Screening of Cognitive Impairment

Machine Learning (CS)

Listens to voices to find memory problems early.

30 Jan 2025 0

88%

Audio Large Language Models Can Be Descriptive Speech Quality Evaluators

Sound

Helps computers understand if speech sounds good.

27 Jan 2025 1

88%

Speech-Based Cognitive Screening: A Systematic Evaluation of LLM Adaptation Strategies

Computation and Language

Helps find Alzheimer's by listening to speech.

24 Aug 2025 0

Repos / Data Links

github.com

Page Count

37 pages