Score: 1

Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech

Published: June 9, 2025 | arXiv ID: 2506.11119v1

By: Jingyu Li , Lingchao Mao , Hairong Wang and more

Potential Business Impact:

Finds Alzheimer's early from talking.

Business Areas:

Speech Recognition Data and Analytics, Software

Background: Alzheimer's disease and related dementias (ADRD) are progressive neurodegenerative conditions where early detection is vital for timely intervention and care. Spontaneous speech contains rich acoustic and linguistic markers that may serve as non-invasive biomarkers for cognitive decline. Foundation models, pre-trained on large-scale audio or text data, produce high-dimensional embeddings encoding contextual and acoustic features. Methods: We used the PREPARE Challenge dataset, which includes audio recordings from over 1,600 participants with three cognitive statuses: healthy control (HC), mild cognitive impairment (MCI), and Alzheimer's Disease (AD). We excluded non-English, non-spontaneous, or poor-quality recordings. The final dataset included 703 (59.13%) HC, 81 (6.81%) MCI, and 405 (34.06%) AD cases. We benchmarked a range of open-source foundation speech and language models to classify cognitive status into the three categories. Results: The Whisper-medium model achieved the highest performance among speech models (accuracy = 0.731, AUC = 0.802). Among language models, BERT with pause annotation performed best (accuracy = 0.662, AUC = 0.744). ADRD detection using state-of-the-art automatic speech recognition (ASR) model-generated audio embeddings outperformed others. Including non-semantic features like pause patterns consistently improved text-based classification. Conclusion: This study introduces a benchmarking framework using foundation models and a clinically relevant dataset. Acoustic-based approaches -- particularly ASR-derived embeddings -- demonstrate strong potential for scalable, non-invasive, and cost-effective early detection of ADRD.

LLMCARE: Alzheimer's Detection via Transformer Models Enhanced by LLM-Generated Synthetic Data

Computation and Language

Finds early signs of memory loss in voices.

8 Aug 2025 0

90%

Speech-Based Cognitive Screening: A Systematic Evaluation of LLM Adaptation Strategies

Computation and Language

Helps find Alzheimer's by listening to speech.

24 Aug 2025 0

89%

CAtCh: Cognitive Assessment through Cookie Thief

Machine Learning (CS)

Listens to voices to find thinking problems.

7 Jun 2025 1

View PDF Login to Bookmark

Country of Origin

🇺🇸 United States

Page Count

34 pages

Benchmarking Foundation Speech and Language Models for Alzheimer's Disease and Related Dementia Detection from Spontaneous Speech

Finds Alzheimer's early from talking.

Technical Abstract

LLMCARE: Alzheimer's Detection via Transformer Models Enhanced by LLM-Generated Synthetic Data

Speech-Based Cognitive Screening: A Systematic Evaluation of LLM Adaptation Strategies

CAtCh: Cognitive Assessment through Cookie Thief