Score: 0

ASR Under the Stethoscope: Evaluating Biases in Clinical Speech Recognition across Indian Languages

Published: November 30, 2025 | arXiv ID: 2512.10967v1

By: Subham Kumar , Prakrithi Shivaprakash , Abhishek Manoharan and more

Potential Business Impact:

Helps doctors understand patient voices in India.

Business Areas:

Speech Recognition Data and Analytics, Software

Automatic Speech Recognition (ASR) is increasingly used to document clinical encounters, yet its reliability in multilingual and demographically diverse Indian healthcare contexts remains largely unknown. In this study, we conduct the first systematic audit of ASR performance on real world clinical interview data spanning Kannada, Hindi, and Indian English, comparing leading models including Indic Whisper, Whisper, Sarvam, Google speech to text, Gemma3n, Omnilingual, Vaani, and Gemini. We evaluate transcription accuracy across languages, speakers, and demographic subgroups, with a particular focus on error patterns affecting patients vs. clinicians and gender based or intersectional disparities. Our results reveal substantial variability across models and languages, with some systems performing competitively on Indian English but failing on code mixed or vernacular speech. We also uncover systematic performance gaps tied to speaker role and gender, raising concerns about equitable deployment in clinical settings. By providing a comprehensive multilingual benchmark and fairness analysis, our work highlights the need for culturally and demographically inclusive ASR development for healthcare ecosystem in India.

Benchmarking Automatic Speech Recognition Models for African Languages

Computation and Language

Helps computers understand many African languages.

30 Nov 2025 1

90%

Bridging the Reality Gap: Efficient Adaptation of ASR systems for Challenging Low-Resource Domains

Computation and Language

Makes doctors' notes understandable by computers.

18 Dec 2025 1

90%

Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling

Computation and Language

Helps computers understand non-native English speakers better.

10 Mar 2025 0

View PDF Login to Bookmark

Page Count

17 pages

ASR Under the Stethoscope: Evaluating Biases in Clinical Speech Recognition across Indian Languages

Helps doctors understand patient voices in India.

Technical Abstract

Benchmarking Automatic Speech Recognition Models for African Languages

Bridging the Reality Gap: Efficient Adaptation of ASR systems for Challenging Low-Resource Domains

Automatic Speech Recognition for Non-Native English: Accuracy and Disfluency Handling