Score: 1

Navigating the Reality Gap: Privacy-Preserving Adaptation of ASR for Challenging Low-Resource Domains

Published: December 18, 2025 | arXiv ID: 2512.16401v2

By: Darshil Chauhan , Adityasinh Solanki , Vansh Patel and more

Potential Business Impact:

Helps doctors record patient notes in noisy places.

Business Areas:

Speech Recognition Data and Analytics, Software

Automatic Speech Recognition (ASR) holds immense potential to assist in clinical documentation and patient report generation, particularly in resource-constrained regions. However, deployment is currently hindered by a technical deadlock: a severe "Reality Gap" between laboratory performance and noisy, real-world clinical audio, coupled with strict privacy and resource constraints. We quantify this gap, showing that a robust multilingual model (IndicWav2Vec) degrades to a 40.94% WER on rural clinical data from India, rendering it unusable. To address this, we explore a zero-data-exfiltration framework enabling localized, continual adaptation via Low-Rank Adaptation (LoRA). We conduct a rigorous investigative study of continual learning strategies, characterizing the trade-offs between data-driven and parameter-driven stability. Our results demonstrate that multi-domain Experience Replay (ER) yields the primary performance gains, achieving a 17.1% relative improvement in target WER and reducing catastrophic forgetting by 55% compared to naive adaptation. Furthermore, we observed that standard Elastic Weight Consolidation (EWC) faced numerical stability challenges when applied to LoRA in noisy environments. Our experiments show that a stabilized, linearized formulation effectively controls gradient magnitudes and enables stable convergence. Finally, we verify via a domain-specific spot check that acoustic adaptation is a fundamental prerequisite for usability which cannot be bypassed by language models alone.

Bridging the Reality Gap: Efficient Adaptation of ASR systems for Challenging Low-Resource Domains

Computation and Language

Makes doctors' notes understandable by computers.

18 Dec 2025 1

89%

Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR

Sound

Helps computers understand non-native English speakers better.

12 Oct 2025 1

89%

Automatic Speech Recognition for African Low-Resource Languages: Challenges and Future Directions

Computation and Language

Helps computers understand African languages better.

16 May 2025 0

View PDF Login to Bookmark

Country of Origin

🇮🇳 India

Page Count

17 pages

Navigating the Reality Gap: Privacy-Preserving Adaptation of ASR for Challenging Low-Resource Domains

Helps doctors record patient notes in noisy places.

Technical Abstract

Bridging the Reality Gap: Efficient Adaptation of ASR systems for Challenging Low-Resource Domains

Proficiency-Aware Adaptation and Data Augmentation for Robust L2 ASR

Automatic Speech Recognition for African Low-Resource Languages: Challenges and Future Directions