Score: 3

PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech

Published: December 29, 2025 | arXiv ID: 2512.23686v1

By: Deepak Babu Piskala

Potential Business Impact:

Helps computers understand important words better.

Business Areas:

Speech Recognition Data and Analytics, Software

Automatic Speech Recognition (ASR) in professional settings faces challenges that existing benchmarks underplay: dense domain terminology, formal register variation, and near-zero tolerance for critical entity errors. We present ProfASR-Bench, a professional-talk evaluation suite for high-stakes applications across finance, medicine, legal, and technology. Each example pairs a natural-language prompt (domain cue and/or speaker profile) with an entity-rich target utterance, enabling controlled measurement of context-conditioned recognition. The corpus supports conventional ASR metrics alongside entity-aware scores and slice-wise reporting by accent and gender. Using representative families Whisper (encoder-decoder ASR) and Qwen-Omni (audio language models) under matched no-context, profile, domain+profile, oracle, and adversarial conditions, we find a consistent pattern: lightweight textual context produces little to no change in average word error rate (WER), even with oracle prompts, and adversarial prompts do not reliably degrade performance. We term this the context-utilization gap (CUG): current systems are nominally promptable yet underuse readily available side information. ProfASR-Bench provides a standardized context ladder, entity- and slice-aware reporting with confidence intervals, and a reproducible testbed for comparing fusion strategies across model families. Dataset: https://huggingface.co/datasets/prdeepakbabu/ProfASR-Bench Code: https://github.com/prdeepakbabu/ProfASR-Bench

ContextASR-Bench: A Massive Contextual Speech Recognition Benchmark

Audio and Speech Processing

Helps computers understand speech with special words.

8 Jul 2025 2

87%

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM

Sound

Helps voice assistants understand names and rare words.

25 May 2025 1

87%

Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition

Computation and Language

Listens better to long talks, even with noise.

14 Nov 2025 2

View PDF Login to Bookmark

Repos / Data Links

github.com huggingface.co

Page Count

11 pages

PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech

Helps computers understand important words better.

Technical Abstract

ContextASR-Bench: A Massive Contextual Speech Recognition Benchmark

BR-ASR: Efficient and Scalable Bias Retrieval Framework for Contextual Biasing ASR in Speech LLM

Speech-Aware Long Context Pruning and Integration for Contextualized Automatic Speech Recognition