PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech

Deepak Babu Piskala

arXiv:2512.23686·cs.CL·December 30, 2025

PROFASR-BENCH: A Benchmark for Context-Conditioned ASR in High-Stakes Professional Speech

Deepak Babu Piskala

PDF

Open Access 1 Datasets

TL;DR

ProfASR-Bench introduces a comprehensive benchmark for evaluating context-conditioned automatic speech recognition in high-stakes professional domains, revealing a significant gap in current models' utilization of available contextual information.

Contribution

The paper presents a new benchmark dataset and evaluation framework that measures how well ASR systems leverage contextual cues in professional speech, highlighting the underuse of available context.

Findings

01

Lightweight textual context has minimal impact on WER.

02

Adversarial prompts do not reliably degrade ASR performance.

03

Current systems exhibit a context-utilization gap (CUG) in high-stakes settings.

Abstract

Automatic Speech Recognition (ASR) in professional settings faces challenges that existing benchmarks underplay: dense domain terminology, formal register variation, and near-zero tolerance for critical entity errors. We present ProfASR-Bench, a professional-talk evaluation suite for high-stakes applications across finance, medicine, legal, and technology. Each example pairs a natural-language prompt (domain cue and/or speaker profile) with an entity-rich target utterance, enabling controlled measurement of context-conditioned recognition. The corpus supports conventional ASR metrics alongside entity-aware scores and slice-wise reporting by accent and gender. Using representative families Whisper (encoder-decoder ASR) and Qwen-Omni (audio language models) under matched no-context, profile, domain+profile, oracle, and adversarial conditions, we find a consistent pattern: lightweight…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

prdeepakbabu/ProfASR-Bench
dataset· 118 dl
118 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Authorship Attribution and Profiling · Topic Modeling