PAREDA: A Multi-Accent Speech Dataset of Natural Language Processing Research Discussions

Sicheng Jin; Dipankar Srirag; Aditya Joshi

arXiv:2605.17860·cs.CL·May 19, 2026

PAREDA: A Multi-Accent Speech Dataset of Natural Language Processing Research Discussions

Sicheng Jin, Dipankar Srirag, Aditya Joshi

PDF

TL;DR

PAREDA is a novel multi-accent speech dataset featuring natural language processing research discussions, designed to evaluate and improve the robustness of ASR systems across diverse accents and spontaneous speech.

Contribution

Introduces PAREDA, the first multi-accent dataset of NLP discussion speech, and demonstrates its utility in evaluating and enhancing ASR model performance on real-world, accented speech.

Findings

01

State-of-the-art models perform worse on PAREDA without fine-tuning.

02

Fine-tuning on PAREDA significantly reduces Word Error Rate.

03

PAREDA captures linguistic features often missing from existing datasets.

Abstract

While modern Automatic Speech Recognition (ASR) systems achieve high accuracy on benchmark corpora, their performance often degrades when there is real-world variability. This work focuses on variability arising due to accented, spontaneous, and domain-specific speech. In particular, we introduce PAper REading DAtaset (PAREDA), a first-of-its-kind multi-accent speech dataset consisting of discussions on academic Natural Language Processing (NLP) papers between speakers with Australian, Indian-English, and Chinese English accents. Each session elicits a spontaneous monologue (a summary of a paper's abstract) and a non-monologue (a question-and-answer session between participants), resulting in a corpus rich with technical jargon and conversational phenomena. We evaluate the performance of SOTA ASR models on PAREDA, analysing the impact of accent mixing and increased speech rate. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.