BRADS and BRWDS: Multipurpose audio and text datasets for automatic Bangla regional speech recognition

Umme Aiman; Md Nakibul Islam; MD Hana Sultan Chowdhury; Md. Sadekur Rahman; Md. Tarek Habib; Mahady Hasan

PMC · DOI:10.1016/j.dib.2025.112177·October 15, 2025

BRADS and BRWDS: Multipurpose audio and text datasets for automatic Bangla regional speech recognition

Umme Aiman, Md Nakibul Islam, MD Hana Sultan Chowdhury, Md. Sadekur Rahman, Md. Tarek Habib, Mahady Hasan

PDF

Open Access

TL;DR

This paper introduces a new dataset for Bangla speech recognition, focusing on regional variations and aiming to improve NLP research in Bangladesh.

Contribution

The dataset includes regional Bangla words and pronunciations, collected from native speakers across multiple Bangla-speaking regions.

Findings

01

The dataset includes 298 Bangla words with regional pronunciations collected from native speakers.

02

It contains 2439 audio segments from 85 contributors, allowing for real-world scenario modeling.

03

The modular design supports future expansion with new regional words.

Abstract

This paper presents an innovative approach to Bangla voice recognition. Although Bangla is the seventh most spoken native language globally, it remains underrepresented in voice recognition research. The dataset contains 298 frequently used Bangla words, including 233 regional words and 65 standard Bangla words. These terms, encompassing various regional pronunciations and meanings, were collected from native speakers in Dhaka, Chattogram, Barisal, Mymensingh, Rajshahi, Sylhet, Rangpur, and Khulna. The 2439 audio segments in the dataset were contributed voluntarily by 85 native speakers and assessed by ten university students. This resource is intended for researchers working on automatic Bangla regional speech recognition systems, with an emphasis on capturing regional pronunciation and linguistic differences. The dataset allows researchers to recreate real-world scenarios during model…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Figures8

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Speech and Audio Processing