MLS: A Large-Scale Multilingual Dataset for Speech Research
Vineel Pratap, Qiantong Xu, Anuroop Sriram, Gabriel Synnaeve, Ronan, Collobert

TL;DR
The paper presents MLS, a comprehensive multilingual speech dataset with extensive transcribed audio from LibriVox, enabling advancements in ASR and TTS research across multiple languages.
Contribution
Introduction of MLS, a large-scale multilingual speech dataset with accompanying language models and baseline ASR models for 8 languages.
Findings
44.5K hours of English speech
6K hours of speech across 7 other languages
Baseline ASR models provided for all languages
Abstract
This paper introduces Multilingual LibriSpeech (MLS) dataset, a large multilingual corpus suitable for speech research. The dataset is derived from read audiobooks from LibriVox and consists of 8 languages, including about 44.5K hours of English and a total of about 6K hours for other languages. Additionally, we provide Language Models (LM) and baseline Automatic Speech Recognition (ASR) models and for all the languages in our dataset. We believe such a large transcribed dataset will open new avenues in ASR and Text-To-Speech (TTS) research. The dataset will be made freely available for anyone at http://www.openslr.org.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗nvidia/parakeet-tdt-0.6b-v3model· 254k dl· ♡ 747254k dl♡ 747
- 🤗nvidia/canary-1b-v2model· 123k dl· ♡ 371123k dl♡ 371
- 🤗lgris/bp400-xlsrmodel· 1 dl· ♡ 31 dl♡ 3
- 🤗lgris/bp500-base100k_voxpopulimodel· 1 dl· ♡ 11 dl♡ 1
- 🤗lgris/bp500-base10k_voxpopulimodel· 1 dl1 dl
- 🤗lgris/bp500-xlsrmodel· 2 dl· ♡ 12 dl♡ 1
- 🤗lgris/wav2vec2-large-xlsr-open-brazilian-portuguese-v2model· 612k dl· ♡ 20612k dl♡ 20
- 🤗lgris/wav2vec2-large-xlsr-open-brazilian-portuguesemodel· 41 dl· ♡ 1041 dl♡ 10
- 🤗niobures/GigaAMmodel
- 🤗SoSolaris/parakeet-tdt-0.6b-v3model· 7 dl7 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
