Vibravox: A Dataset of French Speech Captured with Body-conduction Audio   Sensors

Julien Hauret; Malo Olivier; Thomas Joubaud; Christophe; Langrenne; Sarah Poir\'ee; V\'eronique Zimpfer; \'Eric Bavu

arXiv:2407.11828·eess.AS·March 28, 2025

Vibravox: A Dataset of French Speech Captured with Body-conduction Audio Sensors

Julien Hauret, Malo Olivier, Thomas Joubaud, Christophe, Langrenne, Sarah Poir\'ee, V\'eronique Zimpfer, \'Eric Bavu

PDF

Open Access 1 Repo 10 Models 4 Datasets

TL;DR

Vibravox is a comprehensive GDPR-compliant dataset of French speech and physiological sounds captured with multiple body-conduction sensors and airborne microphones, enabling research on speech recognition, enhancement, and verification.

Contribution

The paper introduces Vibravox, a novel dataset with diverse sensor recordings and annotations, facilitating advanced research in body-conduction audio processing.

Findings

01

Sensor-specific performance insights for speech tasks

02

Comparison of body-conduction and airborne microphone data

03

Evaluation of state-of-the-art models on new sensor data

Abstract

Vibravox is a dataset compliant with the General Data Protection Regulation (GDPR) containing audio recordings using five different body-conduction audio sensors: two in-ear microphones, two bone conduction vibration pickups, and a laryngophone. The dataset also includes audio data from an airborne microphone used as a reference. The Vibravox corpus contains 45 hours per sensor of speech samples and physiological sounds recorded by 188 participants under different acoustic conditions imposed by a high order ambisonics 3D spatializer. Annotations about the recording conditions and linguistic transcriptions are also included in the corpus. We conducted a series of experiments on various speech-related tasks, including speech recognition, speech enhancement, and speaker verification. These experiments were carried out using state-of-the-art models to evaluate and compare their performances…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jhauret/vibravox
pytorchOfficial

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis · Music and Audio Processing

MethodsSparse Evolutionary Training