BIRD: Big Impulse Response Dataset
Fran\c{c}ois Grondin, Jean-Samuel Lauzon, Simon Michaud, Mirco, Ravanelli, Fran\c{c}ois Michaud

TL;DR
BIRD is the largest open dataset of 100,000 multichannel room impulse responses, enabling efficient data augmentation for speech processing tasks involving multiple microphones and sources.
Contribution
This paper presents BIRD, the largest multichannel RIR dataset generated via simulation, facilitating improved data augmentation in speech applications.
Findings
BIRD enables efficient online data augmentation.
The dataset supports scenarios with two microphones and multiple sound sources.
Use cases demonstrate improved speech processing performance.
Abstract
This paper introduces BIRD, the Big Impulse Response Dataset. This open dataset consists of 100,000 multichannel room impulse responses (RIRs) generated from simulations using the Image Method, making it the largest multichannel open dataset currently available. These RIRs can be used toperform efficient online data augmentation for scenarios that involve two microphones and multiple sound sources. The paper also introduces use cases to illustrate how BIRD can perform data augmentation with existing speech corpora.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications
