DiPCo -- Dinner Party Corpus
Maarten Van Segbroeck, Ahmed Zaid, Ksenia Kutsenko, Cirenia Huerta,, Tinh Nguyen, Xuewen Luo, Bj\"orn Hoffmeister, Jan Trmal, Maurizio Omologo,, Roland Maas

TL;DR
DiPCo is a publicly available speech corpus simulating natural dinner party conversations in home environments, recorded with multiple microphones to aid research in noise-robust and distant speech processing.
Contribution
The paper introduces a new, publicly accessible corpus of dinner party conversations with multi-microphone recordings for advancing noise-robust speech technologies.
Findings
Contains 10 sessions of natural conversations in home settings.
Includes multi-microphone recordings and human transcripts.
Aims to facilitate benchmarking in distant speech processing.
Abstract
We present a speech data corpus that simulates a "dinner party" scenario taking place in an everyday home environment. The corpus was created by recording multiple groups of four Amazon employee volunteers having a natural conversation in English around a dining table. The participants were recorded by a single-channel close-talk microphone and by five far-field 7-microphone array devices positioned at different locations in the recording room. The dataset contains the audio recordings and human labeled transcripts of a total of 10 sessions with a duration between 15 and 45 minutes. The corpus was created to advance in the field of noise robust and distant speech processing and is intended to serve as a public research and benchmarking data set.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
