CochlScene: Acquisition of acoustic scene data using crowdsourcing
Il-Young Jeong, Jeongsoo Park

TL;DR
This paper introduces CochlScene, a large-scale acoustic scene dataset collected via crowdsourcing, along with a validation process and baseline system to facilitate future research in acoustic scene classification.
Contribution
It presents a novel crowdsourcing pipeline for collecting acoustic data and introduces the CochlScene dataset with a reliable data split and baseline system.
Findings
76,000 samples collected from 831 participants
13 distinct acoustic scenes included in the dataset
Baseline system provided for future research
Abstract
This paper describes a pipeline for collecting acoustic scene data by using crowdsourcing. The detailed process of crowdsourcing is explained, including planning, validation criteria, and actual user interfaces. As a result of data collection, we present CochlScene, a novel dataset for acoustic scene classification. Our dataset consists of 76k samples collected from 831 participants in 13 acoustic scenes. We also propose a manual data split of training, validation, and test sets to increase the reliability of the evaluation results. Finally, we provide a baseline system for future research.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
MethodsTest
