CochlScene: Acquisition of acoustic scene data using crowdsourcing

Il-Young Jeong; Jeongsoo Park

arXiv:2211.02289·eess.AS·November 7, 2022

TL;DR

This paper introduces CochlScene, a large-scale acoustic scene dataset collected via crowdsourcing, along with a validation process and baseline system to facilitate future research in acoustic scene classification.

Contribution

It presents a crowdsourcing pipeline for collecting acoustic scene data and introduces the CochlScene dataset, together with a manual training/validation/test split and a baseline system.

Findings

1. 76,000 samples collected from 831 participants
2. 13 distinct acoustic scenes included in the dataset
3. Baseline system provided for future research

Abstract

This paper describes a pipeline for collecting acoustic scene data by using crowdsourcing. The detailed process of crowdsourcing is explained, including planning, validation criteria, and actual user interfaces. As a result of data collection, we present CochlScene, a novel dataset for acoustic scene classification. Our dataset consists of 76k samples collected from 831 participants in 13 acoustic scenes. We also propose a manual data split of training, validation, and test sets to increase the reliability of the evaluation results. Finally, we provide a baseline system for future research.
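The abstract mentions a manual split into training, validation, and test sets but does not give its rules here. As a rough illustration only, the sketch below shows one common way to make such a split more trustworthy: keeping all recordings from a given contributor in a single subset. This is an assumption, not the authors' procedure; the function name `split_by_participant`, the `(sample_id, participant_id)` record layout, and the 80/10/10 ratios are all hypothetical, and the random assignment stands in for the manual one described in the paper.

```python
import random
from collections import defaultdict


def split_by_participant(samples, ratios=(0.8, 0.1, 0.1), seed=0):
    """Assign whole participants to train/val/test (hypothetical helper).

    samples: iterable of (sample_id, participant_id) pairs.
    Returns a dict mapping split name to a list of sample ids.
    """
    # Group sample ids by the participant who recorded them.
    by_participant = defaultdict(list)
    for sample_id, participant_id in samples:
        by_participant[participant_id].append(sample_id)

    # Shuffle participants reproducibly, then cut the list by ratio.
    participants = sorted(by_participant)
    random.Random(seed).shuffle(participants)
    n_train = int(len(participants) * ratios[0])
    n_val = int(len(participants) * ratios[1])
    groups = {
        "train": participants[:n_train],
        "val": participants[n_train:n_train + n_val],
        "test": participants[n_train + n_val:],
    }

    # Expand each participant group back into its sample ids.
    return {
        name: [s for p in members for s in by_participant[p]]
        for name, members in groups.items()
    }


if __name__ == "__main__":
    # Toy data: 10 participants with 3 recordings each (made up).
    toy = [(f"s{p}_{i}", f"p{p}") for p in range(10) for i in range(3)]
    for name, ids in split_by_participant(toy).items():
        print(name, len(ids))
```

Splitting at the participant level means a model is never tested on recordings from a device or environment it already saw during training, which is the kind of leakage a careful split is usually meant to avoid.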

Code & Models

Repositories

cochlearai/cochlscene (official repository; framework tag: tf)

Taxonomy

Topics: Music and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies

Methods: Test