A Strongly-Labelled Polyphonic Dataset of Urban Sounds with   Spatiotemporal Context

Kenneth Ooi; Karn N. Watcharasupat; Santi Peksi; Furi Andi Karnapi,; Zhen-Ting Ong; Danny Chua; Hui-Wen Leow; Li-Long Kwok; Xin-Lei Ng; Zhen-Ann; Loh; and Woon-Seng Gan

arXiv:2111.02006·cs.SD·June 7, 2022·6 cites

A Strongly-Labelled Polyphonic Dataset of Urban Sounds with Spatiotemporal Context

Kenneth Ooi, Karn N. Watcharasupat, Santi Peksi, Furi Andi Karnapi,, Zhen-Ting Ong, Danny Chua, Hui-Wen Leow, Li-Long Kwok, Xin-Lei Ng, Zhen-Ann, Loh, and Woon-Seng Gan

PDF

Open Access 1 Repo

TL;DR

This paper presents SINGA:PURA, a comprehensive, strongly labelled urban sound dataset with spatiotemporal data from Singapore, designed for sound event detection, classification, and localization, including a hierarchical label taxonomy.

Contribution

Introduction of SINGA:PURA, a new urban sound dataset with detailed annotations and a hierarchical taxonomy, tailored for diverse sound analysis tasks in an Asian urban context.

Findings

01

Dataset includes diverse urban sound events with spatiotemporal annotations

02

Baseline model performance provided as a benchmark for future research

03

Hierarchical label taxonomy enhances compatibility with existing datasets

Abstract

This paper introduces SINGA:PURA, a strongly labelled polyphonic urban sound dataset with spatiotemporal context. The data were collected via several recording units deployed across Singapore as a part of a wireless acoustic sensor network. These recordings were made as part of a project to identify and mitigate noise sources in Singapore, but also possess a wider applicability to sound event detection, classification, and localization. This paper introduces an accompanying hierarchical label taxonomy, which has been designed to be compatible with other existing datasets for urban sound tagging while also able to capture sound events unique to the Singaporean context. This paper details the data collection, annotation, and processing methodologies for the creation of the dataset. We further perform exploratory data analysis and include the performance of a baseline model on the dataset…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ntudsp/singapura
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Animal Vocal Communication and Behavior