TL;DR
Urban Rhapsody is a comprehensive framework that integrates advanced audio analysis, machine learning, and visual tools to explore, classify, and understand urban soundscapes, aiding noise management and public health efforts.
Contribution
It introduces a novel interactive system combining audio representation, machine learning, and visualization to analyze urban noise, addressing data complexity and lack of labeled datasets.
Findings
Effective creation of large annotated urban sound databases
Case studies demonstrating noise pattern insights in NYC
Enhanced classification of urban sound sources
Abstract
Noise is one of the primary quality-of-life issues in urban environments. In addition to annoyance, noise negatively impacts public health and educational performance. While low-cost sensors can be deployed to monitor ambient noise levels at high temporal resolutions, the amount of data they produce and the complexity of these data pose significant analytical challenges. One way to address these challenges is through machine listening techniques, which are used to extract features in attempts to classify the source of noise and understand temporal patterns of a city's noise situation. However, the overwhelming number of noise sources in the urban environment and the scarcity of labeled data makes it nearly impossible to create classification models with large enough vocabularies that capture the true dynamism of urban soundscapes In this paper, we first identify a set of requirements in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
