Unsupervised Acoustic Scene Mapping Based on Acoustic Features and   Dimensionality Reduction

Idan Cohen; Ofir Lindenbaum; Sharon Gannot

arXiv:2301.00448·eess.AS·March 14, 2024

Unsupervised Acoustic Scene Mapping Based on Acoustic Features and Dimensionality Reduction

Idan Cohen, Ofir Lindenbaum, Sharon Gannot

PDF

Open Access

TL;DR

This paper presents an unsupervised deep learning approach using local conformal autoencoders to map acoustic scenes based on features, demonstrating robustness to reverberation and noise without relying on TDOA estimation.

Contribution

The paper introduces a novel unsupervised method for acoustic scene mapping that leverages local conformal autoencoders, avoiding traditional TDOA estimation and improving robustness to reverberation.

Findings

01

LOCA learns a spatially isometric representation of microphone locations.

02

The method outperforms other dimensionality reduction techniques in simulations.

03

LOCA shows significant robustness to reverberation effects.

Abstract

Classical methods for acoustic scene mapping require the estimation of time difference of arrival (TDOA) between microphones. Unfortunately, TDOA estimation is very sensitive to reverberation and additive noise. We introduce an unsupervised data-driven approach that exploits the natural structure of the data. Our method builds upon local conformal autoencoders (LOCA) - an offline deep learning scheme for learning standardized data coordinates from measurements. Our experimental setup includes a microphone array that measures the transmitted sound source at multiple locations across the acoustic enclosure. We demonstrate that LOCA learns a representation that is isometric to the spatial locations of the microphones. The performance of our method is evaluated using a series of realistic simulations and compared with other dimensionality-reduction schemes. We further assess the influence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Music and Audio Processing · Underwater Acoustics Research