SoundSil-DS: Deep Denoising and Segmentation of Sound-field Images with   Silhouettes

Risako Tanigawa; Kenji Ishikawa; Noboru Harada; Yasuhiro Oikawa

arXiv:2411.07517·eess.SP·November 13, 2024

SoundSil-DS: Deep Denoising and Segmentation of Sound-field Images with Silhouettes

Risako Tanigawa, Kenji Ishikawa, Noboru Harada, Yasuhiro Oikawa

PDF

Open Access 1 Repo

TL;DR

This paper introduces SoundSil-DS, a deep learning method for denoising and segmenting sound-field images with object silhouettes, improving visualization and analysis of sound interactions in optical sound field imaging.

Contribution

It proposes a novel joint denoising and segmentation model for sound-field images, trained on a new dataset created via acoustic simulation.

Findings

01

Effective noise removal in simulated and real data

02

Accurate segmentation of sound fields and object silhouettes

03

Potential for enhanced 3D sound field reconstruction

Abstract

Development of optical technology has enabled imaging of two-dimensional (2D) sound fields. This acousto-optic sensing enables understanding of the interaction between sound and objects such as reflection and diffraction. Moreover, it is expected to be used an advanced measurement technology for sonars in self-driving vehicles and assistive robots. However, the low sound-pressure sensitivity of the acousto-optic sensing results in high intensity of noise on images. Therefore, denoising is an essential task to visualize and analyze the sound fields. In addition to denoising, segmentation of sound and object silhouette is also required to analyze interactions between them. In this paper, we propose sound-field-images-with-object-silhouette denoising and segmentation (SoundSil-DS) that jointly perform denoising and segmentation for sound fields and object silhouettes on a visualized image.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nttcslab/soundsil-ds
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies