SoundSil-DS: Deep Denoising and Segmentation of Sound-field Images with Silhouettes
Risako Tanigawa, Kenji Ishikawa, Noboru Harada, Yasuhiro Oikawa

TL;DR
This paper introduces SoundSil-DS, a deep learning method for denoising and segmenting sound-field images with object silhouettes, improving visualization and analysis of sound interactions in optical sound field imaging.
Contribution
It proposes a novel joint denoising and segmentation model for sound-field images, trained on a new dataset created via acoustic simulation.
Findings
Effective noise removal in simulated and real data
Accurate segmentation of sound fields and object silhouettes
Potential for enhanced 3D sound field reconstruction
Abstract
Development of optical technology has enabled imaging of two-dimensional (2D) sound fields. This acousto-optic sensing enables understanding of the interaction between sound and objects such as reflection and diffraction. Moreover, it is expected to be used an advanced measurement technology for sonars in self-driving vehicles and assistive robots. However, the low sound-pressure sensitivity of the acousto-optic sensing results in high intensity of noise on images. Therefore, denoising is an essential task to visualize and analyze the sound fields. In addition to denoising, segmentation of sound and object silhouette is also required to analyze interactions between them. In this paper, we propose sound-field-images-with-object-silhouette denoising and segmentation (SoundSil-DS) that jointly perform denoising and segmentation for sound fields and object silhouettes on a visualized image.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Speech and Audio Processing · Music Technology and Sound Studies
