Sound field decomposition based on two-stage neural networks

Ryo Matsuda; Makoto Otani

arXiv:2309.06661·eess.AS·September 14, 2023

Sound field decomposition based on two-stage neural networks

Ryo Matsuda, Makoto Otani

PDF

Open Access

TL;DR

This paper introduces a two-stage neural network approach for sound field decomposition and source localization, achieving higher accuracy than traditional methods through simulation-based training and regression-based localization.

Contribution

It presents a novel neural network framework that separates sound sources and localizes them as a regression task, improving accuracy over existing techniques.

Findings

01

Higher source-localization accuracy compared to conventional methods.

02

Enhanced sound-field-reconstruction accuracy.

03

Effective use of simulation-generated datasets for training.

Abstract

A method for sound field decomposition based on neural networks is proposed. The method comprises two stages: a sound field separation stage and a single-source localization stage. In the first stage, the sound pressure at microphones synthesized by multiple sources is separated into one excited by each sound source. In the second stage, the source location is obtained as a regression from the sound pressure at microphones consisting of a single sound source. The estimated location is not affected by discretization because the second stage is designed as a regression rather than a classification. Datasets are generated by simulation using Green's function, and the neural network is trained for each frequency. Numerical experiments reveal that, compared with conventional methods, the proposed method can achieve higher source-localization accuracy and higher sound-field-reconstruction…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Aerodynamics and Acoustics in Jet Flows · Flow Measurement and Analysis