Physics-Informed Direction-Aware Neural Acoustic Fields
Yoshiki Masuyama, Fran\c{c}ois G. Germain, Gordon Wichern, Christopher Ick, Jonathan Le Roux

TL;DR
This paper introduces a physics-informed neural network model specifically designed for first-order Ambisonic room impulse responses, enhancing sound field interpolation by incorporating physical sound propagation principles.
Contribution
The paper extends PINNs to model FOA RIRs by deriving physics-informed priors based on particle velocity and FOA channels, improving modeling accuracy.
Findings
The physics-informed model outperforms non-physics-based neural networks.
Derived priors effectively encode physical relationships among FOA channels.
Experimental results validate the proposed method's effectiveness.
Abstract
This paper presents a physics-informed neural network (PINN) for modeling first-order Ambisonic (FOA) room impulse responses (RIRs). PINNs have demonstrated promising performance in sound field interpolation by combining the powerful modeling capability of neural networks and the physical principles of sound propagation. In room acoustics, PINNs have typically been trained to represent the sound pressure measured by omnidirectional microphones where the wave equation or its frequency-domain counterpart, i.e., the Helmholtz equation, is leveraged. Meanwhile, FOA RIRs additionally provide spatial characteristics and are useful for immersive audio generation with a wide range of applications. In this paper, we extend the PINN framework to model FOA RIRs. We derive two physics-informed priors for FOA RIRs based on the correspondence between the particle velocity and the (X, Y, Z)-channels…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation · Music Technology and Sound Studies · Speech and Audio Processing
