Spatial Upsampling of Head-Related Transfer Functions Using a Physics-Informed Neural Network
Fei Ma, Thushara D. Abhayapala, Prasanga N. Samarasinghe, Xingyu Chen

TL;DR
This paper introduces a physics-informed neural network (PINN) approach for upsampling sparse head-related transfer functions (HRTFs), leveraging the Helmholtz equation to produce physically valid and generalizable HRTF estimations for personalized virtual acoustics.
Contribution
The novel PINN method integrates the Helmholtz equation into neural network training for HRTF upsampling, improving physical validity and generalization over existing data-driven approaches.
Findings
PINN outperforms SH and HRTF field methods in interpolation.
PINN generalizes well to unseen HRTFs.
Physically regularized upsampling enhances virtual acoustic realism.
Abstract
Head-related transfer function (HRTF) capture the information that a person uses to localize sound sources in space, and thus is crucial for creating personalized virtual acoustic experiences. However, practical HRTF measurement systems may only measure a person's HRTFs sparsely, and this necessitates HRTF upsampling. This paper proposes a physics-informed neural network (PINN) method for HRTF upsampling. The PINN exploits the Helmholtz equation, the governing equation of acoustic wave propagation, for regularizing the upsampling process. This helps the generation of physically valid upsamplings which generalize beyond the measured HRTF. Furthermore, the size (width and depth) of the PINN is set according to the Helmholtz equation and its solutions, the spherical harmonics (SHs). This makes the PINN have an appropriate level of expressive power and thus does not suffer from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and Audio Processing · Hearing Loss and Rehabilitation · Acoustic Wave Phenomena Research
