Spatial Upsampling of Head-Related Transfer Functions Using a Physics-Informed Neural Network

Fei Ma; Thushara D. Abhayapala; Prasanga N. Samarasinghe; Xingyu Chen

arXiv:2307.14650 · eess.AS · December 12, 2023 · 1 cites

TL;DR

This paper introduces a physics-informed neural network (PINN) approach for upsampling sparse head-related transfer functions (HRTFs), leveraging the Helmholtz equation to produce physically valid and generalizable HRTF estimations for personalized virtual acoustics.

Contribution

The novel PINN method integrates the Helmholtz equation into neural network training for HRTF upsampling, improving physical validity and generalization over existing data-driven approaches.

Findings

01 PINN outperforms SH and HRTF field methods in interpolation.

02 PINN generalizes well to unseen HRTFs.

03 Physically regularized upsampling enhances virtual acoustic realism.

Abstract

Head-related transfer functions (HRTFs) capture the information that a person uses to localize sound sources in space, and thus are crucial for creating personalized virtual acoustic experiences. However, practical HRTF measurement systems may only measure a person's HRTFs sparsely, which necessitates HRTF upsampling. This paper proposes a physics-informed neural network (PINN) method for HRTF upsampling. The PINN exploits the Helmholtz equation, the governing equation of acoustic wave propagation, to regularize the upsampling process. This helps generate physically valid upsamplings that generalize beyond the measured HRTFs. Furthermore, the size (width and depth) of the PINN is set according to the Helmholtz equation and its solutions, the spherical harmonics (SHs). This gives the PINN an appropriate level of expressive power, and thus it does not suffer from the…
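
The idea of regularizing with the Helmholtz equation can be illustrated with a small NumPy sketch. It is not the authors' implementation: a plane wave stands in for the network output, central finite differences stand in for the automatic differentiation a PINN would normally use, and the function names, the weight `pde_weight`, the `k**2` normalization, and the 343 m/s speed of sound are all assumptions made for illustration. The sketch combines a data-fit term at measured points with a penalty on the residual of the Helmholtz equation, ∇²P + k²P = 0, at extra collocation points.

```python
import numpy as np

SPEED_OF_SOUND = 343.0  # m/s, assumed value for air


def plane_wave(direction, k):
    """Hypothetical stand-in for a trained model: a unit-amplitude plane wave."""
    direction = np.asarray(direction, dtype=float)
    direction = direction / np.linalg.norm(direction)
    return lambda pts: np.exp(1j * k * (pts @ direction))


def helmholtz_residual(field, points, k, h=1e-3):
    """Estimate laplacian(P) + k^2 * P at each point with central differences."""
    center = field(points)
    laplacian = np.zeros(len(points), dtype=complex)
    for axis in range(3):
        step = np.zeros(3)
        step[axis] = h
        laplacian += (field(points + step) - 2.0 * center + field(points - step)) / h**2
    return laplacian + k**2 * center


def regularized_loss(field, measured_pts, measured_vals, collocation_pts, k, pde_weight=1.0):
    """Data misfit plus a Helmholtz penalty (residual scaled by k^2, an assumed choice)."""
    data_term = np.mean(np.abs(field(measured_pts) - measured_vals) ** 2)
    pde_term = np.mean(np.abs(helmholtz_residual(field, collocation_pts, k) / k**2) ** 2)
    return data_term + pde_weight * pde_term


# Usage: compare a field that satisfies the equation with one that does not.
k = 2.0 * np.pi * 1000.0 / SPEED_OF_SOUND          # wavenumber at 1 kHz
rng = np.random.default_rng(0)
collocation = rng.uniform(-0.5, 0.5, size=(50, 3))  # points without measurements
measured = collocation[:10]                         # pretend only 10 points were measured

valid_field = plane_wave([1.0, 2.0, 0.5], k)        # solves the Helmholtz equation
invalid_field = plane_wave([1.0, 2.0, 0.5], 2 * k)  # wrong wavenumber, violates it
targets = valid_field(measured)

loss_valid = regularized_loss(valid_field, measured, targets, collocation, k)
loss_invalid = regularized_loss(invalid_field, measured, targets, collocation, k)
```

In this toy setting the physically valid field yields a loss near zero, while the field with the wrong wavenumber is penalized by the Helmholtz term even at points where no measurement exists. That is the sense in which a physics term can constrain an upsampling between sparse measurements.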

Code & Models

Repositories

feima1024/PINN-for-HRTF-upsampling (Official)

Taxonomy

Topics: Speech and Audio Processing · Hearing Loss and Rehabilitation · Acoustic Wave Phenomena Research