Graph Neural Field with Spatial-Correlation Augmentation for HRTF Personalization
De Hu, Junsheng Hu, Cuicui Jiang

TL;DR
This paper introduces GraphNF-SCA, a novel graph neural network-based method for personalized HRTF generation that leverages spatial correlations to improve accuracy and efficiency in immersive spatial audio rendering.
Contribution
The paper presents a new GNN-based framework with spatial-correlation augmentation for HRTF personalization, outperforming existing position-by-position estimation methods.
Findings
Achieves state-of-the-art HRTF personalization accuracy.
Effectively models spatial correlations across HRTFs.
Enhances spatial consistency in personalized HRTFs.
Abstract
To achieve immersive spatial audio rendering on VR/AR devices, high-quality Head-Related Transfer Functions (HRTFs) are essential. In general, HRTFs are subject-dependent and position-dependent, and their measurement is time-consuming and tedious. To address this challenge, we propose the Graph Neural Field with Spatial-Correlation Augmentation (GraphNF-SCA) for HRTF personalization, which can be used to generate individual HRTFs for unseen subjects. The GraphNF-SCA consists of three key components: an HRTF personalization (HRTF-P) module, an HRTF upsampling (HRTF-U) module, and a fine-tuning stage. In the HRTF-P module, we predict HRTFs of the target subject via the Graph Neural Network (GNN) with an encoder-decoder architecture, where the encoder extracts universal features and the decoder incorporates the target-relevant features and produces individualized HRTFs. The HRTF-U module…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation · EEG and Brain-Computer Interfaces · Speech and Audio Processing
