Mitigating Cross-Database Differences for Learning Unified HRTF Representation
Yutong Wen, You Zhang, Zhiyao Duan

TL;DR
This paper addresses the challenge of cross-database differences in HRTF data by identifying their causes and proposing a normalization method, enabling more unified HRTF representations for machine learning applications.
Contribution
The authors introduce a novel normalization technique to mitigate measurement setup variations, improving the learning of unified HRTF models across multiple databases.
Findings
Normalized HRTFs cannot be classified by database origin
Normalized HRTFs enable more unified HRTF representations
Normalization reduces measurement setup differences
Abstract
Individualized head-related transfer functions (HRTFs) are crucial for accurate sound positioning in virtual auditory displays. As the acoustic measurement of HRTFs is resource-intensive, predicting individualized HRTFs using machine learning models is a promising approach at scale. Training such models require a unified HRTF representation across multiple databases to utilize their respectively limited samples. However, in addition to differences on the spatial sampling locations, recent studies have shown that, even for the common location, HRTFs across databases manifest consistent differences that make it trivial to tell which databases they come from. This poses a significant challenge for learning a unified HRTF representation across databases. In this work, we first identify the possible causes of these cross-database differences, attributing them to variations in the measurement…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation · Speech and Audio Processing · Underwater Acoustics Research
