Mitigating Cross-Database Differences for Learning Unified HRTF Representation

Yutong Wen; You Zhang; Zhiyao Duan

arXiv:2307.14547·eess.AS·July 28, 2023

TL;DR

This paper addresses the challenge of cross-database differences in HRTF data by identifying their causes and proposing a normalization method, enabling more unified HRTF representations for machine learning applications.

Contribution

The authors introduce a novel normalization technique to mitigate measurement setup variations, improving the learning of unified HRTF models across multiple databases.

Findings

01. Normalized HRTFs cannot be classified by database origin

02. Normalized HRTFs enable more unified HRTF representations

03. Normalization reduces measurement setup differences

Abstract

Individualized head-related transfer functions (HRTFs) are crucial for accurate sound positioning in virtual auditory displays. As the acoustic measurement of HRTFs is resource-intensive, predicting individualized HRTFs using machine learning models is a promising approach at scale. Training such models requires a unified HRTF representation across multiple databases to utilize their respectively limited samples. However, in addition to differences in the spatial sampling locations, recent studies have shown that, even for common locations, HRTFs across databases manifest consistent differences that make it trivial to tell which database they come from. This poses a significant challenge for learning a unified HRTF representation across databases. In this work, we first identify the possible causes of these cross-database differences, attributing them to variations in the measurement…

Taxonomy

Topics: Hearing Loss and Rehabilitation · Speech and Audio Processing · Underwater Acoustics Research