Retrieval-Augmented Neural Field for HRTF Upsampling and Personalization

Yoshiki Masuyama, Gordon Wichern, François G. Germain, Christopher Ick, Jonathan Le Roux

arXiv:2501.13017·eess.AS·January 23, 2025

TL;DR

This paper introduces RANF, a retrieval-augmented neural field that improves HRTF spatial upsampling and personalization from only a few measured directions by conditioning on the HRTFs of similar subjects retrieved from a dataset.

Contribution

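The paper proposes a retrieval-augmented neural field that takes the retrieved subject's HRTF at the desired direction as an additional input alongside the sound source direction, together with a network inspired by transform-average-concatenate that handles multiple retrieved subjects efficiently.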

Findings

01. RANF improves HRTF upsampling accuracy when only a few directions are measured.

02. RANF outperforms baseline methods on the SONICOM dataset.

03. RANF contributed to winning the Listener Acoustic Personalization (LAP) Challenge 2024.

Abstract

Head-related transfer functions (HRTFs) with dense spatial grids are desired for immersive binaural audio generation, but their recording is time-consuming. Although HRTF spatial upsampling has shown remarkable progress with neural fields, spatial upsampling only from a few measured directions, e.g., 3 or 5 measurements, is still challenging. To tackle this problem, we propose a retrieval-augmented neural field (RANF). RANF retrieves a subject whose HRTFs are close to those of the target subject from a dataset. The HRTF of the retrieved subject at the desired direction is fed into the neural field in addition to the sound source direction itself. Furthermore, we present a neural network that can efficiently handle multiple retrieved subjects, inspired by a multi-channel processing technique called transform-average-concatenate. Our experiments confirm the benefits of RANF on the SONICOM…
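
The retrieval and conditioning steps described in the abstract can be sketched roughly as follows. This is a minimal illustration in plain Python, not the authors' implementation: the function names, the dictionary layout, and the use of a log-spectral distance for retrieval are all assumptions made for the example, and the `transform` argument stands in for a learned layer. The sketch picks the dataset subjects closest to the target on the few measured directions, builds one input per retrieved subject for a query direction, and mixes those inputs in a transform-average-concatenate style.

```python
import math


def log_spectral_distance(mag_a, mag_b):
    """RMS difference in dB between two magnitude spectra (assumed metric)."""
    diffs = [20.0 * math.log10(a / b) for a, b in zip(mag_a, mag_b)]
    return math.sqrt(sum(d * d for d in diffs) / len(diffs))


def retrieve_subjects(target, dataset, k=1):
    """Return the k subject ids closest to the target (hypothetical helper).

    target:  {direction: magnitude spectrum} for the few measured directions
    dataset: {subject_id: {direction: magnitude spectrum}} on a dense grid
    """
    def mean_distance(subject_id):
        hrtfs = dataset[subject_id]
        total = sum(log_spectral_distance(target[d], hrtfs[d]) for d in target)
        return total / len(target)

    return sorted(dataset, key=mean_distance)[:k]


def build_field_inputs(direction, retrieved_ids, dataset):
    """One input per retrieved subject: query direction plus that subject's spectrum."""
    return [list(direction) + list(dataset[s][direction]) for s in retrieved_ids]


def transform_average_concatenate(features, transform):
    """Transform each item, average across items, append the mean to each item."""
    transformed = [transform(f) for f in features]
    count = len(transformed)
    mean = [sum(column) / count for column in zip(*transformed)]
    return [t + mean for t in transformed]


# Toy data: three subjects, two directions (azimuth, elevation), two frequency bins.
dataset = {
    "A": {(0, 0): [1.0, 1.0], (90, 0): [1.0, 2.0]},
    "B": {(0, 0): [2.0, 2.0], (90, 0): [2.0, 1.0]},
    "C": {(0, 0): [1.0, 1.1], (90, 0): [1.0, 1.0]},
}
target = {(0, 0): [1.0, 1.0]}  # the target subject has one measured direction

retrieved = retrieve_subjects(target, dataset, k=2)
inputs = build_field_inputs((90, 0), retrieved, dataset)
mixed = transform_average_concatenate(inputs, lambda f: list(f))  # identity transform
```

In a trained model the mixed features would be passed to a neural field that predicts the target subject's HRTF at the query direction; the identity transform here only keeps the example self-contained.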

Code & Models

Repositories

merlresearch/ranf-hrtf (PyTorch, official)

Taxonomy

Topics: AI and HR Technologies