Exploring Frequency-Domain Feature Modeling for HRTF Magnitude Upsampling

Xingyu Chen; Hanwen Bi; Fei Ma; Sipei Zhao; Eva Cheng; and Ian S. Burnett

arXiv:2602.11670·eess.AS·February 13, 2026

Exploring Frequency-Domain Feature Modeling for HRTF Magnitude Upsampling

Xingyu Chen, Hanwen Bi, Fei Ma, Sipei Zhao, Eva Cheng, and Ian S. Burnett

PDF

Open Access

TL;DR

This paper explores frequency-domain modeling techniques for upsampling HRTFs, demonstrating that explicit spectral modeling with a Conformer architecture improves accuracy, especially under sparse measurement conditions.

Contribution

It introduces a frequency-domain Conformer-based architecture that effectively captures spectral dependencies, advancing HRTF upsampling methods beyond prior spatial-only models.

Findings

01

Explicit spectral modeling improves reconstruction accuracy.

02

The proposed method outperforms existing approaches on benchmark datasets.

03

Joint local and long-range spectral feature capture enhances performance.

Abstract

Accurate upsampling of Head-Related Transfer Functions (HRTFs) from sparse measurements is crucial for personalized spatial audio rendering. Traditional interpolation methods, such as kernel-based weighting or basis function expansions, rely on measurements from a single subject and are limited by the spatial sampling theorem, resulting in significant performance degradation under sparse sampling. Recent learning-based methods alleviate this limitation by leveraging cross-subject information, yet most existing neural architectures primarily focus on modeling spatial relationships across directions, while spectral dependencies along the frequency dimension are often modeled implicitly or treated independently. However, HRTF magnitude responses exhibit strong local continuity and long-range structure in the frequency domain, which are not fully exploited. This work investigates…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHearing Loss and Rehabilitation · Speech and Audio Processing · Music and Audio Processing