Exploring Frequency-Domain Feature Modeling for HRTF Magnitude Upsampling
Xingyu Chen, Hanwen Bi, Fei Ma, Sipei Zhao, Eva Cheng, and Ian S. Burnett

TL;DR
This paper explores frequency-domain modeling techniques for upsampling HRTFs, demonstrating that explicit spectral modeling with a Conformer architecture improves accuracy, especially under sparse measurement conditions.
Contribution
It introduces a frequency-domain Conformer-based architecture that effectively captures spectral dependencies, advancing HRTF upsampling methods beyond prior spatial-only models.
Findings
Explicit spectral modeling improves reconstruction accuracy.
The proposed method outperforms existing approaches on benchmark datasets.
Joint local and long-range spectral feature capture enhances performance.
Abstract
Accurate upsampling of Head-Related Transfer Functions (HRTFs) from sparse measurements is crucial for personalized spatial audio rendering. Traditional interpolation methods, such as kernel-based weighting or basis function expansions, rely on measurements from a single subject and are limited by the spatial sampling theorem, resulting in significant performance degradation under sparse sampling. Recent learning-based methods alleviate this limitation by leveraging cross-subject information, yet most existing neural architectures primarily focus on modeling spatial relationships across directions, while spectral dependencies along the frequency dimension are often modeled implicitly or treated independently. However, HRTF magnitude responses exhibit strong local continuity and long-range structure in the frequency domain, which are not fully exploited. This work investigates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation · Speech and Audio Processing · Music and Audio Processing
