Denoising of photogrammetric dummy head ear point clouds for individual Head-Related Transfer Functions computation
Fabio Di Giusto, Francesc Llu\'is, Sjoerd van Ophem, Elke Deckers

TL;DR
This paper explores the use of deep neural networks to denoise photogrammetric ear scans, improving the accuracy of head-related transfer functions for realistic virtual audio rendering.
Contribution
It introduces a DNN-based denoising approach tailored for photogrammetric ear scans and benchmarks its performance against classical methods, advancing HRTF computation accuracy.
Findings
DNNs marginally improve HRTF accuracy from noisy scans
Correlation analysis identifies key geometric and HRTF metrics
Enhanced DNN retraining further reduces deviation levels
Abstract
Individual Head-Related Transfer Functions (HRTFs), crucial for realistic virtual audio rendering, can be efficiently numerically computed from precise three-dimensional head and ear scans. While photogrammetry scanning is promising, it generally lacks accuracy, leading to HRTFs showing significant perceptual deviation from reference data, mainly due to scanning errors affecting the most occluded pinna structures. This paper examines the application of Deep Neural Networks (DNNs) for denoising photogrammetric ear scans. Several DNNs, fine-tuned on pinna samples corrupted with synthetic error modelled to mimic that observed in photogrammetric dummy head scans, are tested and benchmarked against a classical denoising method. One DNN is further modified and retrained to enhance its denoising performance. The comparison of HRTFs derived from original and denoised scans against reference…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHearing Loss and Rehabilitation
