Dataset Augmentation and Dimensionality Reduction of Pinna-Related   Transfer Functions

Corentin Guezenoc (IETR); Renaud Seguier (IETR)

arXiv:2010.04546·cs.SD·October 12, 2020

Dataset Augmentation and Dimensionality Reduction of Pinna-Related Transfer Functions

Corentin Guezenoc (IETR), Renaud Seguier (IETR)

PDF

Open Access

TL;DR

This paper explores how dataset augmentation and principal component analysis can effectively model individual variations in head-related transfer functions for personalized binaural audio synthesis.

Contribution

It demonstrates that PCA models trained on an augmented WiDESPREaD dataset outperform those trained on original data in reducing dimensionality of PRTFs.

Findings

01

WiDESPREaD dataset improves PCA performance

02

Model trained on WiDESPREaD outperforms others

03

Effective dimensionality reduction of PRTFs achieved

Abstract

Efficient modeling of the inter-individual variations of head-related transfer functions (HRTFs) is a key matterto the individualization of binaural synthesis. In previous work, we augmented a dataset of 119 pairs of earshapes and pinna-related transfer functions (PRTFs), thus creating a wide dataset of 1005 ear shapes and PRTFsgenerated by random ear drawings (WiDESPREaD) and acoustical simulations. In this article, we investigate thedimensionality reduction capacity of two principal component analysis (PCA) models of magnitude PRTFs, trainedon WiDESPREaD and on the original dataset, respectively. We find that the model trained on the WiDESPREaDdataset performs best, regardless of the number of retained principal components.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHearing Loss and Rehabilitation · Speech and Audio Processing · Music and Audio Processing