Nonlinear ISA with Auxiliary Variables for Learning Speech Representations

Amrith Setlur; Barnabas Poczos; Alan W Black

arXiv:2007.12948 · eess.AS · July 28, 2020

TL;DR

This paper introduces a theoretical framework for nonlinear ISA with auxiliary variables to learn speech representations, demonstrating improved speaker verification and phoneme recognition performance.

Contribution

It extends nonlinear ICA to a more general nonlinear ISA framework with auxiliary variables, providing conditions for subspace identifiability and a practical algorithm.

Findings

01 Improved speaker verification accuracy.

02 Enhanced phoneme recognition performance.

03 Theoretical guarantees for subspace identifiability.

Abstract

This paper extends recent work on nonlinear Independent Component Analysis (ICA) by introducing a theoretical framework for nonlinear Independent Subspace Analysis (ISA) in the presence of auxiliary variables. Observed high-dimensional acoustic features like log Mel spectrograms can be considered surface-level manifestations of nonlinear transformations over individual multivariate sources of information like speaker characteristics, phonological content, etc. Under assumptions of energy-based models, we use the theory of nonlinear ISA to propose an algorithm that learns unsupervised speech representations whose subspaces are independent and potentially highly correlated with the original non-stationary multivariate sources. We show how nonlinear ICA with auxiliary variables can be extended to a generic identifiable model for subspaces as well while also providing sufficient conditions…
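
The generative picture described in the abstract (independent multivariate sources, made non-stationary by an auxiliary variable and then mixed nonlinearly into observations) can be illustrated with a small simulation. This is only a toy sketch under stated assumptions, not the paper's model or algorithm: the function name `simulate_isa`, the use of a segment index as the auxiliary variable, the shared-radius trick for within-subspace dependence, and the leaky-ReLU mixing are all hypothetical choices made for illustration.

```python
import numpy as np


def simulate_isa(n_segments=4, n_per_segment=500, subspace_dims=(2, 3), seed=0):
    """Toy data for nonlinear ISA with an auxiliary variable (illustrative only).

    Returns observations x, latent sources s, and the auxiliary variable u.
    """
    rng = np.random.default_rng(seed)
    d = sum(subspace_dims)

    # Assumption: the auxiliary variable u is a segment (time-window) index.
    u = np.repeat(np.arange(n_segments), n_per_segment)
    n = u.size

    blocks = []
    for k in subspace_dims:
        # Each subspace gets its own per-segment scale, so its distribution
        # changes with u (non-stationarity), independently of other subspaces.
        seg_scale = rng.uniform(0.5, 3.0, size=n_segments)
        # A radius shared by all k components makes them dependent
        # within the subspace while staying independent across subspaces.
        radius = rng.exponential(1.0, size=(n, 1))
        direction = rng.standard_normal((n, k))
        blocks.append(seg_scale[u][:, None] * radius * direction)
    s = np.hstack(blocks)

    # Invertible nonlinear mixing: orthogonal matrices followed by leaky ReLU.
    x = s
    for _layer in range(2):
        q, _r = np.linalg.qr(rng.standard_normal((d, d)))
        x = x @ q
        x = np.where(x > 0, x, 0.2 * x)
    return x, s, u
```

Calling `x, s, u = simulate_isa()` yields observations `x` that stand in for acoustic features, with the two source subspaces occupying columns 0-1 and 2-4 of `s`. A representation learner would see only `x` and `u`, and would be judged on how well it recovers each subspace up to an invertible transformation within that subspace.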

Taxonomy

Methods: Independent Component Analysis