Vector-Quantized Autoregressive Predictive Coding
Yu-An Chung, Hao Tang, James Glass

TL;DR
This paper introduces VQ-APC, a novel self-supervised model that produces quantized representations, enabling explicit control over information encoding and revealing how speech features are preserved or amplified as model capacity varies.
Contribution
The paper proposes VQ-APC, a new model that produces quantized representations, allowing detailed analysis of information encoding in self-supervised speech models.
Findings
Quantized representations control information encoding in speech models.
Speech information is preserved or amplified depending on model capacity.
Codes learned correspond well to English phonemes.
Abstract
Autoregressive Predictive Coding (APC), as a self-supervised objective, has enjoyed success in learning representations from large amounts of unlabeled data, and the learned representations are rich for many downstream tasks. However, the connection between low self-supervised loss and strong performance in downstream tasks remains unclear. In this work, we propose Vector-Quantized Autoregressive Predictive Coding (VQ-APC), a novel model that produces quantized representations, allowing us to explicitly control the amount of information encoded in the representations. By studying a sequence of increasingly limited models, we reveal the constituents of the learned representations. In particular, we confirm the presence of information with probing tasks, while showing the absence of information with mutual information, uncovering the model's preference in preserving speech information as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Natural Language Processing Techniques
