Contrastive Predictive Coding Based Feature for Automatic Speaker   Verification

Cheng-I Lai

arXiv:1904.01575·cs.CL·April 4, 2019·21 cites

Contrastive Predictive Coding Based Feature for Automatic Speaker Verification

Cheng-I Lai

PDF

Open Access 1 Repo

TL;DR

This work explores the use of Contrastive Predictive Coding features to improve automatic speaker verification systems by leveraging predictive coding and noise contrastive estimation techniques.

Contribution

It introduces CPC-based features into speaker verification, detailing methods, experiments, and analysis to enhance system performance.

Findings

01

CPC features improve speaker verification accuracy

02

Enhanced robustness to noise and variability

03

Demonstrated effectiveness over traditional features

Abstract

This thesis describes our ongoing work on Contrastive Predictive Coding (CPC) features for speaker verification. CPC is a recently proposed representation learning framework based on predictive coding and noise contrastive estimation. We focus on incorporating CPC features into the standard automatic speaker verification systems, and we present our methods, experiments, and analysis. This thesis also details necessary background knowledge in past and recent work on automatic speaker verification systems, conventional speech features, and the motivation and techniques behind CPC.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

jefflai108/Contrastive-Predictive-Coding-PyTorch
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing

MethodsInfoNCE · Contrastive Predictive Coding