Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation

Zhong Meng; Jinyu Li; Yifan Gong; Biing-Hwang (Fred) Juang

arXiv:1804.00644·eess.AS·May 1, 2019

Adversarial Teacher-Student Learning for Unsupervised Domain Adaptation

Zhong Meng, Jinyu Li, Yifan Gong, Biing-Hwang (Fred) Juang

PDF

TL;DR

This paper introduces adversarial teacher-student learning for unsupervised domain adaptation in speech recognition, explicitly learning condition-invariant features to improve robustness against variability and noise.

Contribution

It proposes a novel adversarial T/S learning framework that jointly optimizes for condition invariance and robustness, extending prior T/S methods with adversarial training and multi-factorial adaptation.

Findings

01

Achieved 44.60% relative WER reduction over the source model.

02

Achieved 5.38% relative WER reduction over baseline T/S learning.

03

Demonstrated effectiveness on noisy CHiME-3 dataset.

Abstract

The teacher-student (T/S) learning has been shown effective in unsupervised domain adaptation [1]. It is a form of transfer learning, not in terms of the transfer of recognition decisions, but the knowledge of posteriori probabilities in the source domain as evaluated by the teacher model. It learns to handle the speaker and environment variability inherent in and restricted to the speech signal in the target domain without proactively addressing the robustness to other likely conditions. Performance degradation may thus ensue. In this work, we advance T/S learning by proposing adversarial T/S learning to explicitly achieve condition-robust unsupervised domain adaptation. In this method, a student acoustic model and a condition classifier are jointly optimized to minimize the Kullback-Leibler divergence between the output distributions of the teacher and student models, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.