On Unsupervised Uncertainty-Driven Speech Pseudo-Label Filtering and Model Calibration
Nauman Dawalatabad, Sameer Khurana, Antoine Laurent, James Glass

TL;DR
This paper improves unsupervised speech domain adaptation by enhancing pseudo-label filtering through uncertainty estimation and model calibration, leading to more reliable self-training iterations.
Contribution
It introduces a theoretically grounded uncertainty-based pseudo-label filtering method and explores the impact of model calibration on self-training effectiveness.
Findings
Uncertainty-based filtering can fail under severe domain mismatch.
Calibrated models improve pseudo-label filtering reliability.
Proposed methods enhance unsupervised speech adaptation performance.
Abstract
Pseudo-label (PL) filtering forms a crucial part of Self-Training (ST) methods for unsupervised domain adaptation. Dropout-based Uncertainty-driven Self-Training (DUST) proceeds by first training a teacher model on source domain labeled data. Then, the teacher model is used to provide PLs for the unlabeled target domain data. Finally, we train a student on augmented labeled and pseudo-labeled data. The process is iterative, where the student becomes the teacher for the next DUST iteration. A crucial step that precedes the student model training in each DUST iteration is filtering out noisy PLs that could lead the student model astray. In DUST, we proposed a simple, effective, and theoretically sound PL filtering strategy based on the teacher model's uncertainty about its predictions on unlabeled speech utterances. We estimate the model's uncertainty by computing disagreement amongst…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and dialogue systems · Natural Language Processing Techniques
Methodsfail
