Investigating Active-learning-based Training Data Selection for Speech   Spoofing Countermeasure

Xin Wang; Junich Yamagishi

arXiv:2203.14553·eess.AS·October 10, 2022·SLT

Investigating Active-learning-based Training Data Selection for Speech Spoofing Countermeasure

Xin Wang, Junich Yamagishi

PDF

Open Access 1 Repo

TL;DR

This paper explores active learning strategies to improve the generalization of speech spoofing countermeasures, demonstrating that selective data training enhances performance with less data.

Contribution

It investigates active learning methods for training spoofing countermeasures, showing improved generalization and data efficiency over traditional training approaches.

Findings

01

Active learning improves CM generalization on unseen data.

02

AL-based CMs perform comparably to using full data pools with less data.

03

Diverse data inclusion and cautious selection are key for effective AL.

Abstract

Training a spoofing countermeasure (CM) that generalizes to various unseen data is desired but challenging. While methods such as data augmentation and self-supervised learning are applicable, the imperfect CM performance on diverse test sets still calls for additional strategies. This study took the initiative and investigated CM training using active learning (AL), a framework that iteratively selects useful data from a large pool set and fine-tunes the CM. This study compared a few methods to measure the data usefulness and the impact of using different pool sets collected from various sources. The results showed that the AL-based CMs achieved better generalization than our strong baseline on multiple test tests. Furthermore, compared with a top-line CM that simply used the whole data pool set for training, the AL-based CMs achieved similar performance using less training data.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nii-yamagishilab/project-NN-Pytorch-scripts
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing