Replay spoofing detection system for automatic speaker verification   using multi-task learning of noise classes

Hye-Jin Shim; Jee-weon Jung; Hee-Soo Heo; Sunghyun Yoon; Ha-Jin Yu

arXiv:1808.09638·eess.AS·October 26, 2018·1 cites

Replay spoofing detection system for automatic speaker verification using multi-task learning of noise classes

Hye-Jin Shim, Jee-weon Jung, Hee-Soo Heo, Sunghyun Yoon, Ha-Jin Yu

PDF

Open Access

TL;DR

This paper introduces a replay spoofing detection system for automatic speaker verification that leverages multi-task learning to classify noise types and improve detection accuracy, showing a 30% relative performance boost.

Contribution

The novel approach combines replay attack detection with noise classification in a multi-task learning framework, enhancing detection performance.

Findings

01

30% relative improvement on evaluation set

02

Effective multi-task learning of noise and spoofing detection

03

Improved robustness against replay attacks

Abstract

In this paper, we propose a replay attack spoofing detection system for automatic speaker verification using multitask learning of noise classes. We define the noise that is caused by the replay attack as replay noise. We explore the effectiveness of training a deep neural network simultaneously for replay attack spoofing detection and replay noise classification. The multi-task learning includes classifying the noise of playback devices, recording environments, and recording devices as well as the spoofing detection. Each of the three types of the noise classes also includes a genuine class. The experiment results on the ASVspoof2017 datasets demonstrate that the performance of our proposed system is improved by 30% relatively on the evaluation set.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing