Audio-replay attack detection countermeasures
Galina Lavrentyeva, Sergey Novoselov, Egor Malykh, Alexander Kozlov,, Oleg Kudashev, Vadim Shchemelinin

TL;DR
This paper compares various spoofing detection methods for speaker verification, highlighting the effectiveness of deep learning approaches under changing acoustic conditions and the importance of feature-based classifiers.
Contribution
It introduces and evaluates multiple spoofing detection approaches, emphasizing the superior stability of deep learning methods and the role of high-level features with SVM classifiers.
Findings
Deep learning approaches show stable efficiency across acoustic conditions.
SVM classifiers with high-level features significantly improve detection performance.
Fusion systems benefit from combining different spoofing detection methods.
Abstract
This paper presents the Speech Technology Center (STC) replay attack detection systems proposed for Automatic Speaker Verification Spoofing and Countermeasures Challenge 2017. In this study we focused on comparison of different spoofing detection approaches. These were GMM based methods, high level features extraction with simple classifier and deep learning frameworks. Experiments performed on the development and evaluation parts of the challenge dataset demonstrated stable efficiency of deep learning approaches in case of changing acoustic conditions. At the same time SVM classifier with high level features provided a substantial input in the efficiency of the resulting STC systems according to the fusion systems results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing
MethodsSupport Vector Machine
