Texture-based Presentation Attack Detection for Automatic Speaker   Verification

Lazaro J. Gonzalez-Soler; Jose Patino; Marta Gomez-Barrero and; Massimiliano Todisco; Christoph Busch; Nicholas Evans

arXiv:2010.04038·cs.SD·October 9, 2020

Texture-based Presentation Attack Detection for Automatic Speaker Verification

Lazaro J. Gonzalez-Soler, Jose Patino, Marta Gomez-Barrero and, Massimiliano Todisco, Christoph Busch, Nicholas Evans

PDF

TL;DR

This paper introduces a texture descriptor-based approach using Fisher vector features for presentation attack detection in automatic speaker verification, demonstrating improved generalisability and accuracy in distinguishing genuine from attack speech.

Contribution

It proposes a novel texture analysis method with Fisher vector features for PAD, enhancing detection robustness beyond ensemble classifier approaches.

Findings

01

High detection accuracy with only 16% false rejections of genuine presentations

02

Low false acceptance rate of 1% for attack presentations

03

Effective generalisability across different attack types

Abstract

Biometric systems are nowadays employed across a broad range of applications. They provide high security and efficiency and, in many cases, are user friendly. Despite these and other advantages, biometric systems in general and Automatic speaker verification (ASV) systems in particular can be vulnerable to attack presentations. The most recent ASVSpoof 2019 competition showed that most forms of attacks can be detected reliably with ensemble classifier-based presentation attack detection (PAD) approaches. These, though, depend fundamentally upon the complementarity of systems in the ensemble. With the motivation to increase the generalisability of PAD solutions, this paper reports our exploration of texture descriptors applied to the analysis of speech spectrogram images. In particular, we propose a common fisher vector feature space based on a generative model. Experimental results show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.