# Spoof detection using time-delay shallow neural network and feature   switching

**Authors:** Mari Ganesh Kumar, Suvidha Rupesh Kumar, Saranya M, B. Bharathi, Hema, A. Murthy

arXiv: 1904.07453 · 2020-07-28

## TL;DR

This paper introduces a time-delay shallow neural network (TD-SNN) for detecting spoofed speech in voice biometrics, effectively handling variable-length utterances and improving detection performance over traditional GMM methods.

## Contribution

The paper proposes a novel TD-SNN approach that manages variable-length inputs and combines feature switching to enhance spoof detection accuracy in voice biometrics.

## Key findings

- TD-SNN outperforms GMM in physical access spoof detection.
- Feature switching significantly improves overall detection performance.
- Best system achieves nearly 50% relative improvement over baseline GMMs.

## Abstract

Detecting spoofed utterances is a fundamental problem in voice-based biometrics. Spoofing can be performed either by logical accesses like speech synthesis, voice conversion or by physical accesses such as replaying the pre-recorded utterance. Inspired by the state-of-the-art \emph{x}-vector based speaker verification approach, this paper proposes a time-delay shallow neural network (TD-SNN) for spoof detection for both logical and physical access. The novelty of the proposed TD-SNN system vis-a-vis conventional DNN systems is that it can handle variable length utterances during testing. Performance of the proposed TD-SNN systems and the baseline Gaussian mixture models (GMMs) is analyzed on the ASV-spoof-2019 dataset. The performance of the systems is measured in terms of the minimum normalized tandem detection cost function (min-t-DCF). When studied with individual features, the TD-SNN system consistently outperforms the GMM system for physical access. For logical access, GMM surpasses TD-SNN systems for certain individual features. When combined with the decision-level feature switching (DLFS) paradigm, the best TD-SNN system outperforms the best baseline GMM system on evaluation data with a relative improvement of 48.03\% and 49.47\% for both logical and physical access, respectively.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.07453/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1904.07453/full.md

## References

31 references — full list in the complete paper: https://tomesphere.com/paper/1904.07453/full.md

---
Source: https://tomesphere.com/paper/1904.07453