ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild

Xuechen Liu; Xin Wang; Md Sahidullah; Jose Patino; H\'ector Delgado,; Tomi Kinnunen; Massimiliano Todisco; Junichi Yamagishi; Nicholas Evans,; Andreas Nautsch; Kong Aik Lee

arXiv:2210.02437·cs.SD·June 23, 2023

ASVspoof 2021: Towards Spoofed and Deepfake Speech Detection in the Wild

Xuechen Liu, Xin Wang, Md Sahidullah, Jose Patino, H\'ector Delgado,, Tomi Kinnunen, Massimiliano Todisco, Junichi Yamagishi, Nicholas Evans,, Andreas Nautsch, Kong Aik Lee

PDF

1 Repo

TL;DR

The paper summarizes the ASVspoof 2021 challenge, evaluating speech spoofing and deepfake detection methods across various scenarios, highlighting robustness, limitations, and future directions in real-world conditions.

Contribution

It provides a comprehensive overview of the 2021 challenge results, analyzing system performance, robustness issues, and proposing future research directions in speech spoofing detection.

Findings

01

Countermeasures are robust to encoding and transmission effects.

02

Detection of replay attacks is feasible in real environments.

03

Deepfake detection methods struggle with generalization across datasets.

Abstract

Benchmarking initiatives support the meaningful comparison of competing solutions to prominent problems in speech and language processing. Successive benchmarking evaluations typically reflect a progressive evolution from ideal lab conditions towards to those encountered in the wild. ASVspoof, the spoofing and deepfake detection initiative and challenge series, has followed the same trend. This article provides a summary of the ASVspoof 2021 challenge and the results of 54 participating teams that submitted to the evaluation phase. For the logical access (LA) task, results indicate that countermeasures are robust to newly introduced encoding and transmission effects. Results for the physical access (PA) task indicate the potential to detect replay attacks in real, as opposed to simulated physical spaces, but a lack of robustness to variations between simulated and real acoustic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

asvspoof-challenge/2021
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.