Amplifying Artifacts with Speech Enhancement in Voice Anti-spoofing

Thanapat Trachu; Thanathai Lertpetchpun; Ekapol Chuangsuwanich

arXiv:2506.11542·cs.SD·June 16, 2025

Amplifying Artifacts with Speech Enhancement in Voice Anti-spoofing

Thanapat Trachu, Thanathai Lertpetchpun, Ekapol Chuangsuwanich

PDF

Open Access

TL;DR

This paper presents a novel pipeline that amplifies artifacts in spoofed speech using noise addition and speech enhancement, significantly improving the accuracy of voice anti-spoofing systems.

Contribution

We introduce a model-agnostic artifact amplification pipeline that enhances spoof detection by leveraging speech enhancement and noise manipulation techniques.

Findings

01

Improves spoof detection accuracy by up to 44.44% on ASVspoof2019.

02

Enhances detection performance by 26.34% on ASVspoof2021.

03

Compatible with various speech enhancement models and countermeasure architectures.

Abstract

Spoofed utterances always contain artifacts introduced by generative models. While several countermeasures have been proposed to detect spoofed utterances, most primarily focus on architectural improvements. In this work, we investigate how artifacts remain hidden in spoofed speech and how to enhance their presence. We propose a model-agnostic pipeline that amplifies artifacts using speech enhancement and various types of noise. Our approach consists of three key steps: noise addition, noise extraction, and noise amplification. First, we introduce noise into the raw speech. Then, we apply speech enhancement to extract the entangled noise and artifacts. Finally, we amplify these extracted features. Moreover, our pipeline is compatible with different speech enhancement models and countermeasure architectures. Our method improves spoof detection performance by up to 44.44\% on ASVspoof2019…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and Audio Processing · Speech Recognition and Synthesis