ClearMask: Noise-Free and Naturalness-Preserving Protection Against Voice Deepfake Attacks

Yuanda Wang; Bocheng Chen; Hanqing Guo; Guangjing Wang; Weikang Ding; Qiben Yan

arXiv:2508.17660·cs.SD·August 26, 2025

ClearMask: Noise-Free and Naturalness-Preserving Protection Against Voice Deepfake Attacks

Yuanda Wang, Bocheng Chen, Hanqing Guo, Guangjing Wang, Weikang Ding, Qiben Yan

PDF

TL;DR

ClearMask introduces a noise-free, frequency-based audio modification technique combined with style transfer and reverberation to defend against voice deepfake attacks, maintaining natural sound quality and working in real-time.

Contribution

It proposes a novel noise-free defense mechanism, ClearMask, that selectively filters frequencies and applies style transfer to prevent voice deepfakes without degrading audio quality.

Findings

01

Effective against unseen voice synthesis models

02

Resilient to adaptive attackers

03

Works in real-time for streaming speech

Abstract

Voice deepfake attacks, which artificially impersonate human speech for malicious purposes, have emerged as a severe threat. Existing defenses typically inject noise into human speech to compromise voice encoders in speech synthesis models. However, these methods degrade audio quality and require prior knowledge of the attack approaches, limiting their effectiveness in diverse scenarios. Moreover, real-time audios, such as speech in virtual meetings and voice messages, are still exposed to voice deepfake threats. To overcome these limitations, we propose ClearMask, a noise-free defense mechanism against voice deepfake attacks. Unlike traditional approaches, ClearMask modifies the audio mel-spectrogram by selectively filtering certain frequencies, inducing a transferable voice feature loss without injecting noise. We then apply audio style transfer to further deceive voice decoders while…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.