WeDefense: A Toolkit to Defend Against Fake Audio

Lin Zhang; Johan Rohdin; Xin Wang; Junyi Peng; Tianchi Liu; You Zhang; Hieu-Thi Luong; Shuai Wang; Chengdong Liang; Anna Silnova; Nicholas Evans

arXiv:2601.15240·cs.SD·January 22, 2026

Open Access

TL;DR

WeDefense is an open-source toolkit designed to standardize and facilitate the benchmarking, detection, and localization of fake audio generated by AI, supporting fair comparison of different solutions.

Contribution

The paper introduces WeDefense, presented as the first open-source toolkit to support both fake audio detection and localization, with features such as augmentation, calibration, and analysis tools.

Findings

01

Supports both detection and localization of fake audio

02

Provides standardized evaluation metrics and protocols

03

Includes interactive demos for practical use

Abstract

Advances in generative AI have enabled the creation of synthetic audio that is perceptually indistinguishable from genuine audio. Although this progress enables many positive applications, it also raises risks of misuse, such as impersonation, disinformation and fraud. Despite a growing number of open-source fake audio detection codebases released through numerous challenges and initiatives, most are tailored to specific competitions, datasets or models. A standardized and unified toolkit that supports the fair benchmarking and comparison of competing solutions with not just common databases, protocols and metrics but also a shared codebase is missing. To address this, we propose WeDefense, the first open-source toolkit to support both fake audio detection and localization. Beyond model training, WeDefense emphasizes critical yet often overlooked components: flexible…

Taxonomy

Topics: Music and Audio Processing · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis