Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score   Fusion

Yuxiang Zhang; Jingze Lu; Xingming Wang; Zhuo Li; Runqiu Xiao; Wenchao; Wang; Ming Li; Pengyuan Zhang

arXiv:2210.06818·eess.AS·October 14, 2022

Deepfake Detection System for the ADD Challenge Track 3.2 Based on Score Fusion

Yuxiang Zhang, Jingze Lu, Xingming Wang, Zhuo Li, Runqiu Xiao, Wenchao, Wang, Ming Li, Pengyuan Zhang

PDF

TL;DR

This paper presents a deepfake audio detection system using score fusion of multiple LCNN models with various features, achieving top performance in the ADD Challenge Track 3.2.

Contribution

It introduces a score-level fusion approach with diverse features and analyzes the impact of overfitting and data augmentation on detection robustness.

Findings

01

Score fusion improves detection accuracy.

02

Overfitting causes score distribution issues.

03

Data augmentation enhances robustness.

Abstract

This paper describes the deepfake audio detection system submitted to the Audio Deep Synthesis Detection (ADD) Challenge Track 3.2 and gives an analysis of score fusion. The proposed system is a score-level fusion of several light convolutional neural network (LCNN) based models. Various front-ends are used as input features, including low-frequency short-time Fourier transform and Constant Q transform. Due to the complex noise and rich synthesis algorithms, it is difficult to obtain the desired performance using the training set directly. Online data augmentation methods effectively improve the robustness of fake audio detection systems. In particular, the reasons for the poor improvement of score fusion are explored through visualization of the score distributions and comparison with score distribution on another dataset. The overfitting of the model to the training set leads to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.