SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection

Piotr Kawa; Marcin Plata; Piotr Syga

arXiv:2210.06105·cs.SD·October 13, 2022

SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection

Piotr Kawa, Marcin Plata, Piotr Syga

PDF

Open Access 1 Repo

TL;DR

SpecRNet is a fast, low-resource neural network architecture for audio DeepFake detection, achieving comparable accuracy to state-of-the-art models while significantly reducing inference time, making it more accessible for widespread use.

Contribution

The paper introduces SpecRNet, a novel neural network architecture that offers faster inference and lower computational requirements for audio DeepFake detection without sacrificing accuracy.

Findings

01

SpecRNet reduces processing time by up to 40% compared to LCNN.

02

SpecRNet maintains performance comparable to top DeepFake detection models.

03

Benchmarks confirm SpecRNet's effectiveness across various low-resource and attack scenarios.

Abstract

Audio DeepFakes are utterances generated with the use of deep neural networks. They are highly misleading and pose a threat due to use in fake news, impersonation, or extortion. In this work, we focus on increasing accessibility to the audio DeepFake detection methods by providing SpecRNet, a neural network architecture characterized by a quick inference time and low computational requirements. Our benchmark shows that SpecRNet, requiring up to about 40% less time to process an audio sample, provides performance comparable to LCNN architecture - one of the best audio DeepFake detection models. Such a method can not only be used by online multimedia services to verify a large bulk of content uploaded daily but also, thanks to its low requirements, by average citizens to evaluate materials on their devices. In addition, we provide benchmarks in three unique settings that confirm the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

piotrkawa/specrnet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech Recognition and Synthesis · Music and Audio Processing · Speech and Audio Processing