MFAAN: Unveiling Audio Deepfakes with a Multi-Feature Authenticity   Network

Karthik Sivarama Krishnan; Koushik Sivarama Krishnan

arXiv:2311.03509·cs.SD·February 28, 2024·1 cites

MFAAN: Unveiling Audio Deepfakes with a Multi-Feature Authenticity Network

Karthik Sivarama Krishnan, Koushik Sivarama Krishnan

PDF

Open Access

TL;DR

This paper introduces MFAAN, a multi-feature neural network that effectively detects audio deepfakes by combining various audio representations, achieving high accuracy on benchmark datasets and enhancing the fight against manipulated audio content.

Contribution

MFAAN is a novel architecture that fuses multiple audio features for robust deepfake detection, outperforming existing methods on standard datasets.

Findings

01

Achieved 98.93% accuracy on 'In-the-Wild' dataset.

02

Achieved 94.47% accuracy on Fake-or-Real dataset.

03

Demonstrated superior performance over baseline models.

Abstract

In the contemporary digital age, the proliferation of deepfakes presents a formidable challenge to the sanctity of information dissemination. Audio deepfakes, in particular, can be deceptively realistic, posing significant risks in misinformation campaigns. To address this threat, we introduce the Multi-Feature Audio Authenticity Network (MFAAN), an advanced architecture tailored for the detection of fabricated audio content. MFAAN incorporates multiple parallel paths designed to harness the strengths of different audio representations, including Mel-frequency cepstral coefficients (MFCC), linear-frequency cepstral coefficients (LFCC), and Chroma Short Time Fourier Transform (Chroma-STFT). By synergistically fusing these features, MFAAN achieves a nuanced understanding of audio content, facilitating robust differentiation between genuine and manipulated recordings. Preliminary…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Digital Media Forensic Detection · Diverse Musicological Studies