Audio Tampering Detection Based on Shallow and Deep Feature   Representation Learning

Zhifeng Wang; Yao Yang; Chunyan Zeng; Shuai Kong; Shixiong Feng; Nan; Zhao

arXiv:2210.10506·cs.SD·October 20, 2022

Audio Tampering Detection Based on Shallow and Deep Feature Representation Learning

Zhifeng Wang, Yao Yang, Chunyan Zeng, Shuai Kong, Shixiong Feng, Nan, Zhao

PDF

Open Access

TL;DR

This paper introduces a fusion approach combining shallow and deep features to improve digital audio tampering detection accuracy, outperforming existing methods across multiple datasets.

Contribution

It proposes a novel fusion method leveraging complementary features at different levels to enhance ENF-based audio tampering detection accuracy.

Findings

01

Achieved 97.03% accuracy on three classic databases

02

Achieved 88.31% accuracy on the GAUDI-DI database

03

Outperforms state-of-the-art methods in detection accuracy

Abstract

Digital audio tampering detection can be used to verify the authenticity of digital audio. However, most current methods use standard electronic network frequency (ENF) databases for visual comparison analysis of ENF continuity of digital audio or perform feature extraction for classification by machine learning methods. ENF databases are usually tricky to obtain, visual methods have weak feature representation, and machine learning methods have more information loss in features, resulting in low detection accuracy. This paper proposes a fusion method of shallow and deep features to fully use ENF information by exploiting the complementary nature of features at different levels to more accurately describe the changes in inconsistency produced by tampering operations to raw digital audio. The method achieves 97.03% accuracy on three classic databases: Carioca 1, Carioca 2, and New…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Music and Audio Processing · Speech and Audio Processing