Digital Audio Tampering Detection Based on ENF Spatio-temporal Features   Representation Learning

Chunyan Zeng; Shuai Kong; Zhifeng Wang; Xiangkui Wan; Yunfan Chen

arXiv:2208.11920·cs.SD·August 26, 2022

Digital Audio Tampering Detection Based on ENF Spatio-temporal Features Representation Learning

Chunyan Zeng, Shuai Kong, Zhifeng Wang, Xiangkui Wan, Yunfan Chen

PDF

Open Access

TL;DR

This paper introduces a novel ENF spatio-temporal feature learning method using CNN and BiLSTM for digital audio tampering detection, significantly improving accuracy by capturing dynamic ENF variations.

Contribution

It proposes a parallel spatio-temporal network model that extracts both spatial and temporal ENF features for enhanced tampering detection accuracy.

Findings

01

Improves detection accuracy by 2.12%-7.12% on public datasets.

02

Effectively captures ENF variations in both space and time.

03

Utilizes attention mechanism for better feature weighting.

Abstract

Most digital audio tampering detection methods based on electrical network frequency (ENF) only utilize the static spatial information of ENF, ignoring the variation of ENF in time series, which limit the ability of ENF feature representation and reduce the accuracy of tampering detection. This paper proposes a new method for digital audio tampering detection based on ENF spatio-temporal features representation learning. A parallel spatio-temporal network model is constructed using CNN and BiLSTM, which deeply extracts ENF spatial feature information and ENF temporal feature information to enhance the feature representation capability to improve the tampering detection accuracy. In order to extract the spatial and temporal features of the ENF, this paper firstly uses digital audio high-precision Discrete Fourier Transform analysis to extract the phase sequences of the ENF. The unequal…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Music and Audio Processing