Audio Splicing Detection and Localization Using Environmental Signature
Hong Zhao, Yifan Chen, Rui Wang, Hafiz Malik

TL;DR
This paper introduces a novel environmental signature-based method for detecting and localizing audio splicing, demonstrating high accuracy and robustness against compression attacks in various acoustic environments.
Contribution
The paper proposes using acoustic channel impulse response and ambient noise as environmental signatures for audio splicing detection and localization, outperforming previous methods.
Findings
High detection and localization accuracy near perfect
Robustness to MP3 compression attacks
Superior performance compared to previous schemes
Abstract
Audio splicing is one of the most common manipulation techniques in the area of audio forensics. In this paper, the magnitudes of acoustic channel impulse response and ambient noise are proposed as the environmental signature. Specifically, the spliced audio segments are detected according to the magnitude correlation between the query frames and reference frames via a statically optimal threshold. The detection accuracy is further refined by comparing the adjacent frames. The effectiveness of the proposed method is tested on two data sets. One is generated from TIMIT database, and the other one is made in four acoustic environments using a commercial grade microphones. Experimental results show that the proposed method not only detects the presence of spliced frames, but also localizes the forgery segments with near perfect accuracy. Comparison results illustrate that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Music and Audio Processing · Speech and Audio Processing
