DeformTrace: A Deformable State Space Model with Relay Tokens for Temporal Forgery Localization
Xiaodong Zhu, Suting Wang, Yuanming Zheng, Junqi Yang, Yangxu Liao, Yuhong Yang, Weiping Tu, Zhongyuan Wang

TL;DR
DeformTrace introduces a deformable state space model with relay tokens that significantly improves the accuracy and efficiency of temporal forgery localization in videos and audio, addressing boundary ambiguity and sparse forgeries.
Contribution
The paper presents DeformTrace, a novel hybrid model combining deformable SSMs and relay mechanisms for enhanced temporal reasoning and forgery detection.
Findings
Achieves state-of-the-art accuracy in TFL tasks.
Fewer parameters and faster inference compared to existing methods.
Demonstrates robustness against sparse and ambiguous forgeries.
Abstract
Temporal Forgery Localization (TFL) aims to precisely identify manipulated segments in video and audio, offering strong interpretability for security and forensics. While recent State Space Models (SSMs) show promise in precise temporal reasoning, their use in TFL is hindered by ambiguous boundaries, sparse forgeries, and limited long-range modeling. We propose DeformTrace, which enhances SSMs with deformable dynamics and relay mechanisms to address these challenges. Specifically, Deformable Self-SSM (DS-SSM) introduces dynamic receptive fields into SSMs for precise temporal localization. To further enhance its capacity for temporal reasoning and mitigate long-range decay, a Relay Token Mechanism is integrated into DS-SSM. Besides, Deformable Cross-SSM (DC-SSM) partitions the global state space into query-specific subspaces, reducing non-forgery information accumulation and boosting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning
