A Lightweight and Effective Image Tampering Localization Network with Vision Mamba
Kun Guo, Gang Cao, Zijie Lou, Xianglin Huang, Jiaoyun Liu

TL;DR
This paper introduces ForMa, a lightweight image tampering localization network leveraging vision Mamba for efficient global feature extraction, achieving state-of-the-art results with low computational cost.
Contribution
The paper proposes a novel ForMa network based on vision Mamba, combining multi-scale global features and a noise-assisted decoder for improved tampering localization.
Findings
Achieves state-of-the-art generalization on 10 datasets.
Maintains lowest computational complexity among peers.
Demonstrates robustness and efficiency in tampering detection.
Abstract
Current image tampering localization methods primarily rely on Convolutional Neural Networks (CNNs) and Transformers. While CNNs suffer from limited local receptive fields, Transformers offer global context modeling at the expense of quadratic computational complexity. Recently, the state space model Mamba has emerged as a competitive alternative, enabling linear-complexity global dependency modeling. Inspired by it, we propose a lightweight and effective FORensic network based on vision MAmba (ForMa) for blind image tampering localization. Firstly, ForMa captures multi-scale global features that achieves efficient global dependency modeling through linear complexity. Then the pixel-wise localization map is generated by a lightweight decoder, which employs a parameter-free pixel shuffle layer for upsampling. Additionally, a noise-assisted decoding strategy is proposed to integrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Advanced Steganography and Watermarking Techniques · Image Processing Techniques and Applications
MethodsMamba: Linear-Time Sequence Modeling with Selective State Spaces
