TL;DR
MVSS-Net introduces a multi-view, multi-scale deep learning approach for robust, generalizable image manipulation detection, effectively identifying tampering artifacts across diverse datasets and common image distortions.
Contribution
The paper presents MVSS-Net, a novel multi-view, multi-scale neural network that improves generalization and robustness in image manipulation detection tasks.
Findings
MVSS-Net++ outperforms existing methods in cross-dataset tests.
The approach is robust against JPEG compression, Gaussian blur, and re-capturing.
Multi-view and multi-scale supervision enhance detection accuracy.
Abstract
As manipulating images by copy-move, splicing and/or inpainting may lead to misinterpretation of the visual content, detecting these sorts of manipulations is crucial for media forensics. Given the variety of possible attacks on the content, devising a generic method is nontrivial. Current deep learning based methods are promising when training and test data are well aligned, but perform poorly on independent tests. Moreover, due to the absence of authentic test images, their image-level detection specificity is in doubt. The key question is how to design and train a deep neural network capable of learning generalizable features sensitive to manipulations in novel data, whilst specific to prevent false alarms on the authentic. We propose multi-view feature learning to jointly exploit tampering boundary artifacts and the noise view of the input image. As both clues are meant to be…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInpainting · Dense Connections
