Manipulation Detection in Satellite Images Using Vision Transformer
J\'anos Horv\'ath, Sriram Baireddy, Hanxiang Hao, Daniel Mas, Montserrat, Edward J. Delp

TL;DR
This paper introduces an unsupervised Vision Transformer-based method for detecting manipulated regions in satellite images, addressing the limitations of existing techniques designed for consumer camera images.
Contribution
It presents a novel unsupervised approach using Vision Transformers and a new dataset for improved manipulation detection in satellite imagery.
Findings
Outperforms existing unsupervised detection methods
Introduces a new dataset of manipulated satellite images
Demonstrates effectiveness of Vision Transformer in this domain
Abstract
A growing number of commercial satellite companies provide easily accessible satellite imagery. Overhead imagery is used by numerous industries including agriculture, forestry, natural disaster analysis, and meteorology. Satellite images, just as any other images, can be tampered with image manipulation tools. Manipulation detection methods created for images captured by "consumer cameras" tend to fail when used on satellite images due to the differences in image sensors, image acquisition, and processing. In this paper we propose an unsupervised technique that uses a Vision Transformer to detect spliced areas within satellite images. We introduce a new dataset which includes manipulated satellite images that contain spliced objects. We show that our proposed approach performs better than existing unsupervised splicing detection techniques.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsMulti-Head Attention · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dropout · Dense Connections · Adam · Vision Transformer · Layer Normalization · Softmax
