Manipulation Detection in Satellite Images Using Vision Transformer

J\'anos Horv\'ath; Sriram Baireddy; Hanxiang Hao; Daniel Mas; Montserrat; Edward J. Delp

arXiv:2105.06373·eess.IV·May 14, 2021·CVPR

Manipulation Detection in Satellite Images Using Vision Transformer

J\'anos Horv\'ath, Sriram Baireddy, Hanxiang Hao, Daniel Mas, Montserrat, Edward J. Delp

PDF

TL;DR

This paper introduces an unsupervised Vision Transformer-based method for detecting manipulated regions in satellite images, addressing the limitations of existing techniques designed for consumer camera images.

Contribution

It presents a novel unsupervised approach using Vision Transformers and a new dataset for improved manipulation detection in satellite imagery.

Findings

01

Outperforms existing unsupervised detection methods

02

Introduces a new dataset of manipulated satellite images

03

Demonstrates effectiveness of Vision Transformer in this domain

Abstract

A growing number of commercial satellite companies provide easily accessible satellite imagery. Overhead imagery is used by numerous industries including agriculture, forestry, natural disaster analysis, and meteorology. Satellite images, just as any other images, can be tampered with image manipulation tools. Manipulation detection methods created for images captured by "consumer cameras" tend to fail when used on satellite images due to the differences in image sensors, image acquisition, and processing. In this paper we propose an unsupervised technique that uses a Vision Transformer to detect spliced areas within satellite images. We introduce a new dataset which includes manipulated satellite images that contain spliced objects. We show that our proposed approach performs better than existing unsupervised splicing detection techniques.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Dropout · Dense Connections · Adam · Vision Transformer · Layer Normalization · Softmax