Anomaly Detection in Aerial Videos with Transformers
Pu Jin, Lichao Mou, Gui-Song Xia, Xiao Xiang Zhu

TL;DR
This paper introduces DroneAnomaly, a new aerial video dataset for anomaly detection, and proposes a Transformer-based model, ANDT, to identify abnormal events by modeling temporal dynamics in UAV footage.
Contribution
The paper presents a novel aerial video dataset and a Transformer-based baseline model for anomaly detection, advancing the capabilities in UAV video analysis.
Findings
Created DroneAnomaly dataset with 87,488 frames from 7 scenes.
Proposed ANDT model using Transformers for anomaly detection.
Benchmark results on multiple datasets demonstrate effectiveness.
Abstract
Unmanned aerial vehicles (UAVs) are widely applied for purposes of inspection, search, and rescue operations by the virtue of low-cost, large-coverage, real-time, and high-resolution data acquisition capacities. Massive volumes of aerial videos are produced in these processes, in which normal events often account for an overwhelming proportion. It is extremely difficult to localize and extract abnormal events containing potentially valuable information from long video streams manually. Therefore, we are dedicated to developing anomaly detection methods to solve this issue. In this paper, we create a new dataset, named DroneAnomaly, for anomaly detection in aerial videos. This dataset provides 37 training video sequences and 22 testing video sequences from 7 different realistic scenes with various anomalous events. There are 87,488 color video frames (51,635 for training and 35,853 for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnomaly Detection Techniques and Applications · Artificial Immune Systems Applications · Video Surveillance and Tracking Methods
MethodsMulti-Head Attention · Attention Is All You Need · Test · Linear Layer · Byte Pair Encoding · Softmax · Dropout · Dense Connections · Residual Connection · Absolute Position Encodings
