Anti-UAV: A Large Multi-Modal Benchmark for UAV Tracking
Nan Jiang, Kuiran Wang, Xiaoke Peng, Xuehui Yu, Qiang Wang, Junliang, Xing, Guorong Li, Jian Zhao, Guodong Guo, Zhenjun Han

TL;DR
This paper introduces Anti-UAV, a large multi-modal dataset for UAV tracking, along with a novel dual-flow semantic consistency method that enhances tracking robustness and accuracy.
Contribution
It provides the first large-scale UAV tracking dataset and proposes a new tracking approach leveraging semantic flow for improved performance.
Findings
Anti-UAV dataset contains over 580k annotated bounding boxes.
The proposed DFSC method improves tracking accuracy on the Anti-UAV benchmark.
Anti-UAV is a challenging dataset that pushes UAV tracking research forward.
Abstract
Unmanned Aerial Vehicle (UAV) offers lots of applications in both commerce and recreation. With this, monitoring the operation status of UAVs is crucially important. In this work, we consider the task of tracking UAVs, providing rich information such as location and trajectory. To facilitate research on this topic, we propose a dataset, Anti-UAV, with more than 300 video pairs containing over 580k manually annotated bounding boxes. The releasing of such a large-scale dataset could be a useful initial step in research of tracking UAVs. Furthermore, the advancement of addressing research challenges in Anti-UAV can help the design of anti-UAV systems, leading to better surveillance of UAVs. Besides, a novel approach named dual-flow semantic consistency (DFSC) is proposed for UAV tracking. Modulated by the semantic flow across video sequences, the tracker learns more robust class-level…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Fire Detection and Safety Systems · Human Pose and Action Recognition
