How Far are Modern Trackers from UAV-Anti-UAV? A Million-Scale Benchmark and New Baseline
Chunhui Zhang, Li Liu, Zhipeng Zhang, Yong Wang, Hao Wen, Xi Zhou, Shiming Ge, Yanfeng Wang

TL;DR
This paper introduces a new challenging UAV-Anti-UAV tracking task, creates a large-scale dataset, and proposes a baseline method that leverages spatial, temporal, and semantic features to improve tracking accuracy.
Contribution
It presents the first large-scale UAV-Anti-UAV dataset and a novel baseline method combining Mamba and Transformer models for integrated spatial-temporal-semantic learning.
Findings
Significant room for improvement in UAV-Anti-UAV tracking algorithms.
The proposed MambaSTS baseline outperforms existing methods on the new dataset.
The dataset and code are publicly available for further research.
Abstract
Unmanned Aerial Vehicles (UAVs) offer wide-ranging applications but also pose significant safety and privacy violation risks in areas like airport and infrastructure inspection, spurring the rapid development of Anti-UAV technologies in recent years. However, current Anti-UAV research primarily focuses on RGB, infrared (IR), or RGB-IR videos captured by fixed ground cameras, with little attention to tracking target UAVs from another moving UAV platform. To fill this gap, we propose a new multi-modal visual tracking task termed UAV-Anti-UAV, which involves a pursuer UAV tracking a target adversarial UAV in the video stream. Compared to existing Anti-UAV tasks, UAV-Anti-UAV is more challenging due to severe dual-dynamic disturbances caused by the rapid motion of both the capturing platform and the target. To advance research in this domain, we construct a million-scale dataset consisting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsUAV Applications and Optimization · Video Surveillance and Tracking Methods · Advanced Neural Network Applications
