A Tri-Modal Dataset and a Baseline System for Tracking Unmanned Aerial Vehicles
Tianyang Xu, Jinjie Gu, Xuefeng Zhu, XiaoJun Wu, Josef Kittler

TL;DR
This paper introduces MM-UAV, a large-scale multi-modal dataset for UAV tracking, along with a baseline tracking framework that leverages RGB, IR, and event data to improve robustness in challenging conditions.
Contribution
The paper provides the first comprehensive multi-modal UAV tracking dataset and a novel baseline framework with innovative modules for sensor alignment, fusion, and event-based identity tracking.
Findings
The proposed framework outperforms existing methods in challenging scenarios.
Multi-modal data significantly improves UAV tracking robustness.
The dataset enables future research in multi-modal UAV tracking.
Abstract
With the proliferation of low altitude unmanned aerial vehicles (UAVs), visual multi-object tracking is becoming a critical security technology, demanding significant robustness even in complex environmental conditions. However, tracking UAVs using a single visual modality often fails in challenging scenarios, such as low illumination, cluttered backgrounds, and rapid motion. Although multi-modal multi-object UAV tracking is more resilient, the development of effective solutions has been hindered by the absence of dedicated public datasets. To bridge this gap, we release MM-UAV, the first large-scale benchmark for Multi-Modal UAV Tracking, integrating three key sensing modalities, e.g. RGB, infrared (IR), and event signals. The dataset spans over 30 challenging scenarios, with 1,321 synchronised multi-modal sequences, and more than 2.8 million annotated frames. Accompanying the dataset,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · UAV Applications and Optimization · Advanced Neural Network Applications
