UAV traffic scene understanding: A regulation embedded multi-modal network and a unified benchmark
Yu Zhang, Zhicheng Zhao, Ze Luo, Chenglong Li, Jin Tang

TL;DR
This paper introduces MTCNet, a multi-modal network with regulatory knowledge embedding and spectral compensation, to improve UAV traffic scene understanding under challenging conditions, supported by a large-scale optical-thermal benchmark.
Contribution
We propose a novel multi-modal network with regulatory knowledge embedding and spectral compensation modules, and establish the first large-scale optical-thermal UAV traffic understanding benchmark.
Findings
MTCNet outperforms existing methods in traffic scene cognition.
The PGKE module effectively incorporates domain-specific traffic regulations.
QASC enhances robustness under adverse environmental conditions.
Abstract
Traffic scene understanding from unmanned aerial vehicle (UAV) platforms is crucial for intelligent transportation systems due to its flexible deployment and wide-area monitoring capabilities. However, existing methods face significant challenges in real-world surveillance, as their heavy reliance on optical imagery leads to severe performance degradation under adverse illumination conditions like nighttime and fog. Furthermore, current Visual Question Answering (VQA) models are restricted to elementary perception tasks, lacking the domain-specific regulatory knowledge required to assess complex traffic behaviors. To address these limitations, we propose a novel Multi-modal Traffic Cognition Network (MTCNet) for robust UAV traffic scene understanding. Specifically, we design a Prototype-Guided Knowledge Embedding (PGKE) module that leverages high-level semantic prototypes from an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Autonomous Vehicle Technology and Safety
