TAME: Temporal Audio-based Mamba for Enhanced Drone Trajectory Estimation and Classification
Zhenyuan Xiao, Huanran Hu, Guili Xu, Junwei He

TL;DR
TAME is a novel audio-based drone detection system that improves trajectory estimation and classification by capturing temporal and spectral features using advanced neural modules, outperforming existing benchmarks.
Contribution
Introduces TAME, a new model combining a parallel selective state-space approach with a Temporal Feature Enhancement Module for improved drone detection.
Findings
Achieves superior accuracy on MMUAD benchmarks.
Effectively captures both temporal and spectral audio features.
Provides publicly available code and models.
Abstract
The increasing prevalence of compact UAVs has introduced significant risks to public safety, while traditional drone detection systems are often bulky and costly. To address these challenges, we present TAME, the Temporal Audio-based Mamba for Enhanced Drone Trajectory Estimation and Classification. This innovative anti-UAV detection model leverages a parallel selective state-space model to simultaneously capture and learn both the temporal and spectral features of audio, effectively analyzing propagation of sound. To further enhance temporal features, we introduce a Temporal Feature Enhancement Module, which integrates spectral features into temporal data using residual cross-attention. This enhanced temporal information is then employed for precise 3D trajectory estimation and classification. Our model sets a new standard of performance on the MMUAD benchmarks, demonstrating superior…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods
