Exploring Modality-Aware Fusion and Decoupled Temporal Propagation for Multi-Modal Object Tracking
Shilei Wang, Pujian Lai, Dong Gao, Jifeng Ning, Gong Cheng

TL;DR
This paper introduces MDTrack, a multimodal object tracking framework that employs modality-specific fusion and decoupled temporal propagation to improve tracking accuracy across various sensor modalities.
Contribution
The paper proposes a novel modality aware fusion method with expert gating and decoupled temporal propagation using separate state space models for each modality, enhancing temporal feature discrimination.
Findings
Achieves state-of-the-art performance on five multimodal tracking benchmarks.
Effectively captures modality-specific temporal information through decoupled models.
Demonstrates superior adaptive fusion via Mixture of Experts with gating mechanism.
Abstract
Most existing multimodal trackers adopt uniform fusion strategies, overlooking the inherent differences between modalities. Moreover, they propagate temporal information through mixed tokens, leading to entangled and less discriminative temporal representations. To address these limitations, we propose MDTrack, a novel framework for modality aware fusion and decoupled temporal propagation in multimodal object tracking. Specifically, for modality aware fusion, we allocate dedicated experts to each modality, including infrared, event, depth, and RGB, to process their respective representations. The gating mechanism within the Mixture of Experts dynamically selects the optimal experts based on the input features, enabling adaptive and modality specific fusion. For decoupled temporal propagation, we introduce two separate State Space Model structures to independently store and update the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Gaze Tracking and Assistive Technology · Advanced Technologies in Various Fields
