AFter: Attention-based Fusion Router for RGBT Tracking
Andong Lu, Wanyu Wang, Chenglong Li, Jin Tang, Bin Luo

TL;DR
The paper introduces AFter, an attention-based fusion router that dynamically adapts multi-modal feature fusion structures for robust RGBT tracking, outperforming existing methods across multiple datasets.
Contribution
It proposes a novel attention-based fusion router with a dynamic routing algorithm to optimize fusion structures adaptively in RGBT tracking.
Findings
Outperforms state-of-the-art RGBT trackers on five datasets.
Effectively adapts to various challenging scenarios.
Demonstrates superior robustness and accuracy.
Abstract
Multi-modal feature fusion as a core investigative component of RGBT tracking emerges numerous fusion studies in recent years. However, existing RGBT tracking methods widely adopt fixed fusion structures to integrate multi-modal feature, which are hard to handle various challenges in dynamic scenarios. To address this problem, this work presents a novel \emph{A}ttention-based \emph{F}usion rou\emph{ter} called AFter, which optimizes the fusion structure to adapt to the dynamic challenging scenarios, for robust RGBT tracking. In particular, we design a fusion structure space based on the hierarchical attention network, each attention-based fusion unit corresponding to a fusion operation and a combination of these attention units corresponding to a fusion structure. Through optimizing the combination of attention-based fusion units, we can dynamically select the fusion structure to adapt…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBrain Tumor Detection and Classification · Video Surveillance and Tracking Methods
