Less is More: Token Context-aware Learning for Object Tracking
Chenlong Xu, Bineng Zhong, Qihua Liang, Yaozong Zheng, Guorong Li,, Shuxiang Song

TL;DR
This paper introduces LMTrack, a novel object tracking method that selectively learns high-quality reference tokens using a token memory and attention mechanism, leading to improved accuracy and robustness in visual tracking.
Contribution
The paper proposes a token context-aware tracking pipeline with a new memory module and attention mechanism to automatically select relevant tokens, reducing noise and redundancy.
Findings
Achieves state-of-the-art results on GOT-10K, TrackingNet, and LaSOT.
Effectively filters background noise to improve tracking accuracy.
Demonstrates robustness across various challenging tracking scenarios.
Abstract
Recently, several studies have shown that utilizing contextual information to perceive target states is crucial for object tracking. They typically capture context by incorporating multiple video frames. However, these naive frame-context methods fail to consider the importance of each patch within a reference frame, making them susceptible to noise and redundant tokens, which deteriorates tracking performance. To address this challenge, we propose a new token context-aware tracking pipeline named LMTrack, designed to automatically learn high-quality reference tokens for efficient visual tracking. Embracing the principle of Less is More, the core idea of LMTrack is to analyze the importance distribution of all reference tokens, where important tokens are collected, continually attended to, and updated. Specifically, a novel Token Context Memory module is designed to dynamically collect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Energy Efficient Wireless Sensor Networks · Context-Aware Activity Recognition Systems
MethodsSoftmax · Attention Is All You Need
