Dynamic Attention guided Multi-Trajectory Analysis for Single Object Tracking
Xiao Wang, Zhe Chen, Jin Tang, Bin Luo, Yaowei Wang, Yonghong Tian,, Feng Wu

TL;DR
This paper introduces a dynamic multi-trajectory tracking method that uses multiple templates and attention mechanisms to improve single object tracking, especially under occlusions and out-of-view movements.
Contribution
It proposes a novel dynamic attention-guided multi-trajectory strategy with multiple templates and a selection network, enhancing tracking robustness and accuracy.
Findings
Achieves superior performance on large-scale benchmarks.
Effectively handles occlusions and out-of-view movements.
Maintains diversified tracking results for improved accuracy.
Abstract
Most of the existing single object trackers track the target in a unitary local search window, making them particularly vulnerable to challenging factors such as heavy occlusions and out-of-view movements. Despite the attempts to further incorporate global search, prevailing mechanisms that cooperate local and global search are relatively static, thus are still sub-optimal for improving tracking performance. By further studying the local and global search results, we raise a question: can we allow more dynamics for cooperating both results? In this paper, we propose to introduce more dynamics by devising a dynamic attention-guided multi-trajectory tracking strategy. In particular, we construct dynamic appearance model that contains multiple target templates, each of which provides its own attention for locating the target in the new frame. Guided by different attention, we maintain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Visual Attention and Saliency Detection · Human Pose and Action Recognition
