Learning Localization-aware Target Confidence for Siamese Visual Tracking
Jiahao Nie, Han Wu, Zhiwei He, Yuxiang Yang, Mingyu Gao, Zhekang Dong

TL;DR
This paper introduces SiamLA, a novel Siamese tracking paradigm that enhances target confidence scores by integrating localization-awareness, leading to improved accuracy and stability in visual tracking tasks.
Contribution
The paper proposes localization-aware components, including LADL, LALS, and LAFA, to align classification and regression tasks, improving target confidence estimation in Siamese trackers.
Findings
Achieves state-of-the-art results on six benchmarks.
Improves accuracy and efficiency of visual tracking.
Demonstrates stability suitable for real-world applications.
Abstract
Siamese tracking paradigm has achieved great success, providing effective appearance discrimination and size estimation by the classification and regression. While such a paradigm typically optimizes the classification and regression independently, leading to task misalignment (accurate prediction boxes have no high target confidence scores). In this paper, to alleviate this misalignment, we propose a novel tracking paradigm, called SiamLA. Within this paradigm, a series of simple, yet effective localization-aware components are introduced, to generate localization-aware target confidence scores. Specifically, with the proposed localization-aware dynamic label (LADL) loss and localization-aware label smoothing (LALS) strategy, collaborative optimization between the classification and regression is achieved, enabling classification scores to be aware of location state, not just…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods
MethodsLabel Smoothing · Attentive Walk-Aggregating Graph Neural Network
