Target Transformed Regression for Accurate Tracking
Yutao Cui, Cheng Jiang, Limin Wang, Gangshan Wu

TL;DR
This paper introduces Target Transformed Regression (TREG), a Transformer-inspired method for anchor-free visual tracking that models pair-wise relations to improve bounding box accuracy amidst appearance and deformation challenges.
Contribution
It proposes a novel TREG framework that leverages pair-wise relation modeling and a template update mechanism for improved accuracy and robustness in tracking.
Findings
Achieves state-of-the-art performance on multiple benchmarks.
Runs at around 30 FPS, suitable for real-time applications.
Demonstrates robustness to appearance variations and deformations.
Abstract
Accurate tracking is still a challenging task due to appearance variations, pose and view changes, and geometric deformations of target in videos. Recent anchor-free trackers provide an efficient regression mechanism but fail to produce precise bounding box estimation. To address these issues, this paper repurposes a Transformer-alike regression branch, termed as Target Transformed Regression (TREG), for accurate anchor-free tracking. The core to our TREG is to model pair-wise relation between elements in target template and search region, and use the resulted target enhanced visual representation for accurate bounding box regression. This target contextualized representation is able to enhance the target relevant information to help precisely locate the box boundaries, and deal with the object deformation to some extent due to its local and dense matching mechanism. In addition, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Face recognition and analysis · Human Pose and Action Recognition
