RTrack: Accelerating Convergence for Visual Object Tracking via Pseudo-Boxes Exploration
Guotian Zeng, Bi Zeng, Hong Zhang, Jianqi Liu, Qingmao Wei

TL;DR
RTrack introduces a novel pseudo-box representation for visual object tracking, significantly reducing training time while achieving state-of-the-art performance and faster convergence.
Contribution
The paper presents RTrack, a new tracker using pseudo bounding boxes and a one-to-many assignment strategy, improving training efficiency and tracking accuracy.
Findings
Achieves state-of-the-art results on GOT-10k dataset.
Reduces training time to 10% of previous SOTA trackers.
Demonstrates faster convergence in training.
Abstract
Single object tracking (SOT) heavily relies on the representation of the target object as a bounding box. However, due to the potential deformation and rotation experienced by the tracked targets, the genuine bounding box fails to capture the appearance information explicitly and introduces cluttered background. This paper proposes RTrack, a novel object representation baseline tracker that utilizes a set of sample points to get a pseudo bounding box. RTrack automatically arranges these points to define the spatial extents and highlight local areas. Building upon the baseline, we conducted an in-depth exploration of the training potential and introduced a one-to-many leading assignment strategy. It is worth noting that our approach achieves competitive performance to the state-of-the-art trackers on the GOT-10k dataset while reducing training time to just 10% of the previous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Neural Network Applications · Air Quality Monitoring and Forecasting
