Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework
Botao Ye, Hong Chang, Bingpeng Ma, Shiguang Shan, and Xilin Chen

TL;DR
This paper introduces OSTrack, a unified one-stream framework for visual tracking that enhances feature discrimination and relation modeling, achieving state-of-the-art results with high efficiency and fast convergence.
Contribution
The paper proposes a novel one-stream tracking framework that unifies feature learning and relation modeling, improving discriminability and efficiency over traditional two-stream methods.
Findings
Achieves 73.7% AO on GOT-10k, surpassing previous best by 4.3%.
Runs at a fast speed due to parallelized implementation.
Maintains a good performance-speed trade-off and faster convergence.
Abstract
The current popular two-stream, two-stage tracking framework extracts the template and the search region features separately and then performs relation modeling, thus the extracted features lack the awareness of the target and have limited target-background discriminability. To tackle the above issue, we propose a novel one-stream tracking (OSTrack) framework that unifies feature learning and relation modeling by bridging the template-search image pairs with bidirectional information flows. In this way, discriminative target-oriented features can be dynamically extracted by mutual guidance. Since no extra heavy relation modeling module is needed and the implementation is highly parallelized, the proposed tracker runs at a fast speed. To further improve the inference efficiency, an in-network candidate early elimination module is proposed based on the strong similarity prior calculated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Chemical Sensor Technologies · Air Quality Monitoring and Forecasting
MethodsArtemisinin Optimization based on Malaria Therapy: Algorithm and Applications to Medical Image Segmentation
