Visual Tracking via Shallow and Deep Collaborative Model
Bohan Zhuang, Lijun Wang, Huchuan Lu

TL;DR
This paper introduces a robust visual tracking method that combines shallow generative and deep discriminative models, utilizing incremental learning and online fine-tuning to handle occlusion and appearance changes effectively.
Contribution
It presents a novel collaborative model integrating shallow and deep features with incremental learning and online adaptation for improved tracking performance.
Findings
Outperforms state-of-the-art trackers on challenging sequences.
Effectively handles occlusion and appearance variations.
Demonstrates robustness and accuracy through extensive evaluations.
Abstract
In this paper, we propose a robust tracking method based on the collaboration of a generative model and a discriminative classifier, where features are learned by shallow and deep architectures, respectively. For the generative model, we introduce a block-based incremental learning scheme, in which a local binary mask is constructed to deal with occlusion. The similarity degrees between the local patches and their corresponding subspace are integrated to formulate a more accurate global appearance model. In the discriminative model, we exploit the advances of deep learning architectures to learn generic features which are robust to both background clutters and foreground appearance variations. To this end, we first construct a discriminative training set from auxiliary video sequences. A deep classification neural network is then trained offline on this training set. Through online…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Face recognition and analysis · Visual Attention and Saliency Detection
