Online Unsupervised Feature Learning for Visual Tracking
Fayao Liu, Chunhua Shen, Ian Reid, Anton van den Hengel

TL;DR
This paper introduces a simple yet effective online unsupervised feature learning framework for visual tracking, outperforming state-of-the-art methods by capturing appearance changes through dictionary learning and spatial pooling.
Contribution
It presents a novel online feature learning pipeline integrated into a tracking-by-detection framework, demonstrating superior performance and flexibility in visual tracking tasks.
Findings
Outperforms all tested state-of-the-art trackers
Effective in capturing appearance and background changes
Flexible integration with existing tracking methods
Abstract
Feature encoding with respect to an over-complete dictionary learned by unsupervised methods, followed by spatial pyramid pooling, and linear classification, has exhibited powerful strength in various vision applications. Here we propose to use the feature learning pipeline for visual tracking. Tracking is implemented using tracking-by-detection and the resulted framework is very simple yet effective. First, online dictionary learning is used to build a dictionary, which captures the appearance changes of the tracking target as well as the background changes. Given a test image window, we extract local image patches from it and each local patch is encoded with respect to the dictionary. The encoded features are then pooled over a spatial pyramid to form an aggregated feature vector. Finally, a simple linear classifier is trained on these features. Our experiments show that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
