Self-taught learning of a deep invariant representation for visual tracking via temporal slowness principle
Jason Kuen, Kian Ming Lim, Chin Poo Lee

TL;DR
This paper introduces a deep invariant visual representation learned via temporal slowness and autoencoders, improving tracking robustness by capturing motion-invariant features and integrating them into a particle filter framework.
Contribution
It proposes a novel method to learn complex-valued invariant features from unlabeled data using temporal slowness and autoencoders, enhancing visual tracking performance.
Findings
Outperforms several state-of-the-art trackers on benchmark sequences.
Learns robust invariant features that improve tracking under appearance changes.
Integrates deep representations into a particle filter for effective tracking.
Abstract
Visual representation is crucial for a visual tracking method's performances. Conventionally, visual representations adopted in visual tracking rely on hand-crafted computer vision descriptors. These descriptors were developed generically without considering tracking-specific information. In this paper, we propose to learn complex-valued invariant representations from tracked sequential image patches, via strong temporal slowness constraint and stacked convolutional autoencoders. The deep slow local representations are learned offline on unlabeled data and transferred to the observational model of our proposed tracker. The proposed observational model retains old training samples to alleviate drift, and collect negative samples which are coherent with target's motion pattern for better discriminative tracking. With the learned representation and online training samples, a logistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLogistic Regression
