Online Descriptor Enhancement via Self-Labelling Triplets for Visual Data Association
Yorai Shaoul, Katherine Liu, Kyel Ok, Nicholas Roy

TL;DR
This paper introduces a self-supervised online descriptor refinement method for visual data association, enhancing multi-object tracking by adaptively improving descriptors during inference without extensive domain-specific training.
Contribution
It proposes a novel online self-labelling triplet-based training approach that refines deep descriptors in real-time, outperforming existing methods in object tracking tasks.
Findings
Achieves 94% reduction in parameters enabling online optimization.
Improves descriptor quality for multi-object tracking.
Surpasses other data association methods in tracking performance.
Abstract
Object-level data association is central to robotic applications such as tracking-by-detection and object-level simultaneous localization and mapping. While current learned visual data association methods outperform hand-crafted algorithms, many rely on large collections of domain-specific training examples that can be difficult to obtain without prior knowledge. Additionally, such methods often remain fixed during inference-time and do not harness observed information to better their performance. We propose a self-supervised method for incrementally refining visual descriptors to improve performance in the task of object-level visual data association. Our method optimizes deep descriptor generators online, by continuously training a widely available image classification network pre-trained with domain-independent data. We show that earlier layers in the network outperform later-stage…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Advanced Image and Video Retrieval Techniques · Advanced Vision and Imaging
