Track, then Decide: Category-Agnostic Vision-based Multi-Object Tracking

Aljo\v{s}a O\v{s}ep; Wolfgang Mehner; Paul Voigtlaender; Bastian Leibe

arXiv:1712.07920·cs.CV·December 22, 2017

Track, then Decide: Category-Agnostic Vision-based Multi-Object Tracking

Aljo\v{s}a O\v{s}ep, Wolfgang Mehner, Paul Voigtlaender, Bastian Leibe

PDF

TL;DR

This paper introduces a category-agnostic, model-free multi-object tracking method using segmentation masks, capable of tracking known and unknown objects without relying on specific detectors, suitable for complex environments.

Contribution

It proposes a novel segmentation mask-based tracker that can track both known and unknown objects, eliminating the need for category-specific detectors.

Findings

01

Achieves comparable performance to state-of-the-art methods on common categories

02

Can discover and track a wide variety of unknown objects

03

Maintains robust tracking in diverse environments

Abstract

The most common paradigm for vision-based multi-object tracking is tracking-by-detection, due to the availability of reliable detectors for several important object categories such as cars and pedestrians. However, future mobile systems will need a capability to cope with rich human-made environments, in which obtaining detectors for every possible object category would be infeasible. In this paper, we propose a model-free multi-object tracking approach that uses a category-agnostic image segmentation method to track objects. We present an efficient segmentation mask-based tracker which associates pixel-precise masks reported by the segmentation. Our approach can utilize semantic information whenever it is available for classifying objects at the track level, while retaining the capability to track generic unknown objects in the absence of such information. We demonstrate experimentally…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.