Siam R-CNN: Visual Tracking by Re-Detection

Paul Voigtlaender, Jonathon Luiten, Philip H.S. Torr, and Bastian Leibe

arXiv:1911.12836 · cs.CV · April 3, 2020

TL;DR

Siam R-CNN introduces a Siamese re-detection architecture combined with dynamic programming and hard example mining, significantly improving long-term visual object tracking performance across multiple benchmarks.

Contribution

The paper presents Siam R-CNN, a novel architecture that leverages re-detection, dynamic programming, and hard example mining for enhanced long-term object tracking.

Findings

1. Achieves state-of-the-art results on ten tracking benchmarks.
2. Excels particularly in long-term tracking scenarios.
3. Demonstrates robustness to distractors and occlusions.

Abstract

We present Siam R-CNN, a Siamese re-detection architecture which unleashes the full power of two-stage object detection approaches for visual object tracking. We combine this with a novel tracklet-based dynamic programming algorithm, which takes advantage of re-detections of both the first-frame template and previous-frame predictions, to model the full history of both the object to be tracked and potential distractor objects. This enables our approach to make better tracking decisions, as well as to re-detect tracked objects after long occlusion. Finally, we propose a novel hard example mining strategy to improve Siam R-CNN's robustness to similar looking objects. Siam R-CNN achieves the current best performance on ten tracking benchmarks, with especially strong results for long-term tracking. We make our code and models available at www.vision.rwth-aachen.de/page/siamrcnn.
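The tracklet-based dynamic programming idea in the abstract can be illustrated with a small sketch. This is not the authors' algorithm or scoring function. It is a simplified stand-in that assumes each tracklet already carries a single similarity score to the first-frame template, and that linking two tracklets costs a penalty proportional to the spatial jump between them. The names `Tracklet`, `best_chain`, and `jump_penalty` are hypothetical and chosen only for this example.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class Tracklet:
    """A short run of detections in consecutive frames (hypothetical structure)."""
    start: int                      # first frame index
    end: int                        # last frame index, inclusive
    score: float                    # assumed similarity to the first-frame template
    first_pos: Tuple[float, float]  # box center in the first frame of the tracklet
    last_pos: Tuple[float, float]   # box center in the last frame of the tracklet


def best_chain(tracklets: List[Tracklet],
               jump_penalty: float = 0.01) -> Tuple[List[int], float]:
    """Pick the highest-scoring chain of temporally non-overlapping tracklets.

    Illustrative scoring only: sum of tracklet scores minus a penalty on the
    L1 distance between the end of one tracklet and the start of the next.
    """
    if not tracklets:
        return [], 0.0

    # Process tracklets in order of end frame so every valid predecessor
    # has already been solved when we reach a tracklet.
    order = sorted(range(len(tracklets)), key=lambda i: tracklets[i].end)
    best = {}                        # best chain score ending at tracklet i
    prev: dict = {}                  # back-pointer to the previous tracklet

    for i in order:
        cur = tracklets[i]
        best[i] = cur.score          # chain consisting of this tracklet alone
        prev[i] = None
        for j in order:
            if j == i:
                break
            cand_prev = tracklets[j]
            if cand_prev.end < cur.start:   # no temporal overlap
                jump = (abs(cand_prev.last_pos[0] - cur.first_pos[0])
                        + abs(cand_prev.last_pos[1] - cur.first_pos[1]))
                cand = best[j] + cur.score - jump_penalty * jump
                if cand > best[i]:
                    best[i] = cand
                    prev[i] = j

    # Backtrack from the best-scoring chain end.
    idx: Optional[int] = max(best, key=best.get)
    total = best[idx]
    chain = []
    while idx is not None:
        chain.append(idx)
        idx = prev[idx]
    return chain[::-1], total
```

In this toy formulation, a target that disappears and is re-detected later can be linked across the gap, while a long-lived distractor elsewhere in the image forms its own lower-scoring chain. The scoring in the paper is richer than this, so the sketch should be read only as an illustration of chaining tracklets with dynamic programming.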

Code & Models

Repositories

VisualComputingInstitute/SiamR-CNN (TensorFlow)

Videos

Siam R-CNN: Visual Tracking by Re-Detection (YouTube)