A Novel Performance Evaluation Methodology for Single-Target Trackers
Matej Kristan, Jiri Matas, Ales Leonardis, Tomas Vojir, Roman, Pflugfelder, Gustavo Fernandez, Georg Nebehay, Fatih Porikli, Luka Cehovin

TL;DR
This paper introduces a new, interpretable evaluation methodology for single-target trackers, including a diverse, fully-annotated dataset and a multi-platform system, demonstrated on the large VOT2014 benchmark.
Contribution
It proposes a simple, statistically sound evaluation framework and a highly diverse annotated dataset for improved tracker comparison.
Findings
Most state-of-the-art trackers outperform baselines
The dataset's visual diversity enhances evaluation robustness
The new visualization aids in comparing tracker performance
Abstract
This paper addresses the problem of single-target tracker performance evaluation. We consider the performance measures, the dataset and the evaluation system to be the most important components of tracker evaluation and propose requirements for each of them. The requirements are the basis of a new evaluation methodology that aims at a simple and easily interpretable tracker comparison. The ranking-based methodology addresses tracker equivalence in terms of statistical significance and practical differences. A fully-annotated dataset with per-frame annotations with several visual attributes is introduced. The diversity of its visual properties is maximized in a novel way by clustering a large number of videos according to their visual attributes. This makes it the most sophistically constructed and annotated dataset to date. A multi-platform evaluation system allowing easy integration of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
