Visual Object Tracking in First Person Vision
Matteo Dunnhofer, Antonino Furnari, Giovanni Maria Farinella,, Christian Micheloni

TL;DR
This paper systematically evaluates 42 visual object trackers in First Person Vision, revealing new challenges and opportunities for improving tracking algorithms tailored to FPV scenarios.
Contribution
It provides the first comprehensive analysis of tracking algorithms in FPV, introduces a new dataset, and highlights key challenges and future research directions.
Findings
Current trackers face new challenges in FPV environments.
FPV-specific factors significantly affect tracking performance.
Trackers improve performance in downstream FPV tasks despite difficulties.
Abstract
The understanding of human-object interactions is fundamental in First Person Vision (FPV). Visual tracking algorithms which follow the objects manipulated by the camera wearer can provide useful information to effectively model such interactions. In the last years, the computer vision community has significantly improved the performance of tracking algorithms for a large variety of target objects and scenarios. Despite a few previous attempts to exploit trackers in the FPV domain, a methodical analysis of the performance of state-of-the-art trackers is still missing. This research gap raises the question of whether current solutions can be used ``off-the-shelf'' or more domain-specific investigations should be carried out. This paper aims to provide answers to such questions. We present the first systematic investigation of single object tracking in FPV. Our study extensively analyses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsVideo Surveillance and Tracking Methods · Visual Attention and Saliency Detection · Human Pose and Action Recognition
