TL;DR
This paper introduces Depth-Track, a large RGBD tracking dataset, and demonstrates that deep RGBD tracking models benefit significantly from training on genuine RGBD data, advancing the field.
Contribution
The work provides a new, extensive RGBD tracking dataset and establishes a baseline showing the importance of dedicated RGBD training data for improved tracking performance.
Findings
Deep RGBD trackers outperform RGB-only trackers when trained on Depth-Track.
The new dataset contains 200 sequences, 40 scene types, and 90 objects, doubling previous datasets.
Training on genuine RGBD data significantly improves tracking accuracy.
Abstract
RGBD (RGB plus depth) object tracking is gaining momentum as RGBD sensors have become popular in many application fields such as robotics.However, the best RGBD trackers are extensions of the state-of-the-art deep RGB trackers. They are trained with RGB data and the depth channel is used as a sidekick for subtleties such as occlusion detection. This can be explained by the fact that there are no sufficiently large RGBD datasets to 1) train deep depth trackers and to 2) challenge RGB trackers with sequences for which the depth cue is essential. This work introduces a new RGBD tracking dataset - Depth-Track - that has twice as many sequences (200) and scene types (40) than in the largest existing dataset, and three times more objects (90). In addition, the average length of the sequences (1473), the number of deformable objects (16) and the number of annotated tracking attributes (15)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
