TrackAgent: 6D Object Tracking via Reinforcement Learning

Konstantin R\"ohrl; Dominik Bauer; Timothy Patten; and Markus Vincze

arXiv:2307.15671·cs.CV·July 31, 2023

TrackAgent: 6D Object Tracking via Reinforcement Learning

Konstantin R\"ohrl, Dominik Bauer, Timothy Patten, and Markus Vincze

PDF

Open Access

TL;DR

TrackAgent introduces a reinforcement learning-based method for 6D object pose tracking using point cloud alignment, simplifying the process and reducing data requirements compared to RGB(D)-based approaches.

Contribution

The paper presents a novel RL approach for 6D object tracking that operates on point clouds, enabling effective tracking with limited data and integrated reinitialization strategies.

Findings

01

Effective 6D pose tracking with limited point cloud data

02

Reinforcement learning improves frame-to-model alignment

03

Uncertainty and mask propagation aid reinitialization

Abstract

Tracking an object's 6D pose, while either the object itself or the observing camera is moving, is important for many robotics and augmented reality applications. While exploiting temporal priors eases this problem, object-specific knowledge is required to recover when tracking is lost. Under the tight time constraints of the tracking task, RGB(D)-based methods are often conceptionally complex or rely on heuristic motion models. In comparison, we propose to simplify object tracking to a reinforced point cloud (depth only) alignment task. This allows us to train a streamlined approach from scratch with limited amounts of sparse 3D point clouds, compared to the large datasets of diverse RGBD sequences required in previous works. We incorporate temporal frame-to-frame registration with object-based recovery by frame-to-model refinement using a reinforcement learning (RL) agent that jointly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Vision and Imaging · 3D Shape Modeling and Analysis · Robotics and Sensor-Based Localization