MSL-RAPTOR: A 6DoF Relative Pose Tracker for Onboard Robotic Perception
Benjamin Ramtoula, Adam Caccavale, Giovanni Beltrame, and Mac Schwager

TL;DR
MSL-RAPTOR is a fast, neural network-based two-stage algorithm for real-time 6DoF relative pose tracking of objects using monocular cameras, suitable for diverse robotics applications.
Contribution
It introduces a novel two-stage approach combining neural detection with UKF-based pose estimation, enabling efficient and generalizable 6DoF tracking without depth sensors.
Findings
Achieves comparable accuracy to RGB-D methods on NOCS-REAL275 dataset.
Outperforms existing methods in speed by a factor of 3 in drone tracking.
Reduces median translation and rotation errors by 66% and 23%, respectively.
Abstract
Determining the relative position and orientation of objects in an environment is a fundamental building block for a wide range of robotics applications. To accomplish this task efficiently in practical settings, a method must be fast, use common sensors, and generalize easily to new objects and environments. We present MSL-RAPTOR, a two-stage algorithm for tracking a rigid body with a monocular camera. The image is first processed by an efficient neural network-based front-end to detect new objects and track 2D bounding boxes between frames. The class label and bounding box is passed to the back-end that updates the object's pose using an unscented Kalman filter (UKF). The measurement posterior is fed back to the 2D tracker to improve robustness. The object's class is identified so a class-specific UKF can be used if custom dynamics and constraints are known. Adapting to track the pose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
