RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through   Embedding Predictive Pre-Training

Raktim Gautam Goswami; Prashanth Krishnamurthy; Yann LeCun; Farshad; Khorrami

arXiv:2411.17662·cs.RO·May 5, 2025

RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-Training

Raktim Gautam Goswami, Prashanth Krishnamurthy, Yann LeCun, Farshad, Khorrami

PDF

Open Access 1 Repo

TL;DR

RoboPEPP introduces a self-supervised, masking-based pre-training approach that leverages the robot's physical model to improve vision-based pose and joint angle estimation, especially under occlusions.

Contribution

The paper presents RoboPEPP, a novel embedding predictive pre-training method that enhances encoder understanding of robot structure for better pose estimation.

Findings

01

Achieves state-of-the-art accuracy in robot pose and joint angle estimation.

02

Demonstrates robustness to occlusions and truncations.

03

Requires less execution time than existing methods.

Abstract

Vision-based pose estimation of articulated robots with unknown joint angles has applications in collaborative robotics and human-robot interaction tasks. Current frameworks use neural network encoders to extract image features and downstream layers to predict joint angles and robot pose. While images of robots inherently contain rich information about the robot's physical structures, existing methods often fail to leverage it fully; therefore, limiting performance under occlusions and truncations. To address this, we introduce RoboPEPP, a method that fuses information about the robot's physical model into the encoder using a masking-based self-supervised embedding-predictive architecture. Specifically, we mask the robot's joints and pre-train an encoder-predictor model to infer the joints' embeddings from surrounding unmasked regions, enhancing the encoder's understanding of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

raktimgg/robopepp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Soft Robotics and Applications