P3-PO: Prescriptive Point Priors for Visuo-Spatial Generalization of   Robot Policies

Mara Levy; Siddhant Haldar; Lerrel Pinto; Abhinav Shirivastava

arXiv:2412.06784·cs.RO·December 10, 2024

P3-PO: Prescriptive Point Priors for Visuo-Spatial Generalization of Robot Policies

Mara Levy, Siddhant Haldar, Lerrel Pinto, Abhinav Shirivastava

PDF

Open Access

TL;DR

P3-PO introduces a novel environment representation using human-prescribed points propagated through vision models, significantly enhancing the generalization of robot policies to new objects and cluttered environments.

Contribution

The paper presents P3-PO, a new framework that leverages human annotations and vision models to improve out-of-distribution generalization in robot manipulation policies.

Findings

01

43% improvement over prior methods in real-world tasks

02

58% gain on new object instances

03

80% gain in cluttered environments

Abstract

Developing generalizable robot policies that can robustly handle varied environmental conditions and object instances remains a fundamental challenge in robot learning. While considerable efforts have focused on collecting large robot datasets and developing policy architectures to learn from such data, naively learning from visual inputs often results in brittle policies that fail to transfer beyond the training data. This work presents Prescriptive Point Priors for Policies or P3-PO, a novel framework that constructs a unique state representation of the environment leveraging recent advances in computer vision and robot learning to achieve improved out-of-distribution generalization for robot manipulation. This representation is obtained through two steps. First, a human annotator prescribes a set of semantically meaningful points on a single demonstration frame. These points are then…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotics and Sensor-Based Localization · Robotic Mechanisms and Dynamics

MethodsSparse Evolutionary Training