Object-Centric Representations Improve Policy Generalization in Robot Manipulation

Alexandre Chapin (imagine); Bruno Machado (imagine); Emmanuel Dellandrea (imagine); Liming Chen (imagine)

arXiv:2505.11563·cs.RO·May 20, 2025

Object-Centric Representations Improve Policy Generalization in Robot Manipulation

Alexandre Chapin (imagine), Bruno Machado (imagine), Emmanuel Dellandrea (imagine), Liming Chen (imagine)

PDF

Open Access

TL;DR

This paper demonstrates that object-centric visual representations significantly enhance the generalization of robotic manipulation policies across diverse visual conditions, outperforming traditional dense and global features.

Contribution

It introduces the use of object-centric representations for robotic manipulation and benchmarks their effectiveness against other visual encodings in various tasks.

Findings

01

OCR-based policies outperform dense and global representations in generalization.

02

Object-centric representations maintain robustness under lighting, texture, and distractor variations.

03

OCR achieves these improvements without task-specific pretraining.

Abstract

Visual representations are central to the learning and generalization capabilities of robotic manipulation policies. While existing methods rely on global or dense features, such representations often entangle task-relevant and irrelevant scene information, limiting robustness under distribution shifts. In this work, we investigate object-centric representations (OCR) as a structured alternative that segments visual input into a finished set of entities, introducing inductive biases that align more naturally with manipulation tasks. We benchmark a range of visual encoders-object-centric, global and dense methods-across a suite of simulated and real-world manipulation tasks ranging from simple to complex, and evaluate their generalization under diverse visual conditions including changes in lighting, texture, and the presence of distractors. Our findings reveal that OCR-based policies…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Explainable Artificial Intelligence (XAI)

MethodsSparse Evolutionary Training · ALIGN