Spotlighting Task-Relevant Features: Object-Centric Representations for Better Generalization in Robotic Manipulation
Alexandre Chapin (LIRIS), Bruno Machado (LIRIS), Emmanuel Dellandr\'ea (LIRIS), Liming Chen (LIRIS)

TL;DR
This paper introduces Slot-Based Object-Centric Representations (SBOCR) for robotic manipulation, demonstrating that they improve generalization over traditional global and dense features under various visual changes.
Contribution
The work proposes SBOCR as an intermediate structured visual representation, showing its effectiveness in enhancing generalization in robotic manipulation tasks.
Findings
SBOCR outperforms dense and global features in generalization tests.
SBOCR maintains task performance under lighting, texture, and distractor variations.
No task-specific pretraining needed for SBOCR to be effective.
Abstract
The generalization capabilities of robotic manipulation policies are heavily influenced by the choice of visual representations. Existing approaches typically rely on representations extracted from pre-trained encoders, using two dominant types of features: global features, which summarize an entire image via a single pooled vector, and dense features, which preserve a patch-wise embedding from the final encoder layer. While widely used, both feature types mix task-relevant and irrelevant information, leading to poor generalization under distribution shifts, such as changes in lighting, textures, or the presence of distractors. In this work, we explore an intermediate structured alternative: Slot-Based Object-Centric Representations (SBOCR), which group dense features into a finite set of object-like entities. This representation permits to naturally reduce the noise provided to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Modular Robots and Swarm Intelligence · Reinforcement Learning in Robotics
