Sample Efficient Robot Learning with Structured World Models

Tuluhan Akbulut; Max Merlin; Shane Parr; Benedict Quartey; Skye; Thompson

arXiv:2210.12278·cs.RO·October 25, 2022

Sample Efficient Robot Learning with Structured World Models

Tuluhan Akbulut, Max Merlin, Shane Parr, Benedict Quartey, Skye, Thompson

PDF

Open Access

TL;DR

This paper investigates how structured world models, especially using keypoints, enhance sample efficiency and performance in robotic cloth-folding tasks, demonstrating significant improvements over unstructured observations.

Contribution

It introduces the use of structured feature spaces like keypoints in world models for robotic manipulation, showing improved efficiency and performance in cloth-folding tasks.

Findings

01

Keypoints increased task performance by 50%.

02

Structured feature spaces improved sample efficiency.

03

State transition predictor had no notable effect.

Abstract

Reinforcement learning has been demonstrated as a flexible and effective approach for learning a range of continuous control tasks, such as those used by robots to manipulate objects in their environment. But in robotics particularly, real-world rollouts are costly, and sample efficiency can be a major limiting factor when learning a new skill. In game environments, the use of world models has been shown to improve sample efficiency while still achieving good performance, especially when images or other rich observations are provided. In this project, we explore the use of a world model in a deformable robotic manipulation task, evaluating its effect on sample efficiency when learning to fold a cloth in simulation. We compare the use of RGB image observation with a feature space leveraging built-in structure (keypoints representing the cloth configuration), a common approach in robot…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Machine Learning and Data Classification · Machine Learning and Algorithms