Reward-Based Environment States for Robot Manipulation Policy Learning

Cédérick Mouliets; Isabelle Ferrané; Heriberto Cuayáhuitl

arXiv:2112.05621·cs.RO·December 13, 2021

TL;DR

This paper introduces a compact, reward-based state representation for robot manipulation policy learning. With deep reinforcement learning on a simulated grab-and-lift task using the Pepper robot, the best policies reach up to 97% task success.

Contribution

The paper proposes a novel, compact state representation based on predicted rewards from an image classifier, enhancing robot manipulation policy learning.

Findings

01 Achieved up to 97% task success in simulation

02 Effective with deep reinforcement learning algorithms

03 Simplifies state representation for manipulation tasks

Abstract

Training robot manipulation policies is a challenging and open problem in robotics and artificial intelligence. In this paper we propose a novel and compact state representation based on the rewards predicted from an image-based task success classifier. Our experiments, using the Pepper robot in simulation with two deep reinforcement learning algorithms on a grab-and-lift task, reveal that our proposed state representation can achieve up to 97% task success using our best policies.
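The idea of building a state from classifier-predicted rewards can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the paper: `toy_success_classifier` is a hypothetical stand-in that uses mean pixel intensity in place of a trained image model, and the choice to keep a short window of recent predicted rewards (the `history` parameter) is one plausible way to form a compact state vector, not necessarily the authors' design.

```python
from collections import deque
from typing import Callable, Deque, List, Sequence

# Grayscale image as rows of pixel intensities in [0, 1] (illustrative type).
Image = Sequence[Sequence[float]]


def toy_success_classifier(image: Image) -> float:
    """Hypothetical stand-in for a trained task-success classifier.

    Returns the mean pixel intensity as a fake success score in [0, 1].
    A real system would run a trained image model here instead.
    """
    pixels = [p for row in image for p in row]
    return sum(pixels) / len(pixels)


class RewardBasedState:
    """Keeps the last `history` predicted rewards as a compact state vector.

    The window length is an assumption of this sketch, not a value from
    the paper.
    """

    def __init__(self, classifier: Callable[[Image], float], history: int = 4):
        self.classifier = classifier
        # Start with zeros so the state always has a fixed length.
        self.rewards: Deque[float] = deque([0.0] * history, maxlen=history)

    def update(self, image: Image) -> List[float]:
        """Score the new image and return the updated state vector."""
        reward = self.classifier(image)
        self.rewards.append(reward)  # the oldest reward drops out automatically
        return list(self.rewards)


if __name__ == "__main__":
    state = RewardBasedState(toy_success_classifier, history=3)
    dark = [[0.0, 0.0], [0.0, 0.0]]
    bright = [[1.0, 1.0], [1.0, 1.0]]
    print(state.update(dark))
    print(state.update(bright))
```

In a setup like this, the policy would receive the short reward vector rather than raw pixels, which is what makes the representation compact; how the actual paper combines predicted rewards with other signals is not specified in the abstract.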

Taxonomy

Topics: Reinforcement Learning in Robotics · Robot Manipulation and Learning · Artificial Intelligence in Games