Reward-Based Environment States for Robot Manipulation Policy Learning
C\'ed\'erick Mouliets, Isabelle Ferran\'e, Heriberto Cuay\'ahuitl

TL;DR
This paper introduces a reward-based state representation for robot manipulation that improves policy learning efficiency, achieving up to 97% success in simulated tasks with deep reinforcement learning.
Contribution
The paper proposes a novel, compact state representation based on predicted rewards from an image classifier, enhancing robot manipulation policy learning.
Findings
Achieved up to 97% task success in simulation
Effective with deep reinforcement learning algorithms
Simplifies state representation for manipulation tasks
Abstract
Training robot manipulation policies is a challenging and open problem in robotics and artificial intelligence. In this paper we propose a novel and compact state representation based on the rewards predicted from an image-based task success classifier. Our experiments, using the Pepper robot in simulation with two deep reinforcement learning algorithms on a grab-and-lift task, reveal that our proposed state representation can achieve up to 97% task success using our best policies.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Artificial Intelligence in Games
