Autonomous Robotic Reinforcement Learning with Asynchronous Human   Feedback

Max Balsells; Marcel Torne; Zihan Wang; Samedh Desai; Pulkit Agrawal,; Abhishek Gupta

arXiv:2310.20608·cs.LG·November 1, 2023·1 cites

Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback

Max Balsells, Marcel Torne, Zihan Wang, Samedh Desai, Pulkit Agrawal,, Abhishek Gupta

PDF

Open Access

TL;DR

This paper presents GEAR, a system enabling robots to autonomously learn in real-world environments through asynchronous human feedback, overcoming challenges of reward shaping and reset-free training.

Contribution

The work introduces a practical system that combines human-in-the-loop feedback with self-supervised learning for autonomous, reset-free robotic reinforcement learning in real environments.

Findings

01

Effective in simulation and real-world tasks

02

Enables continuous autonomous improvement without resets

03

Utilizes asynchronous crowdsourced human feedback

Abstract

Ideally, we would place a robot in a real-world environment and leave it there improving on its own by gathering more experience autonomously. However, algorithms for autonomous robotic learning have been challenging to realize in the real world. While this has often been attributed to the challenge of sample complexity, even sample-efficient techniques are hampered by two major challenges - the difficulty of providing well "shaped" rewards, and the difficulty of continual reset-free training. In this work, we describe a system for real-world reinforcement learning that enables agents to show continual improvement by training directly in the real world without requiring painstaking effort to hand-design reward functions or reset mechanisms. Our system leverages occasional non-expert human-in-the-loop feedback from remote users to learn informative distance functions to guide exploration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Mobile Crowdsensing and Crowdsourcing