Autonomous Robotic Reinforcement Learning with Asynchronous Human Feedback
Max Balsells, Marcel Torne, Zihan Wang, Samedh Desai, Pulkit Agrawal,, Abhishek Gupta

TL;DR
This paper presents GEAR, a system enabling robots to autonomously learn in real-world environments through asynchronous human feedback, overcoming challenges of reward shaping and reset-free training.
Contribution
The work introduces a practical system that combines human-in-the-loop feedback with self-supervised learning for autonomous, reset-free robotic reinforcement learning in real environments.
Findings
Effective in simulation and real-world tasks
Enables continuous autonomous improvement without resets
Utilizes asynchronous crowdsourced human feedback
Abstract
Ideally, we would place a robot in a real-world environment and leave it there improving on its own by gathering more experience autonomously. However, algorithms for autonomous robotic learning have been challenging to realize in the real world. While this has often been attributed to the challenge of sample complexity, even sample-efficient techniques are hampered by two major challenges - the difficulty of providing well "shaped" rewards, and the difficulty of continual reset-free training. In this work, we describe a system for real-world reinforcement learning that enables agents to show continual improvement by training directly in the real world without requiring painstaking effort to hand-design reward functions or reset mechanisms. Our system leverages occasional non-expert human-in-the-loop feedback from remote users to learn informative distance functions to guide exploration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Data Stream Mining Techniques · Mobile Crowdsensing and Crowdsourcing
