Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning   During Deployment

Huihan Liu; Soroush Nasiriany; Lance Zhang; Zhiyao Bao; Yuke Zhu

arXiv:2211.08416·cs.RO·July 6, 2023

Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment

Huihan Liu, Soroush Nasiriany, Lance Zhang, Zhiyao Bao, Yuke Zhu

PDF

Open Access 1 Datasets

TL;DR

This paper introduces Sirius, a human-in-the-loop framework for robot learning during deployment that combines human oversight with a new weighted behavioral cloning algorithm, leading to improved performance and efficiency in contact-rich manipulation tasks.

Contribution

The paper presents Sirius, a novel framework integrating human oversight with a new sample re-weighting algorithm for robot learning during deployment, enhancing safety and efficiency.

Findings

01

Sirius outperforms baselines with 8% and 27% success rate improvements in simulation and hardware.

02

The approach achieves twice faster convergence than existing methods.

03

Memory usage is reduced by 85%, enabling more efficient deployment.

Abstract

With the rapid growth of computing powers and recent advances in deep learning, we have witnessed impressive demonstrations of novel robot capabilities in research settings. Nonetheless, these learning systems exhibit brittle generalization and require excessive training data for practical tasks. To harness the capabilities of state-of-the-art robot learning models while embracing their imperfections, we present Sirius, a principled framework for humans and robots to collaborate through a division of work. In this framework, partially autonomous robots are tasked with handling a major portion of decision-making where they work reliably; meanwhile, human operators monitor the process and intervene in challenging situations. Such a human-robot team ensures safe deployments in complex tasks. Further, we introduce a new learning algorithm to improve the policy's performance on the data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

lerobot/austin_sirius_dataset
dataset· 2.8k dl
2.8k dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAge of Information Optimization · Human-Automation Interaction and Safety · Reinforcement Learning in Robotics