Robot Learning on the Job: Human-in-the-Loop Autonomy and Learning During Deployment
Huihan Liu, Soroush Nasiriany, Lance Zhang, Zhiyao Bao, Yuke Zhu

TL;DR
This paper introduces Sirius, a human-in-the-loop framework for robot learning during deployment that combines human oversight with a new weighted behavioral cloning algorithm, leading to improved performance and efficiency in contact-rich manipulation tasks.
Contribution
The paper presents Sirius, a novel framework integrating human oversight with a new sample re-weighting algorithm for robot learning during deployment, enhancing safety and efficiency.
Findings
Sirius outperforms baselines with 8% and 27% success rate improvements in simulation and hardware.
The approach achieves twice faster convergence than existing methods.
Memory usage is reduced by 85%, enabling more efficient deployment.
Abstract
With the rapid growth of computing powers and recent advances in deep learning, we have witnessed impressive demonstrations of novel robot capabilities in research settings. Nonetheless, these learning systems exhibit brittle generalization and require excessive training data for practical tasks. To harness the capabilities of state-of-the-art robot learning models while embracing their imperfections, we present Sirius, a principled framework for humans and robots to collaborate through a division of work. In this framework, partially autonomous robots are tasked with handling a major portion of decision-making where they work reliably; meanwhile, human operators monitor the process and intervene in challenging situations. Such a human-robot team ensures safe deployments in complex tasks. Further, we introduce a new learning algorithm to improve the policy's performance on the data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAge of Information Optimization · Human-Automation Interaction and Safety · Reinforcement Learning in Robotics
