Learning from Human Directional Corrections

Wanxin Jin; Todd D. Murphey; Zehui Lu; Shaoshuai Mou

arXiv:2011.15014·cs.RO·August 8, 2022

Learning from Human Directional Corrections

Wanxin Jin, Todd D. Murphey, Zehui Lu, Shaoshuai Mou

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new robot learning method that uses only human directional corrections to efficiently learn objectives, reducing over-corrections and effort compared to magnitude-based methods.

Contribution

The proposed approach uniquely learns from directional corrections alone, with theoretical convergence guarantees and demonstrated effectiveness in simulations, user studies, and real-world experiments.

Findings

01

Higher success rate in learning tasks

02

Requires fewer human corrections

03

More accessible and effort-efficient

Abstract

This paper proposes a novel approach that enables a robot to learn an objective function incrementally from human directional corrections. Existing methods learn from human magnitude corrections; since a human needs to carefully choose the magnitude of each correction, those methods can easily lead to over-corrections and learning inefficiency. The proposed method only requires human directional corrections -- corrections that only indicate the direction of an input change without indicating its magnitude. We only assume that each correction, regardless of its magnitude, points in a direction that improves the robot's current motion relative to an unknown objective function. The allowable corrections satisfying this assumption account for half of the input space, as opposed to the magnitude corrections which have to lie in a shrinking level set. For each directional correction, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wanxinjin/Learning-from-Directional-Corrections
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobot Manipulation and Learning · Robotic Mechanisms and Dynamics · Reinforcement Learning in Robotics