Learning from Human Directional Corrections
Wanxin Jin, Todd D. Murphey, Zehui Lu, Shaoshuai Mou

TL;DR
This paper introduces a new robot learning method that uses only human directional corrections to efficiently learn objectives, reducing over-corrections and effort compared to magnitude-based methods.
Contribution
The proposed approach uniquely learns from directional corrections alone, with theoretical convergence guarantees and demonstrated effectiveness in simulations, user studies, and real-world experiments.
Findings
Higher success rate in learning tasks
Requires fewer human corrections
More accessible and effort-efficient
Abstract
This paper proposes a novel approach that enables a robot to learn an objective function incrementally from human directional corrections. Existing methods learn from human magnitude corrections; since a human needs to carefully choose the magnitude of each correction, those methods can easily lead to over-corrections and learning inefficiency. The proposed method only requires human directional corrections -- corrections that only indicate the direction of an input change without indicating its magnitude. We only assume that each correction, regardless of its magnitude, points in a direction that improves the robot's current motion relative to an unknown objective function. The allowable corrections satisfying this assumption account for half of the input space, as opposed to the magnitude corrections which have to lie in a shrinking level set. For each directional correction, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Robotic Mechanisms and Dynamics · Reinforcement Learning in Robotics
