Enhancing Goal Inference via Correction Timing
Anjiabei Wang, Shuangge Wang, Tesca Fitzgerald

TL;DR
This paper explores how the timing of human corrections to robots can serve as a valuable signal for improving robot learning, especially in identifying motion features, inferring goals, and refining task constraints.
Contribution
It introduces the novel idea that correction timing, beyond the correction itself, can enhance robot learning and demonstrates its effectiveness in three key applications.
Findings
Correction timing improves identification of motion features prompting corrections.
Timing helps quickly infer the human's correction goal.
Timing enhances learning of more precise task constraints.
Abstract
Corrections offer a natural modality for people to provide feedback to a robot, by (i) intervening in the robot's behavior when they believe the robot is failing (or will fail) the task objectives and (ii) modifying the robot's behavior to successfully fulfill the task. Each correction offers information on what the robot should and should not do, where the corrected behavior is more aligned with task objectives than the original behavior. Most prior work on learning from corrections involves interpreting a correction as a new demonstration (consisting of the modified robot behavior), or a preference (for the modified trajectory compared to the robot's original behavior). However, this overlooks one essential element of the correction feedback, which is the human's decision to intervene in the robot's behavior in the first place. This decision can be influenced by multiple factors…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Social Robot Interaction and HRI · Reinforcement Learning in Robotics
