Learning to Arbitrate Human and Robot Control using Disagreement between Sub-Policies
Yoojin Oh, Marc Toussaint, Jim Mainprice

TL;DR
This paper introduces a reinforcement learning approach for optimal arbitration in teleoperation, dynamically balancing human and robot control at decision points to improve performance and safety.
Contribution
It presents a novel RL-based arbitration method that detects decision points via mixture distribution modality and adapts control blending accordingly.
Findings
Outperforms direct control in teleoperation tasks.
Enhances safety and accuracy in human-robot control.
Flexible control blending improves user performance.
Abstract
In the context of teleoperation, arbitration refers to deciding how to blend between human and autonomous robot commands. We present a reinforcement learning solution that learns an optimal arbitration strategy that allocates more control authority to the human when the robot comes across a decision point in the task. A decision point is where the robot encounters multiple options (sub-policies), such as having multiple paths to get around an obstacle or deciding between two candidate goals. By expressing each directional sub-policy as a von Mises distribution, we identify the decision points by observing the modality of the mixture distribution. Our reward function reasons on this modality and prioritizes to match its learned policy to either the user or the robot accordingly. We report teleoperation experiments on reach-and-grasping objects using a robot manipulator arm with different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobot Manipulation and Learning · Reinforcement Learning in Robotics · EEG and Brain-Computer Interfaces
