Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning
Jensen Gao, Siddharth Reddy, Glen Berseth, Anca D. Dragan, Sergey, Levine

TL;DR
This paper introduces a reinforcement learning approach that combines offline pre-training and online fine-tuning to create adaptive human-machine interfaces capable of handling noisy, high-dimensional command signals for tasks like robotic navigation.
Contribution
It presents a novel RL algorithm with a new method for inferring user intent, improving interface adaptability with limited user data and noisy inputs.
Findings
Enhanced goal navigation success rate over baseline interfaces.
Effective denoising of user commands and shared autonomy.
Improved performance in simulated tasks like pushing and lunar lander.
Abstract
Adaptive interfaces can help users perform sequential decision-making tasks like robotic teleoperation given noisy, high-dimensional command signals (e.g., from a brain-computer interface). Recent advances in human-in-the-loop machine learning enable such systems to improve by interacting with users, but tend to be limited by the amount of data that they can collect from individual users in practice. In this paper, we propose a reinforcement learning algorithm to address this by training an interface to map raw command signals to actions using a combination of offline pre-training and online fine-tuning. To address the challenges posed by noisy command signals and sparse rewards, we develop a novel method for representing and inferring the user's long-term intent for a given trajectory. We primarily evaluate our method's ability to assist users who can only communicate through noisy,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEEG and Brain-Computer Interfaces · Gaze Tracking and Assistive Technology · Human-Automation Interaction and Safety
