Bootstrapping Adaptive Human-Machine Interfaces with Offline   Reinforcement Learning

Jensen Gao; Siddharth Reddy; Glen Berseth; Anca D. Dragan; Sergey; Levine

arXiv:2309.03839·cs.RO·September 8, 2023

Bootstrapping Adaptive Human-Machine Interfaces with Offline Reinforcement Learning

Jensen Gao, Siddharth Reddy, Glen Berseth, Anca D. Dragan, Sergey, Levine

PDF

Open Access

TL;DR

This paper introduces a reinforcement learning approach that combines offline pre-training and online fine-tuning to create adaptive human-machine interfaces capable of handling noisy, high-dimensional command signals for tasks like robotic navigation.

Contribution

It presents a novel RL algorithm with a new method for inferring user intent, improving interface adaptability with limited user data and noisy inputs.

Findings

01

Enhanced goal navigation success rate over baseline interfaces.

02

Effective denoising of user commands and shared autonomy.

03

Improved performance in simulated tasks like pushing and lunar lander.

Abstract

Adaptive interfaces can help users perform sequential decision-making tasks like robotic teleoperation given noisy, high-dimensional command signals (e.g., from a brain-computer interface). Recent advances in human-in-the-loop machine learning enable such systems to improve by interacting with users, but tend to be limited by the amount of data that they can collect from individual users in practice. In this paper, we propose a reinforcement learning algorithm to address this by training an interface to map raw command signals to actions using a combination of offline pre-training and online fine-tuning. To address the challenges posed by noisy command signals and sparse rewards, we develop a novel method for representing and inferring the user's long-term intent for a given trajectory. We primarily evaluate our method's ability to assist users who can only communicate through noisy,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces · Gaze Tracking and Assistive Technology · Human-Automation Interaction and Safety