Neuroprosthetic decoder training as imitation learning

Josh Merel; David Carlson; Liam Paninski; John P. Cunningham

arXiv:1511.04156·stat.ML·September 28, 2016

Neuroprosthetic decoder training as imitation learning

Josh Merel, David Carlson, Liam Paninski, John P. Cunningham

PDF

TL;DR

This paper introduces a novel imitation learning framework for training neuroprosthetic decoders, enabling scalable and goal-oriented control of complex effectors like robotic arms using brain-computer interfaces.

Contribution

It adapts the dataset aggregation (DAgger) imitation learning algorithm to brain-computer interfaces and combines it with optimal control for scalable neuroprosthetic training.

Findings

01

Derived new regret bounds for BCI algorithms

02

Demonstrated control of a 26-DOF robotic arm in simulation

03

Provided a scalable framework for naturalistic neuroprosthetic control

Abstract

Neuroprosthetic brain-computer interfaces function via an algorithm which decodes neural activity of the user into movements of an end effector, such as a cursor or robotic arm. In practice, the decoder is often learned by updating its parameters while the user performs a task. When the user's intention is not directly observable, recent methods have demonstrated value in training the decoder against a surrogate for the user's intended movement. We describe how training a decoder in this way is a novel variant of an imitation learning problem, where an oracle or expert is employed for supervised training in lieu of direct observations, which are not available. Specifically, we describe how a generic imitation learning meta-algorithm, dataset aggregation (DAgger, [1]), can be adapted to train a generic brain-computer interface. By deriving existing learning algorithms for brain-computer…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.