Neuroprosthetic decoder training as imitation learning
Josh Merel, David Carlson, Liam Paninski, John P. Cunningham

TL;DR
This paper introduces a novel imitation learning framework for training neuroprosthetic decoders, enabling scalable and goal-oriented control of complex effectors like robotic arms using brain-computer interfaces.
Contribution
It adapts the dataset aggregation (DAgger) imitation learning algorithm to brain-computer interfaces and combines it with optimal control for scalable neuroprosthetic training.
Findings
Derived new regret bounds for BCI algorithms
Demonstrated control of a 26-DOF robotic arm in simulation
Provided a scalable framework for naturalistic neuroprosthetic control
Abstract
Neuroprosthetic brain-computer interfaces function via an algorithm which decodes neural activity of the user into movements of an end effector, such as a cursor or robotic arm. In practice, the decoder is often learned by updating its parameters while the user performs a task. When the user's intention is not directly observable, recent methods have demonstrated value in training the decoder against a surrogate for the user's intended movement. We describe how training a decoder in this way is a novel variant of an imitation learning problem, where an oracle or expert is employed for supervised training in lieu of direct observations, which are not available. Specifically, we describe how a generic imitation learning meta-algorithm, dataset aggregation (DAgger, [1]), can be adapted to train a generic brain-computer interface. By deriving existing learning algorithms for brain-computer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
