From Data-Fitting to Discovery: Interpreting the Neural Dynamics of   Motor Control through Reinforcement Learning

Eugene R. Rush; Kaushik Jayaram; J. Sean Humbert

arXiv:2305.11107·q-bio.NC·May 19, 2023·2 cites

From Data-Fitting to Discovery: Interpreting the Neural Dynamics of Motor Control through Reinforcement Learning

Eugene R. Rush, Kaushik Jayaram, J. Sean Humbert

PDF

Open Access

TL;DR

This paper investigates neural dynamics in reinforcement learning models of legged locomotion, revealing structured, less tangled neural trajectories that align with biological findings in primate movement.

Contribution

It demonstrates that embodied agents trained for walking exhibit neural dynamics similar to biological systems, bridging the gap between data-fitting models and biological plausibility.

Findings

01

Neural trajectories in the agent are less tangled in recurrent layers.

02

Identified speed axes that maximize variance in neural activity.

03

Structured neural activity supports experimental primate movement data.

Abstract

In motor neuroscience, artificial recurrent neural networks models often complement animal studies. However, most modeling efforts are limited to data-fitting, and the few that examine virtual embodied agents in a reinforcement learning context, do not draw direct comparisons to their biological counterparts. Our study addressing this gap, by uncovering structured neural activity of a virtual robot performing legged locomotion that directly support experimental findings of primate walking and cycling. We find that embodied agents trained to walk exhibit smooth dynamics that avoid tangling -- or opposing neural trajectories in neighboring neural space -- a core principle in computational neuroscience. Specifically, across a wide suite of gaits, the agent displays neural trajectories in the recurrent layers are less tangled than those in the input-driven actuation layers. To better…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRobotic Locomotion and Control · Muscle activation and electromyography studies · Reinforcement Learning in Robotics

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings