Measuring and Controlling Solution Degeneracy across Task-Trained Recurrent Neural Networks
Ann Huang, Satpreet H. Singh, Flavio Martinelli, Kanaka Rajan

TL;DR
This paper introduces a framework to quantify and control solution degeneracy in task-trained RNNs, revealing how factors like task complexity and regularization influence internal variability across behavior, neural dynamics, and weights.
Contribution
It provides a systematic method to analyze and manipulate solution degeneracy in RNNs across multiple levels, validated on a large set of trained networks.
Findings
Higher task complexity reduces degeneracy in neural dynamics.
Larger networks and regularization decrease degeneracy across all levels.
Regularization and network size influence the variability of RNN solutions.
Abstract
Task-trained recurrent neural networks (RNNs) are widely used in neuroscience and machine learning to model dynamical computations. To gain mechanistic insight into how neural systems solve tasks, prior work often reverse-engineers individual trained networks. However, different RNNs trained on the same task and achieving similar performance can exhibit strikingly different internal solutions, a phenomenon known as solution degeneracy. Here, we develop a unified framework to systematically quantify and control solution degeneracy across three levels: behavior, neural dynamics, and weight space. We apply this framework to 3,400 RNNs trained on four neuroscience-relevant tasks: flip-flop memory, sine wave generation, delayed discrimination, and path integration, while systematically varying task complexity, learning regime, network size, and regularization. We find that higher task…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
