Mechanistic Foundations of Goal-Directed Control
Alma Lago

TL;DR
This paper extends mechanistic interpretability to embodied control systems, demonstrating how control strategies emerge and compete during infant motor learning through phase transitions and circuit formation governed by inductive biases.
Contribution
It introduces a mechanistic framework for sensorimotor development, revealing how control circuits form and compete, with insights into phase transitions and critical parameters like context window size.
Findings
Identification of causal control circuits from inductive biases
Demonstration of phase transition in arbitration gate behavior
Critical role of context window size in circuit formation
Abstract
Mechanistic interpretability has transformed the analysis of transformer circuits by decomposing model behavior into competing algorithms, identifying phase transitions during training, and deriving closed-form predictions for when and why strategies shift. However, this program has remained largely confined to sequence-prediction architectures, leaving embodied control systems without comparable mechanistic accounts. Here we extend this framework to sensorimotor-cognitive development, using infant motor learning as a model system. We show that foundational inductive biases give rise to causal control circuits, with learned gating mechanisms converging toward theoretically motivated uncertainty thresholds. The resulting dynamics reveal a clean phase transition in the arbitration gate whose commitment behavior is well described by a closed-form exponential moving-average surrogate. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEmbodied and Extended Cognition · Motor Control and Adaptation · Child and Animal Learning Development
