Neural Policy Composition from Free Energy Minimization
Francesca Rossi, Veronica Centorrino, Francesco Bullo, Giovanni Russo

TL;DR
This paper introduces a normative framework based on free energy minimization for policy composition, providing a neural implementation and demonstrating its effectiveness across various behavioral and control tasks.
Contribution
It presents a novel free energy-based objective for policy gating, with a neural circuit implementation and convergence guarantees, applicable to diverse settings.
Findings
Model converges to optimal policy composition with explicit rate.
Neural implementation as a soft-competitive recurrent circuit.
Outperforms or matches established models in multiple tasks.
Abstract
The ability to flexibly compose previously acquired skills to execute intelligent behaviors is a hallmark of natural intelligence. Such compositional flexibility is often attributed to context-dependent gating mechanisms that determine how multiple policies or behavioral primitives are combined. Yet, despite remarkable efforts, the normative objective from which such gating rules should arise, and the neural computations capable of implementing them, remain unclear. Existing approaches typically rely on prespecified design choices for the gating rules, and remain tied to specific architectures, learning paradigms, or datasets. Here, we introduce a normative framework in which policy composition emerges from the minimization of a variational free energy, providing a principled and broadly applicable objective for gating. Based on this framework, we derive a continuous-time gradient flow…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural dynamics and brain function · Reinforcement Learning in Robotics · Advanced Memory and Neural Computing
