Physical Reinforcement Learning

Sam Dillavou; Shruti Mishra

arXiv:2511.17789·cs.LG·November 25, 2025

Physical Reinforcement Learning

Sam Dillavou, Shruti Mishra

PDF

Open Access

TL;DR

This paper explores the use of contrastive local learning networks (CLLNs), low-power analog systems, for reinforcement learning, demonstrating their potential advantages and unique features compared to digital hardware.

Contribution

It adapts Q-learning for CLLNs, showing their capability to perform RL tasks and discussing their advantages and biological relevance over digital systems.

Findings

01

Successful implementation of Q-learning on CLLNs for simple RL problems

02

Highlighting the natural fit of policy and value functions in CLLNs

03

Discussion of safety and secondary goals relevant to biological systems

Abstract

Digital computers are power-hungry and largely intolerant of damaged components, making them potentially difficult tools for energy-limited autonomous agents in uncertain environments. Recently developed Contrastive Local Learning Networks (CLLNs) - analog networks of self-adjusting nonlinear resistors - are inherently low-power and robust to physical damage, but were constructed to perform supervised learning. In this work we demonstrate success on two simple RL problems using Q-learning adapted for simulated CLLNs. Doing so makes explicit the components (beyond the network being trained) required to enact various tools in the RL toolbox, some of which (policy function and value function) are more natural in this system than others (replay buffer). We discuss assumptions such as the physical safety that digital hardware requires, CLLNs can forgo, and biological systems cannot rely on,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Reservoir Computing · Reinforcement Learning in Robotics · Advanced Memory and Neural Computing