Towards neural reinforcement learning for large deviations in nonequilibrium systems with memory

Venkata D. Pamulaparthy; Rosemary J. Harris

arXiv:2501.12333·cond-mat.stat-mech·March 9, 2026

Towards neural reinforcement learning for large deviations in nonequilibrium systems with memory

Venkata D. Pamulaparthy, Rosemary J. Harris

PDF

Open Access

TL;DR

This paper develops a neural reinforcement learning approach for analyzing large deviations in non-Markovian systems, especially semi-Markov processes, by extending actor-critic methods with memory processing neural networks.

Contribution

It introduces a novel neural reinforcement learning framework that incorporates memory variables for studying fluctuations in non-Markovian systems.

Findings

01

Successfully applied to current fluctuations in semi-Markov models

02

Demonstrates effectiveness in handling nonexponential waiting times

03

Extends existing actor-critic methods with memory neural networks

Abstract

We introduce a reinforcement learning method for a class of non-Markov systems; our approach extends the actor-critic framework given by Rose et al. [New J. Phys. 23 013013 (2021)] for obtaining scaled cumulant generating functions characterizing the fluctuations. The actor-critic is implemented using neural networks; a particular innovation in our method is the use of an additional neural policy for processing memory variables. We demonstrate results for current fluctuations in various memory-dependent models with special focus on semi-Markov systems where the dynamics is controlled by nonexponential interevent waiting time distributions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Thermodynamics and Statistical Mechanics