Physics-Informed Reward Machines

Daniel Ajeleye; Ashutosh Trivedi; Majid Zamani

arXiv:2508.14093·cs.LG·August 21, 2025

Physics-Informed Reward Machines

Daniel Ajeleye, Ashutosh Trivedi, Majid Zamani

PDF

Open Access

TL;DR

This paper introduces physics-informed reward machines (pRMs), a symbolic framework that enhances the expressiveness and efficiency of reinforcement learning by enabling complex reward structures and leveraging counterfactual experiences.

Contribution

The paper proposes pRMs, a novel symbolic reward machine framework that incorporates physical knowledge to improve learning speed and expressiveness in RL tasks.

Findings

01

pRMs accelerate reward learning in physical environments

02

Counterfactual experience generation improves sample efficiency

03

pRMs outperform baseline methods in control tasks

Abstract

Reward machines (RMs) provide a structured way to specify non-Markovian rewards in reinforcement learning (RL), thereby improving both expressiveness and programmability. Viewed more broadly, they separate what is known about the environment, captured by the reward mechanism, from what remains unknown and must be discovered through sampling. This separation supports techniques such as counterfactual experience generation and reward shaping, which reduce sample complexity and speed up learning. We introduce physics-informed reward machines (pRMs), a symbolic machine designed to express complex learning objectives and reward structures for RL agents, thereby enabling more programmable, expressive, and efficient learning. We present RL algorithms capable of exploiting pRMs via counterfactual experiences and reward shaping. Our experimental results show that these techniques accelerate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Time Series Analysis and Forecasting · Model Reduction and Neural Networks