Physics-Informed Reward Machines
Daniel Ajeleye, Ashutosh Trivedi, Majid Zamani

TL;DR
This paper introduces physics-informed reward machines (pRMs), a symbolic framework that enhances the expressiveness and efficiency of reinforcement learning by enabling complex reward structures and leveraging counterfactual experiences.
Contribution
The paper proposes pRMs, a novel symbolic reward machine framework that incorporates physical knowledge to improve learning speed and expressiveness in RL tasks.
Findings
pRMs accelerate reward learning in physical environments
Counterfactual experience generation improves sample efficiency
pRMs outperform baseline methods in control tasks
Abstract
Reward machines (RMs) provide a structured way to specify non-Markovian rewards in reinforcement learning (RL), thereby improving both expressiveness and programmability. Viewed more broadly, they separate what is known about the environment, captured by the reward mechanism, from what remains unknown and must be discovered through sampling. This separation supports techniques such as counterfactual experience generation and reward shaping, which reduce sample complexity and speed up learning. We introduce physics-informed reward machines (pRMs), a symbolic machine designed to express complex learning objectives and reward structures for RL agents, thereby enabling more programmable, expressive, and efficient learning. We present RL algorithms capable of exploiting pRMs via counterfactual experiences and reward shaping. Our experimental results show that these techniques accelerate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Time Series Analysis and Forecasting · Model Reduction and Neural Networks
