Physics-Informed Policy Optimization via Analytic Dynamics Regularization

Namai Chandra; Liu Mohan; Zhihao Gu; Lin Wang

arXiv:2603.14469·cs.RO·March 24, 2026

Physics-Informed Policy Optimization via Analytic Dynamics Regularization

Namai Chandra, Liu Mohan, Zhihao Gu, Lin Wang

PDF

Open Access

TL;DR

PIPER is a physics-informed reinforcement learning framework that incorporates analytical physics constraints into policy optimization, improving sample efficiency and physical consistency in robotic control.

Contribution

The paper introduces PIPER, a novel RL method that seamlessly integrates analytical physics constraints into neural policy training without modifying existing algorithms or simulators.

Findings

01

Enhanced learning efficiency and stability in robotic control tasks.

02

Significantly improved control accuracy with physics-informed regularization.

03

No need for changes to existing simulators or RL algorithms.

Abstract

Reinforcement learning (RL) has achieved strong performance in robotic control; however, state-of-the-art policy learning methods, such as actor-critic methods, still suffer from high sample complexity and often produce physically inconsistent actions. This limitation stems from neural policies implicitly rediscovering complex physics from data alone, despite accurate dynamics models being readily available in simulators. In this paper, we introduce a novel physics-informed RL framework, called PIPER, that seamlessly integrates physical constraints directly into neural policy optimization with analytical soft physics constraints. At the core of our method is the integration of a differentiable Lagrangian residual as a regularization term within the actor's objective. This residual, extracted from a robot's simulator description, subtly biases policy updates towards dynamically…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Adversarial Robustness in Machine Learning