Physics-Informed Policy Optimization via Analytic Dynamics Regularization
Namai Chandra, Liu Mohan, Zhihao Gu, Lin Wang

TL;DR
PIPER is a physics-informed reinforcement learning framework that incorporates analytical physics constraints into policy optimization, improving sample efficiency and physical consistency in robotic control.
Contribution
The paper introduces PIPER, a novel RL method that seamlessly integrates analytical physics constraints into neural policy training without modifying existing algorithms or simulators.
Findings
Enhanced learning efficiency and stability in robotic control tasks.
Significantly improved control accuracy with physics-informed regularization.
No need for changes to existing simulators or RL algorithms.
Abstract
Reinforcement learning (RL) has achieved strong performance in robotic control; however, state-of-the-art policy learning methods, such as actor-critic methods, still suffer from high sample complexity and often produce physically inconsistent actions. This limitation stems from neural policies implicitly rediscovering complex physics from data alone, despite accurate dynamics models being readily available in simulators. In this paper, we introduce a novel physics-informed RL framework, called PIPER, that seamlessly integrates physical constraints directly into neural policy optimization with analytical soft physics constraints. At the core of our method is the integration of a differentiable Lagrangian residual as a regularization term within the actor's objective. This residual, extracted from a robot's simulator description, subtly biases policy updates towards dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Adversarial Robustness in Machine Learning
