Reinforcement Learning with Physics-Informed Symbolic Program Priors for Zero-Shot Wireless Indoor Navigation
Tao Li, Haozhe Lei, Mingsheng Yin, Yaqi Hu

TL;DR
This paper introduces PiPRL, a neuro-symbolic framework that incorporates physics priors into reinforcement learning for indoor navigation, improving training efficiency and generalization by combining symbolic and neural methods.
Contribution
The work develops a hierarchical neuro-symbolic approach that distills physics priors into RL, enabling better sample efficiency and interpretability in navigation tasks.
Findings
PiPRL outperforms purely symbolic or neural policies.
Reduces training time by over 26%.
Demonstrates effective integration of physics priors in RL.
Abstract
When using reinforcement learning (RL) to tackle physical control tasks, inductive biases that encode physics priors can help improve sample efficiency during training and enhance generalization in testing. However, the current practice of incorporating these helpful physics-informed inductive biases inevitably runs into significant manual labor and domain expertise, making them prohibitive for general users. This work explores a symbolic approach to distill physics-informed inductive biases into RL agents, where the physics priors are expressed in a domain-specific language (DSL) that is human-readable and naturally explainable. Yet, the DSL priors do not translate directly into an implementable policy due to partial and noisy observations and additional physical constraints in navigation tasks. To address this gap, we develop a physics-informed program-guided RL (PiPRL) framework with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Neural Networks and Reservoir Computing · Reinforcement Learning in Robotics
