Model-based Reinforcement Learning with a Hamiltonian Canonical ODE   Network

Yao Feng; Yuhong Jiang; Hang Su; Dong Yan; Jun Zhu

arXiv:2211.00942·cs.LG·November 3, 2022

Model-based Reinforcement Learning with a Hamiltonian Canonical ODE Network

Yao Feng, Yuhong Jiang, Hang Su, Dong Yan, Jun Zhu

PDF

Open Access

TL;DR

This paper introduces NODA, a neural ODE auto-encoder leveraging Hamiltonian mechanics to improve sample efficiency and physical plausibility in model-based reinforcement learning for complex environments.

Contribution

The paper proposes NODA, a novel neural ODE auto-encoder incorporating Hamiltonian mechanics, enhancing efficiency and physical consistency in environment modeling for RL.

Findings

01

NODA effectively models environment dynamics with high sample efficiency.

02

NODA provides theoretical bounds for multi-step transition and value errors.

03

Experiments demonstrate improved early-stage RL performance with NODA.

Abstract

Model-based reinforcement learning usually suffers from a high sample complexity in training the world model, especially for the environments with complex dynamics. To make the training for general physical environments more efficient, we introduce Hamiltonian canonical ordinary differential equations into the learning process, which inspires a novel model of neural ordinary differential auto-encoder (NODA). NODA can model the physical world by nature and is flexible to impose Hamiltonian mechanics (e.g., the dimension of the physical equations) which can further accelerate training of the environment models. It can consequentially empower an RL agent with the robust extrapolation using a small amount of samples as well as the guarantee on the physical plausibility. Theoretically, we prove that NODA has uniform bounds for multi-step transition errors and value errors under certain…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsModel Reduction and Neural Networks · Reinforcement Learning in Robotics · Neural Networks and Applications