Learning Parsimonious Dynamics for Generalization in Reinforcement Learning
Tankred Saanum, Eric Schulz

TL;DR
This paper introduces a model that learns simple, interpretable latent dynamics in reinforcement learning, inspired by human reasoning, to improve generalization and planning in new environments.
Contribution
It proposes a novel approach to learn parsimonious latent dynamics using locally linear transformations and a variational objective, enhancing generalization in RL.
Findings
Improved generalization in unseen environments.
Effective planning using learned latent dynamics.
Parsimonious models require fewer transformations.
Abstract
Humans are skillful navigators: We aptly maneuver through new places, realize when we are back at a location we have seen before, and can even conceive of shortcuts that go through parts of our environments we have never visited. Current methods in model-based reinforcement learning on the other hand struggle with generalizing about environment dynamics out of the training distribution. We argue that two principles can help bridge this gap: latent learning and parsimonious dynamics. Humans tend to think about environment dynamics in simple terms -- we reason about trajectories not in reference to what we expect to see along a path, but rather in an abstract latent space, containing information about the places' spatial coordinates. Moreover, we assume that moving around in novel parts of our environment works the same way as in parts we are familiar with. These two principles work…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications
