NewtonianVAE: Proportional Control and Goal Identification from Pixels via Physical Latent Spaces
Miguel Jaques, Michael Burke, Timothy Hospedales

TL;DR
This paper introduces NewtonianVAE, a latent dynamics model that facilitates proportional control from pixels, simplifying control tasks and enabling interpretable goal discovery in vision-based imitation learning.
Contribution
The paper presents a novel latent dynamics learning framework that induces proportional controlability in the latent space, improving control simplicity and interpretability.
Findings
Enables proportional control directly from pixel inputs.
Simplifies and accelerates behavioral cloning for vision-based controllers.
Provides interpretable goal discovery in imitation learning.
Abstract
Learning low-dimensional latent state space dynamics models has been a powerful paradigm for enabling vision-based planning and learning for control. We introduce a latent dynamics learning framework that is uniquely designed to induce proportional controlability in the latent space, thus enabling the use of much simpler controllers than prior work. We show that our learned dynamics model enables proportional control from pixels, dramatically simplifies and accelerates behavioural cloning of vision-based controllers, and provides interpretable goal discovery when applied to imitation learning of switching controllers from demonstration.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
