
TL;DR
This paper introduces a GAN-based video generation method that learns a configuration space and Hamiltonian dynamics from data, enhancing interpretability and applicability in physically plausible video synthesis.
Contribution
It presents a novel GAN framework with a learned configuration space and Hamiltonian neural network, trained with a cyclic-coordinate loss for improved interpretability.
Findings
Effective on the Hamiltonian Dynamics Suite Toy Physics dataset
Learns a minimal and interpretable configuration space
Demonstrates advantages over existing Hamiltonian-based models
Abstract
A growing body of work leverages the Hamiltonian formalism as an inductive bias for physically plausible neural network based video generation. The structure of the Hamiltonian ensures conservation of a learned quantity (e.g., energy) and imposes a phase-space interpretation on the low-dimensional manifold underlying the input video. While this interpretation has the potential to facilitate the integration of learned representations in downstream tasks, existing methods are limited in their applicability as they require a structural prior for the configuration space at design time. In this work, we present a GAN-based video generation pipeline with a learned configuration space map and Hamiltonian neural network motion model, to learn a representation of the configuration space from data. We train our model with a physics-inspired cyclic-coordinate loss function which encourages a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Human Pose and Action Recognition
