The Power of Next-Frame Prediction for Learning Physical Laws
Thomas Winterbottom, G. Thomas Hudson, Daniel Kluvanec, Dean Slack,, Jamie Sterling, Junjie Shentu, Chenghao Xiao, Zheming Zhou, Noura Al Moubayed

TL;DR
This paper investigates how next-frame prediction in videos can serve as a foundational learning strategy to induce understanding of physical laws, demonstrating models can infer physical constants without explicit training for that task.
Contribution
It introduces diagnostic datasets based on physical laws and shows models trained on next-frame prediction can infer physical constants, highlighting the method's potential for general understanding.
Findings
Models predict physical constants better than random models
Training on next-frame prediction induces understanding of physical laws
Generative training improves physical constant prediction by 1.28 to 6.24 times
Abstract
Next-frame prediction is a useful and powerful method for modelling and understanding the dynamics of video data. Inspired by the empirical success of causal language modelling and next-token prediction in language modelling, we explore the extent to which next-frame prediction serves as a strong foundational learning strategy (analogous to language modelling) for inducing an understanding of the visual world. In order to quantify the specific visual understanding induced by next-frame prediction, we introduce six diagnostic simulation video datasets derived from fundamental physical laws created by varying physical constants such as gravity and mass. We demonstrate that our models trained only on next-frame prediction are capable of predicting the value of these physical constants (e.g. gravity) without having been trained directly to learn these constants via a regression task. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Neural Networks and Applications
