Filtered-CoPhy: Unsupervised Learning of Counterfactual Physics in Pixel Space
Steeven Janny, Fabien Baradel, Natalia Neverova, Madiha Nadri, Greg, Mori, Christian Wolf

TL;DR
Filtered-CoPhy introduces an unsupervised approach for learning counterfactual physical dynamics directly in pixel space, enabling long-term video forecasting without supervision of object positions, by leveraging a hybrid latent representation.
Contribution
The paper presents a novel unsupervised method for counterfactual physics learning in pixel space using a hybrid latent representation combining dense features, keypoints, and latent vectors, outperforming existing baselines.
Findings
Outperforms strong baselines in pixel-space physics prediction
Effectively captures physical dynamics with a hybrid latent representation
Introduces a new challenging counterfactual video prediction benchmark
Abstract
Learning causal relationships in high-dimensional data (images, videos) is a hard task, as they are often defined on low dimensional manifolds and must be extracted from complex signals dominated by appearance, lighting, textures and also spurious correlations in the data. We present a method for learning counterfactual reasoning of physical processes in pixel space, which requires the prediction of the impact of interventions on initial conditions. Going beyond the identification of structural relationships, we deal with the challenging problem of forecasting raw video over long horizons. Our method does not require the knowledge or supervision of any ground truth positions or other object or scene properties. Our model learns and acts on a suitable hybrid latent representation based on a combination of dense features, sets of 2D keypoints and an additional latent vector per keypoint.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Explainable Artificial Intelligence (XAI) · Anomaly Detection Techniques and Applications
