GenMatter: Perceiving Physical Objects with Generative Matter Models
Eric Li, Arijit Dasgupta, Yoni Friedman, Mathieu Huot, Vikash Mansinghka, Thomas O'Connell, William T. Freeman, and Joshua B. Tenenbaum

TL;DR
This paper introduces GenMatter, a generative model inspired by human perception that hierarchically groups motion cues and appearance features to identify and track physical objects across diverse visual settings.
Contribution
The authors propose a unified, hardware-accelerated generative framework that effectively perceives and segments moving objects in various visual environments, bridging gaps in existing computer vision methods.
Findings
Captures human-like object perception in random dot kinematograms.
Recovers 3D structure and accurate segmentation from camouflaged objects.
Tracks deforming objects in natural RGB videos for scene understanding.
Abstract
Human visual perception offers valuable insights for understanding computational principles of motion-based scene interpretation. Humans robustly detect and segment moving entities that constitute independently moveable chunks of matter, whether observing sparse moving dots, textured surfaces, or naturalistic scenes. In contrast, existing computer vision systems lack a unified approach that works across these diverse settings. Inspired by principles of human perception, we propose a generative model that hierarchically groups low-level motion cues and high-level appearance features into particles (small Gaussians representing local matter), and groups particles into clusters capturing coherently and independently moveable physical entities. We develop a hardware-accelerated inference algorithm based on parallelized block Gibbs sampling to recover stable particle motion and groupings.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
