TL;DR
This paper introduces a Bayesian inverse graphics model that enables few-shot concept learning by inferring object parameters from minimal data, outperforming neural-only methods in classification and pose estimation.
Contribution
The work presents a probabilistic generative model using inverse graphics for few-shot learning, integrating differentiable rendering and MCMC sampling for uncertainty-aware inference.
Findings
Outperforms existing few-shot neural classification methods.
Demonstrates robustness across lighting, backgrounds, and out-of-distribution shapes.
Uses a differentiable renderer and MCMC for scene parameter inference.
Abstract
Humans excel at building generalizations of new concepts from just one single example. Contrary to this, current computer vision models typically require large amount of training samples to achieve a comparable accuracy. In this work we present a Bayesian model of perception that learns using only minimal data, a prototypical probabilistic program of an object. Specifically, we propose a generative inverse graphics model of primitive shapes, to infer posterior distributions over physically consistent parameters from one or several images. We show how this representation can be used for downstream tasks such as few-shot classification and pose estimation. Our model outperforms existing few-shot neural-only classification algorithms and demonstrates generalization across varying lighting conditions, backgrounds, and out-of-distribution shapes. By design, our model is uncertainty-aware and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
