Self-supervised Neural Articulated Shape and Appearance Models
Fangyin Wei, Rohan Chabra, Lingni Ma, Christoph Lassner, Michael, Zollh\"ofer, Szymon Rusinkiewicz, Chris Sweeney, Richard Newcombe, Mira, Slavcheva

TL;DR
This paper introduces a self-supervised neural model that learns to represent the geometry, appearance, and motion of articulated objects from 2D images, enabling control, reconstruction, and synthesis without 3D supervision.
Contribution
It presents a novel end-to-end approach for learning articulated object representations from images without requiring annotations, outperforming methods that rely on 3D supervision.
Findings
Effective for various joint types like revolute and prismatic
Produces more faithful geometry and appearance from 2D data
Enables applications like few-shot reconstruction and novel view synthesis
Abstract
Learning geometry, motion, and appearance priors of object classes is important for the solution of a large variety of computer vision problems. While the majority of approaches has focused on static objects, dynamic objects, especially with controllable articulation, are less explored. We propose a novel approach for learning a representation of the geometry, appearance, and motion of a class of articulated objects given only a set of color images as input. In a self-supervised manner, our novel representation learns shape, appearance, and articulation codes that enable independent control of these semantic dimensions. Our model is trained end-to-end without requiring any articulation annotations. Experiments show that our approach performs well for different joint types, such as revolute and prismatic joints, as well as different combinations of these joints. Compared to state of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Advanced Vision and Imaging · Domain Adaptation and Few-Shot Learning
