TL;DR
AniGen is a novel framework that directly generates animatable 3D assets from a single image by representing shape, skeleton, and skinning as mutually consistent $S^3$ Fields, improving rig validity and animation quality.
Contribution
AniGen introduces a unified $S^3$ Field representation and a two-stage pipeline for directly generating animate-ready 3D assets conditioned on a single image.
Findings
Outperforms state-of-the-art baselines in rig validity and animation quality.
Generalizes effectively to diverse categories including animals, humanoids, and machinery.
Successfully generates animate-ready assets from in-the-wild images.
Abstract
Animatable 3D assets, defined as geometry equipped with an articulated skeleton and skinning weights, are fundamental to interactive graphics, embodied agents, and animation production. While recent 3D generative models can synthesize visually plausible shapes from images, the results are typically static. Obtaining usable rigs via post-hoc auto-rigging is brittle and often produces skeletons that are topologically inconsistent with the generated geometry. We present AniGen, a unified framework that directly generates animate-ready 3D assets conditioned on a single image. Our key insight is to represent shape, skeleton, and skinning as mutually consistent Fields (Shape, Skeleton, Skin) defined over a shared spatial domain. To enable the robust learning of these fields, we introduce two technical innovations: (i) a confidence-decaying skeleton field that explicitly handles the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
