SkeletonGaussian: Editable 4D Generation through Gaussian Skeletonization

Lifan Wu; Ruijie Zhu; Yubo Ai; Tianzhu Zhang

arXiv:2602.04271·cs.CV·February 5, 2026

SkeletonGaussian: Editable 4D Generation through Gaussian Skeletonization

Lifan Wu, Ruijie Zhu, Yubo Ai, Tianzhu Zhang

PDF

Open Access

TL;DR

SkeletonGaussian introduces a hierarchical, skeleton-driven framework for editable 4D dynamic 3D object generation from monocular videos, improving interpretability and control over motion synthesis.

Contribution

It presents a novel hierarchical articulated representation that explicitly decomposes motion into rigid and non-rigid components, enabling more controllable and editable 4D generation.

Findings

01

Outperforms existing methods in generation quality

02

Enables intuitive motion editing

03

Provides a new paradigm for editable 4D synthesis

Abstract

4D generation has made remarkable progress in synthesizing dynamic 3D objects from input text, images, or videos. However, existing methods often represent motion as an implicit deformation field, which limits direct control and editability. To address this issue, we propose SkeletonGaussian, a novel framework for generating editable dynamic 3D Gaussians from monocular video input. Our approach introduces a hierarchical articulated representation that decomposes motion into sparse rigid motion explicitly driven by a skeleton and fine-grained non-rigid motion. Concretely, we extract a robust skeleton and drive rigid motion via linear blend skinning, followed by a hexplane-based refinement for non-rigid deformations, enhancing interpretability and editability. Experimental results demonstrate that SkeletonGaussian surpasses existing methods in generation quality while enabling intuitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Generative Adversarial Networks and Image Synthesis · Face recognition and analysis