ActionMesh: Animated 3D Mesh Generation with Temporal 3D Diffusion
Remy Sabathier, David Novotny, Niloy J. Mitra, Tom Monnier

TL;DR
ActionMesh is a novel generative model that creates high-quality, animated 3D meshes efficiently by incorporating temporal diffusion and autoencoding, enabling rapid, topology-consistent animations from various inputs.
Contribution
The paper introduces ActionMesh, a fast, rig-free 3D animation generation framework using temporal 3D diffusion and autoencoding, improving quality and speed over prior methods.
Findings
Achieves state-of-the-art geometric accuracy and temporal consistency.
Generates animated 3D meshes from videos, text, or static meshes.
Produces results with rapid inference and topology preservation.
Abstract
Generating animated 3D objects is at the heart of many applications, yet most advanced works are typically difficult to apply in practice because of their limited setup, their long runtime, or their limited quality. We introduce ActionMesh, a generative model that predicts production-ready 3D meshes "in action" in a feed-forward manner. Drawing inspiration from early video models, our key insight is to modify existing 3D diffusion models to include a temporal axis, resulting in a framework we dubbed "temporal 3D diffusion". Specifically, we first adapt the 3D diffusion stage to generate a sequence of synchronized latents representing time-varying and independent 3D shapes. Second, we design a temporal 3D autoencoder that translates a sequence of independent shapes into the corresponding deformations of a pre-defined reference shape, allowing us to build an animation. Combining these two…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
