Drive Any Mesh: 4D Latent Diffusion for Mesh Deformation from Video

Yahao Shi; Yang Liu; Yanmin Wu; Xing Liu; Chen Zhao; Jie Luo; Bin Zhou

arXiv:2506.07489·cs.CV·June 10, 2025

Drive Any Mesh: 4D Latent Diffusion for Mesh Deformation from Video

Yahao Shi, Yang Liu, Yanmin Wu, Xing Liu, Chen Zhao, Jie Luo, Bin Zhou

PDF

Open Access

TL;DR

DriveAnyMesh introduces a 4D diffusion model that efficiently generates mesh animations from monocular videos, improving compatibility with modern rendering engines and enabling rapid, high-quality animations for complex motions.

Contribution

The paper presents a novel 4D diffusion approach using a transformer-based VAE to produce mesh animations from video, addressing efficiency and generalization issues of prior methods.

Findings

01

Rapid generation of high-quality mesh animations

02

Compatibility with modern rendering engines

03

Effective handling of complex motions

Abstract

We propose DriveAnyMesh, a method for driving mesh guided by monocular video. Current 4D generation techniques encounter challenges with modern rendering engines. Implicit methods have low rendering efficiency and are unfriendly to rasterization-based engines, while skeletal methods demand significant manual effort and lack cross-category generalization. Animating existing 3D assets, instead of creating 4D assets from scratch, demands a deep understanding of the input's 3D structure. To tackle these challenges, we present a 4D diffusion model that denoises sequences of latent sets, which are then decoded to produce mesh animations from point cloud trajectory sequences. These latent sets leverage a transformer-based variational autoencoder, simultaneously capturing 3D shape and motion information. By employing a spatiotemporal, transformer-based diffusion model, information is exchanged…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topics3D Shape Modeling and Analysis · Computer Graphics and Visualization Techniques · Human Motion and Animation