Causal Motion Diffusion Models for Autoregressive Motion Generation
Qing Yu, Akihisa Watanabe, Kent Fujiwara

TL;DR
This paper introduces Causal Motion Diffusion Models (CMDM), a novel autoregressive framework that enables real-time, high-quality human motion synthesis by combining causal diffusion transformers with semantic latent representations.
Contribution
The work presents a unified causal diffusion transformer framework for autoregressive motion generation, improving temporal causality, inference speed, and motion quality over prior diffusion and autoregressive models.
Findings
Outperforms existing models in semantic fidelity and smoothness.
Reduces inference latency significantly.
Supports real-time, long-horizon motion generation.
Abstract
Recent advances in motion diffusion models have substantially improved the realism of human motion synthesis. However, existing approaches either rely on full-sequence diffusion models with bidirectional generation, which limits temporal causality and real-time applicability, or autoregressive models that suffer from instability and cumulative errors. In this work, we present Causal Motion Diffusion Models (CMDM), a unified framework for autoregressive motion generation based on a causal diffusion transformer that operates in a semantically aligned latent space. CMDM builds upon a Motion-Language-Aligned Causal VAE (MAC-VAE), which encodes motion sequences into temporally causal latent representations. On top of this latent representation, an autoregressive diffusion transformer is trained using causal diffusion forcing to perform temporally ordered denoising across motion frames. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
