Diffusion Model-based Activity Completion for AI Motion Capture from Videos

Gao Huayu; Huang Tengjiu; Ye Xiaolong; Tsuyoshi Okita

arXiv:2505.21566·cs.CV·May 29, 2025

Diffusion Model-based Activity Completion for AI Motion Capture from Videos

Gao Huayu, Huang Tengjiu, Ye Xiaolong, Tsuyoshi Okita

PDF

Open Access

TL;DR

This paper introduces a diffusion-model-based approach for AI motion capture that enables the generation of smooth, continuous human motion sequences beyond observed data, improving naturalness and coherence.

Contribution

It presents a novel diffusion model with a gate and position-time embedding modules for action completion in AI motion capture, addressing transition gaps in training data.

Findings

01

MDC-Net outperforms existing methods in ADE, FDE, MMADE

02

MDC-Net has a smaller model size than HumanMAC

03

MDC-Net produces more natural and coherent motions

Abstract

AI-based motion capture is an emerging technology that offers a cost-effective alternative to traditional motion capture systems. However, current AI motion capture methods rely entirely on observed video sequences, similar to conventional motion capture. This means that all human actions must be predefined, and movements outside the observed sequences are not possible. To address this limitation, we aim to apply AI motion capture to virtual humans, where flexible actions beyond the observed sequences are required. We assume that while many action fragments exist in the training data, the transitions between them may be missing. To bridge these gaps, we propose a diffusion-model-based action completion technique that generates complementary human motion sequences, ensuring smooth and continuous movements. By introducing a gate module and a position-time embedding module, our approach…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · Robot Manipulation and Learning