FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion   Generation

Arvin Tashakori; Arash Tashakori; Gongbo Yang; Z. Jane Wang; Peyman; Servati

arXiv:2501.16778·cs.CV·January 29, 2025

FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation

Arvin Tashakori, Arash Tashakori, Gongbo Yang, Z. Jane Wang, Peyman, Servati

PDF

Open Access

TL;DR

FlexMotion is a lightweight, physics-aware human motion generation framework that combines a diffusion model with a Transformer encoder-decoder, enabling fast, realistic, and controllable motion synthesis without physics simulators.

Contribution

It introduces a novel, efficient diffusion-based framework with a multimodal Transformer for physically plausible and controllable human motion synthesis.

Findings

01

Achieves realistic motion with improved efficiency and control.

02

Outperforms existing methods in realism and physical plausibility.

03

Sets a new benchmark in human motion synthesis.

Abstract

Lightweight, controllable, and physically plausible human motion synthesis is crucial for animation, virtual reality, robotics, and human-computer interaction applications. Existing methods often compromise between computational efficiency, physical realism, or spatial controllability. We propose FlexMotion, a novel framework that leverages a computationally lightweight diffusion model operating in the latent space, eliminating the need for physics simulators and enabling fast and efficient training. FlexMotion employs a multimodal pre-trained Transformer encoder-decoder, integrating joint locations, contact forces, joint actuations and muscle activations to ensure the physical plausibility of the generated motions. FlexMotion also introduces a plug-and-play module, which adds spatial controllability over a range of motion parameters (e.g., joint locations, joint actuations, contact…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Pose and Action Recognition

MethodsAttention Is All You Need · Softmax · Adam · Residual Connection · Dropout · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer