FlexMotion: Lightweight, Physics-Aware, and Controllable Human Motion Generation
Arvin Tashakori, Arash Tashakori, Gongbo Yang, Z. Jane Wang, Peyman, Servati

TL;DR
FlexMotion is a lightweight, physics-aware human motion generation framework that combines a diffusion model with a Transformer encoder-decoder, enabling fast, realistic, and controllable motion synthesis without physics simulators.
Contribution
It introduces a novel, efficient diffusion-based framework with a multimodal Transformer for physically plausible and controllable human motion synthesis.
Findings
Achieves realistic motion with improved efficiency and control.
Outperforms existing methods in realism and physical plausibility.
Sets a new benchmark in human motion synthesis.
Abstract
Lightweight, controllable, and physically plausible human motion synthesis is crucial for animation, virtual reality, robotics, and human-computer interaction applications. Existing methods often compromise between computational efficiency, physical realism, or spatial controllability. We propose FlexMotion, a novel framework that leverages a computationally lightweight diffusion model operating in the latent space, eliminating the need for physics simulators and enabling fast and efficient training. FlexMotion employs a multimodal pre-trained Transformer encoder-decoder, integrating joint locations, contact forces, joint actuations and muscle activations to ensure the physical plausibility of the generated motions. FlexMotion also introduces a plug-and-play module, which adds spatial controllability over a range of motion parameters (e.g., joint locations, joint actuations, contact…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition
MethodsAttention Is All You Need · Softmax · Adam · Residual Connection · Dropout · Absolute Position Encodings · Byte Pair Encoding · Linear Layer · Multi-Head Attention · Position-Wise Feed-Forward Layer
