Not All Frames Are Equal: Complexity-Aware Masked Motion Generation via Motion Spectral Descriptors

Pengfei Zhou; Xiangyue Zhang; Xukun Shen; Yong Hu

arXiv:2603.29655·cs.CV·April 1, 2026

Not All Frames Are Equal: Complexity-Aware Masked Motion Generation via Motion Spectral Descriptors

Pengfei Zhou, Xiangyue Zhang, Xukun Shen, Yong Hu

PDF

1 Repo

TL;DR

This paper introduces the Motion Spectral Descriptor (MSD), a measure of local motion complexity, to improve masked motion generation by making it complexity-aware, leading to better synthesis of dynamic motions.

Contribution

The paper proposes MSD, a deterministic spectral measure of motion complexity, and integrates it into masked motion generation to enhance performance on complex motions.

Findings

01

MSD correlates strongly with motion dynamics.

02

DynMask improves generation on complex motions.

03

Overall FID scores are strengthened on benchmarks.

Abstract

Masked generative models have become a strong paradigm for text-to-motion synthesis, but they still treat motion frames too uniformly during masking, attention, and decoding. This is a poor match for motion, where local dynamic complexity varies sharply over time. We show that current masked motion generators degrade disproportionately on dynamically complex motions, and that frame-wise generation error is strongly correlated with motion dynamics. Motivated by this mismatch, we introduce the Motion Spectral Descriptor (MSD), a simple and parameter-free measure of local dynamic complexity computed from the short-time spectrum of motion velocity. Unlike learned difficulty predictors, MSD is deterministic, interpretable, and derived directly from the motion signal itself. We use MSD to make masked motion generation complexity-aware. In particular, MSD guides content-focused masking during…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://xiangyue-zhang.github.io/DynMask
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.