Towards Highly-Constrained Human Motion Generation with Retrieval-Guided Diffusion Noise Optimization

Hanchao Liu; Fang-Lue Zhang; Shining Zhang; Tai-Jiang Mu; Shi-Min Hu

arXiv:2605.08054·cs.CV·May 11, 2026

Towards Highly-Constrained Human Motion Generation with Retrieval-Guided Diffusion Noise Optimization

Hanchao Liu, Fang-Lue Zhang, Shining Zhang, Tai-Jiang Mu, Shi-Min Hu

PDF

TL;DR

This paper introduces a retrieval-guided diffusion noise optimization method for generating human motion under highly constrained, complex spatiotemporal conditions, enabling more controllable virtual agent behaviors.

Contribution

It proposes a training-free, retrieval-guided approach with relational task parsing and reward-guided initialization to handle challenging motion constraints.

Findings

01

Successfully generates human motion with severe spatial obstacles.

02

Handles specified walking steps and complex constraints effectively.

03

Leverages large motion datasets and LLM for improved reasoning and guidance.

Abstract

Generating human motion that satisfies customized zero-shot goal functions, enabling applications such as controllable character animation and behavior synthesis for virtual agents, is a critical capability. While current approaches handle many unseen constraints, they fail on tasks with very challenging spatiotemporal restrictions, such as severe spatial obstacles or specified numbers of walking steps. To equip motion generators for these highly constrained tasks, we present a retrieval-guided method built on the training-free diffusion noise optimization framework. The key idea is to search within large motion datasets for guidance that can potentially satisfy difficult constraints. We introduce relational task parsing to group target constraints and identify the difficult ones to be handled by retrieved reference. A better initialization for diffusion noise is then obtained via a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.