MotionRL: Align Text-to-Motion Generation to Human Preferences with   Multi-Reward Reinforcement Learning

Xiaoyang Liu; Yunyao Mao; Wengang Zhou; Houqiang Li

arXiv:2410.06513·cs.CV·October 10, 2024

MotionRL: Align Text-to-Motion Generation to Human Preferences with Multi-Reward Reinforcement Learning

Xiaoyang Liu, Yunyao Mao, Wengang Zhou, Houqiang Li

PDF

Open Access

TL;DR

MotionRL employs multi-reward reinforcement learning to optimize text-to-motion generation, aligning outputs with human preferences and balancing multiple objectives like motion quality and text adherence.

Contribution

It introduces a novel multi-reward RL framework for text-to-motion tasks, incorporating human preferences and multi-objective optimization to improve alignment and control.

Findings

01

Enhanced alignment with human preferences

02

Improved performance on multiple metrics

03

Allows control over generated motion results

Abstract

We introduce MotionRL, the first approach to utilize Multi-Reward Reinforcement Learning (RL) for optimizing text-to-motion generation tasks and aligning them with human preferences. Previous works focused on improving numerical performance metrics on the given datasets, often neglecting the variability and subjectivity of human feedback. In contrast, our novel approach uses reinforcement learning to fine-tune the motion generator based on human preferences prior knowledge of the human perception model, allowing it to generate motions that better align human preferences. In addition, MotionRL introduces a novel multi-objective optimization strategy to approximate Pareto optimality between text adherence, motion quality, and human preferences. Extensive experiments and user studies demonstrate that MotionRL not only allows control over the generated results across different objectives…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Human Pose and Action Recognition · Handwritten Text Recognition Techniques

MethodsALIGN