BiPO: Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis
Seong-Eun Hong, Soobin Lim, Juyeong Hwang, Minwook Chang, Hyeongyeop Kang

TL;DR
BiPO is a novel bidirectional network that improves text-to-motion synthesis by integrating part-based generation with a partial occlusion technique, achieving state-of-the-art results and versatile motion editing capabilities.
Contribution
Introduces BiPO, a bidirectional partial occlusion network that enhances text-to-motion synthesis by considering both past and future contexts and controlling individual body parts.
Findings
Achieves state-of-the-art FID scores on HumanML3D dataset.
Outperforms recent methods like ParCo, MoMask, and BAMM.
Excels in both motion generation and editing tasks.
Abstract
Generating natural and expressive human motions from textual descriptions is challenging due to the complexity of coordinating full-body dynamics and capturing nuanced motion patterns over extended sequences that accurately reflect the given text. To address this, we introduce BiPO, Bidirectional Partial Occlusion Network for Text-to-Motion Synthesis, a novel model that enhances text-to-motion synthesis by integrating part-based generation with a bidirectional autoregressive architecture. This integration allows BiPO to consider both past and future contexts during generation while enhancing detailed control over individual body parts without requiring ground-truth motion length. To relax the interdependency among body parts caused by the integration, we devise the Partial Occlusion technique, which probabilistically occludes the certain motion part information during training. In our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction
