Harmony-Aware Music-driven Motion Synthesis with Perceptual Constraint on UGC Datasets
Xinyi Wu, Haohong Wang, and Aggelos K. Katsaggelos

TL;DR
This paper introduces a harmony-aware GAN framework for music-driven 3D human motion synthesis from UGC datasets, emphasizing perceptual harmony constraints to improve rhythmic synchronization and realism.
Contribution
It proposes a novel harmony evaluation strategy with beat detection and saliency weighting, integrated into a GAN to enhance rhythmic harmony in generated motions.
Findings
Outperforms existing methods in rhythmic harmony metrics
Achieves realistic and synchronized motions with limited training data
Demonstrates effectiveness of perceptual harmony constraints in motion synthesis
Abstract
With the popularity of video-based user-generated content (UGC) on social media, harmony, as dictated by human perceptual principles, is critical in assessing the rhythmic consistency of audio-visual UGCs for better user engagement. In this work, we propose a novel harmony-aware GAN framework, following a specifically designed harmony evaluation strategy to enhance rhythmic synchronization in the automatic music-to-motion synthesis using a UGC dance dataset. This harmony strategy utilizes refined cross-modal beat detection to capture closely correlated audio and visual rhythms in an audio-visual pair. To mimic human attention mechanism, we introduce saliency-based beat weighting and interval-driven beat alignment, which ensures accurate harmony score estimation consistent with human perception. Building on this strategy, our model, employing efficient encoder-decoder and depth-lifting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Human Motion and Animation
