MDSC: Towards Evaluating the Style Consistency Between Music and Dance
Zixiang Zhou, Weiyuan Li, Baoyuan Wang

TL;DR
This paper introduces MDSC, a novel evaluation metric that measures the stylistic consistency between music and dance by modeling it as a clustering problem in a joint embedding space.
Contribution
MDSC is the first metric to evaluate style correlation between music and dance, moving beyond existing metrics focused on motion fidelity and rhythmic matching.
Findings
MDSC effectively measures style consistency in generated dance sequences.
The metric correlates well with user study evaluations.
It outperforms previous evaluation methods in robustness.
Abstract
We propose MDSC(Music-Dance-Style Consistency), the first evaluation metric that assesses to what degree the dance moves and music match. Existing metrics can only evaluate the motion fidelity and diversity and the degree of rhythmic matching between music and dance. MDSC measures how stylistically correlated the generated dance motion sequences and the conditioning music sequences are. We found that directly measuring the embedding distance between motion and music is not an optimal solution. We instead tackle this through modeling it as a clustering problem. Specifically, 1) we pre-train a music encoder and a motion encoder, then 2) we learn to map and align the motion and music embedding in joint space by jointly minimizing the intra-cluster distance and maximizing the inter-cluster distance, and 3) for evaluation purposes, we encode the dance moves into embedding and measure the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Music and Audio Processing · Music Technology and Sound Studies
MethodsALIGN
