Dance-to-Music Generation with Encoder-based Textual Inversion

Sifei Li; Weiming Dong; Yuxin Zhang; Fan Tang; Chongyang Ma; Oliver; Deussen; Tong-Yee Lee; Changsheng Xu

arXiv:2401.17800·cs.SD·September 16, 2024·2 cites

Dance-to-Music Generation with Encoder-based Textual Inversion

Sifei Li, Weiming Dong, Yuxin Zhang, Fan Tang, Chongyang Ma, Oliver, Deussen, Tong-Yee Lee, Changsheng Xu

PDF

Open Access 1 Repo

TL;DR

This paper introduces an encoder-based textual inversion method that enhances text-to-music generation by incorporating dance rhythm and genre, enabling personalized and temporally aligned dance-music synthesis.

Contribution

It proposes a dual-path rhythm-genre inversion technique with separate encoders, improving control over musical rhythm and genre in text-to-music models for dance applications.

Findings

01

Outperforms state-of-the-art methods on new InDV dataset

02

Effectively adapts to tempo changes in dance sequences

03

Enhances alignment of music with dance movements

Abstract

The seamless integration of music with dance movements is essential for communicating the artistic intent of a dance piece. This alignment also significantly improves the immersive quality of gaming experiences and animation productions. Although there has been remarkable advancement in creating high-fidelity music from textual descriptions, current methodologies mainly focus on modulating overall characteristics such as genre and emotional tone. They often overlook the nuanced management of temporal rhythm, which is indispensable in crafting music for dance, since it intricately aligns the musical beats with the dancers' movements. Recognizing this gap, we propose an encoder-based textual inversion technique to augment text-to-music models with visual control, facilitating personalized music generation. Specifically, we develop dual-path rhythm-genre inversion to effectively integrate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lsfhuihuiff/dance-to-music_siggraph_asia_2024
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman Motion and Animation · Music and Audio Processing · Music Technology and Sound Studies