Dance-to-Music Generation with Encoder-based Textual Inversion
Sifei Li, Weiming Dong, Yuxin Zhang, Fan Tang, Chongyang Ma, Oliver, Deussen, Tong-Yee Lee, Changsheng Xu

TL;DR
This paper introduces an encoder-based textual inversion method that enhances text-to-music generation by incorporating dance rhythm and genre, enabling personalized and temporally aligned dance-music synthesis.
Contribution
It proposes a dual-path rhythm-genre inversion technique with separate encoders, improving control over musical rhythm and genre in text-to-music models for dance applications.
Findings
Outperforms state-of-the-art methods on new InDV dataset
Effectively adapts to tempo changes in dance sequences
Enhances alignment of music with dance movements
Abstract
The seamless integration of music with dance movements is essential for communicating the artistic intent of a dance piece. This alignment also significantly improves the immersive quality of gaming experiences and animation productions. Although there has been remarkable advancement in creating high-fidelity music from textual descriptions, current methodologies mainly focus on modulating overall characteristics such as genre and emotional tone. They often overlook the nuanced management of temporal rhythm, which is indispensable in crafting music for dance, since it intricately aligns the musical beats with the dancers' movements. Recognizing this gap, we propose an encoder-based textual inversion technique to augment text-to-music models with visual control, facilitating personalized music generation. Specifically, we develop dual-path rhythm-genre inversion to effectively integrate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Music and Audio Processing · Music Technology and Sound Studies
