TEAdapter: Supply abundant guidance for controllable text-to-music   generation

Jialing Zou; Jiahao Mei; Xudong Nan; Jinghua Li; Daoguo Dong; Liang He

arXiv:2408.04865·cs.SD·August 12, 2024

TEAdapter: Supply abundant guidance for controllable text-to-music generation

Jialing Zou, Jiahao Mei, Xudong Nan, Jinghua Li, Daoguo Dong, Liang He

PDF

Open Access 1 Repo

TL;DR

TEAdapter is a lightweight plugin that enhances controllability in text-to-music generation, allowing users to specify diverse control conditions for more intricate and structured music creation.

Contribution

We introduce TEAdapter, a novel plugin that provides fine-grained control over music generation, adaptable to various diffusion models and capable of managing complex structural controls.

Findings

01

Enables precise control over music generation at multiple levels.

02

Ensures high-quality output with diverse control conditions.

03

Transferable to different diffusion model architectures.

Abstract

Although current text-guided music generation technology can cope with simple creative scenarios, achieving fine-grained control over individual text-modality conditions remains challenging as user demands become more intricate. Accordingly, we introduce the TEAcher Adapter (TEAdapter), a compact plugin designed to guide the generation process with diverse control information provided by users. In addition, we explore the controllable generation of extended music by leveraging TEAdapter control groups trained on data of distinct structural functionalities. In general, we consider controls over global, elemental, and structural levels. Experimental results demonstrate that the proposed TEAdapter enables multiple precise controls and ensures high-quality music generation. Our module is also lightweight and transferable to any diffusion model architecture. Available code and demos will be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ashley1101/teadapter
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Music Technology and Sound Studies · Human Motion and Animation