Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models
SeungHeon Doh, Keunwoo Choi, Daeyong Kwon, Taesu Kim, Juhan Nam

TL;DR
This paper introduces a novel framework for generating large-scale, high-quality music discovery dialogue data using large language models and intent analysis, enabling improved conversational music retrieval systems.
Contribution
The authors develop a data generation framework leveraging LLMs and intent analysis to create a large synthetic music dialogue dataset, addressing data scarcity in conversational music retrieval.
Findings
Synthetic dataset is comparable to small human dialogue datasets in quality.
The generated dataset improves training for conversational music retrieval models.
Promising retrieval performance demonstrates effectiveness of the data generation approach.
Abstract
A conversational music retrieval system can help users discover music that matches their preferences through dialogue. To achieve this, a conversational music retrieval system should seamlessly engage in multi-turn conversation by 1) understanding user queries and 2) responding with natural language and retrieved music. A straightforward solution would be a data-driven approach utilizing such conversation logs. However, few datasets are available for the research and are limited in terms of volume and quality. In this paper, we present a data generation framework for rich music discovery dialogue using a large language model (LLM) and user intents, system actions, and musical attributes. This is done by i) dialogue intent analysis using grounded theory, ii) generating attribute sequences via cascading database filtering, and iii) generating utterances using large language models. By…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Advanced Text Analysis Techniques · Topic Modeling
