Music Discovery Dialogue Generation Using Human Intent Analysis and   Large Language Models

SeungHeon Doh; Keunwoo Choi; Daeyong Kwon; Taesu Kim; Juhan Nam

arXiv:2411.07439·cs.SD·November 13, 2024

Music Discovery Dialogue Generation Using Human Intent Analysis and Large Language Models

SeungHeon Doh, Keunwoo Choi, Daeyong Kwon, Taesu Kim, Juhan Nam

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel framework for generating large-scale, high-quality music discovery dialogue data using large language models and intent analysis, enabling improved conversational music retrieval systems.

Contribution

The authors develop a data generation framework leveraging LLMs and intent analysis to create a large synthetic music dialogue dataset, addressing data scarcity in conversational music retrieval.

Findings

01

Synthetic dataset is comparable to small human dialogue datasets in quality.

02

The generated dataset improves training for conversational music retrieval models.

03

Promising retrieval performance demonstrates effectiveness of the data generation approach.

Abstract

A conversational music retrieval system can help users discover music that matches their preferences through dialogue. To achieve this, a conversational music retrieval system should seamlessly engage in multi-turn conversation by 1) understanding user queries and 2) responding with natural language and retrieved music. A straightforward solution would be a data-driven approach utilizing such conversation logs. However, few datasets are available for the research and are limited in terms of volume and quality. In this paper, we present a data generation framework for rich music discovery dialogue using a large language model (LLM) and user intents, system actions, and musical attributes. This is done by i) dialogue intent analysis using grounded theory, ii) generating attribute sequences via cascading database filtering, and iii) generating utterances using large language models. By…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

seungheondoh/lp-music-dialog
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic and Audio Processing · Advanced Text Analysis Techniques · Topic Modeling