Play Me Something Icy: Practical Challenges, Explainability and the   Semantic Gap in Generative AI Music

Jesse Allison; Drew Farrar; Treya Nash; Carlos Rom\'an; Morgan Weeks,; Fiona Xue Ju

arXiv:2408.07224·cs.SD·August 15, 2024

Play Me Something Icy: Practical Challenges, Explainability and the Semantic Gap in Generative AI Music

Jesse Allison, Drew Farrar, Treya Nash, Carlos Rom\'an, Morgan Weeks,, Fiona Xue Ju

PDF

Open Access

TL;DR

This paper critically examines the challenges, explainability issues, and semantic gaps in text-to-audio and text-to-music generative AI tools from the perspective of experimental musicians and researchers.

Contribution

It highlights key challenges like semantic gaps and explainability in generative AI music, providing insights and suggestions for future improvements.

Findings

01

Semantic gap in describing music with text-based prompts

02

Trade-off between explainability and usability in AI tools

03

Need for better user control and input mechanisms

Abstract

This pictorial aims to critically consider the nature of text-to-audio and text-to-music generative tools in the context of explainable AI. As a group of experimental musicians and researchers, we are enthusiastic about the creative potential of these tools and have sought to understand and evaluate them from perspectives of prompt creation, control, usability, understandability, explainability of the AI process, and overall aesthetic effectiveness of the results. One of the challenges we have identified that is not explicitly addressed by these tools is the inherent semantic gap in using text-based tools to describe something as abstract as music. Other gaps include explainability vs. useability, and user control and input vs. the human creative process. The aim of this pictorial is to raise questions for discussion and make a few general suggestions on the kinds of improvements we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMusic Technology and Sound Studies · Explainable Artificial Intelligence (XAI)