A Unit Selection Methodology for Music Generation Using Deep Neural Networks
Mason Bretan, Gil Weinberg, and Larry Heck

TL;DR
This paper introduces a music generation approach combining unit selection with deep neural networks, including autoencoders and LSTMs, to produce diverse musical sequences evaluated through both objective metrics and expert listening tests.
Contribution
It presents a novel deep neural network framework for music generation using unit selection and concatenation, integrating autoencoders and structured models for improved diversity.
Findings
The model effectively predicts musical units with high accuracy.
Objective metrics show competitive performance against note-level baselines.
Expert evaluations favor the proposed unit selection approach.
Abstract
Several methods exist for a computer to generate music based on data including Markov chains, recurrent neural networks, recombinancy, and grammars. We explore the use of unit selection and concatenation as a means of generating music using a procedure based on ranking, where, we consider a unit to be a variable length number of measures of music. We first examine whether a unit selection method, that is restricted to a finite size unit library, can be sufficient for encompassing a wide spectrum of music. We do this by developing a deep autoencoder that encodes a musical input and reconstructs the input by selecting from the library. We then describe a generative model that combines a deep structured semantic model (DSSM) with an LSTM to predict the next unit, where units consist of four, two, and one measures of music. We evaluate the generative model using objective metrics including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Music and Audio Processing · Neuroscience and Music Perception
MethodsSigmoid Activation · Tanh Activation · Solana Customer Service Number +1-833-534-1729 · Long Short-Term Memory
