TL;DR
This paper introduces a conditional sequence-to-sequence neural framework for automatic lead sheet generation that can be steered by high-level emotional and structural controls: valence, phrasing, and time signature.
Contribution
It presents a new method for calculating the emotional valence of chord progressions and uses that valence as a condition in a neural machine translation style model for controllable music generation.
Findings
The model generates controllable lead sheets whose attribute distributions are similar to those of the training data.
Listening tests confirmed that valence can be controlled effectively.
The paper compares LSTM and Transformer architectures for music generation.
Abstract
The field of automatic music composition has seen great progress in the last few years, much of which can be attributed to advances in deep neural networks. There are numerous studies that present different strategies for generating sheet music from scratch. The inclusion of high-level musical characteristics (e.g., perceived emotional qualities), however, as conditions for controlling the generation output remains a challenge. In this paper, we present a novel approach for calculating the valence (the positivity or negativity of the perceived emotion) of a chord progression within a lead sheet, using pre-defined mood tags proposed by music experts. Based on this approach, we propose a novel strategy for conditional lead sheet generation that allows us to steer the music generation in terms of valence, phrasing, and time signature. Our approach is similar to a Neural Machine Translation…
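As a rough illustration of how mood tags could be turned into a valence score for a chord progression, here is a minimal Python sketch. It is not the paper's method: the tag scores in `TAG_VALENCE`, the chord-to-tag table `CHORD_TAGS`, and the unweighted averaging are all assumptions made for this example, since the abstract does not give the actual tags or formula.

```python
from statistics import mean

# Hypothetical tag-to-valence scores in [-1, 1]; not taken from the paper.
TAG_VALENCE = {"happy": 1.0, "bright": 0.5, "sad": -1.0, "tense": -0.5}

# Hypothetical chord-quality-to-mood-tag table; illustrative only.
CHORD_TAGS = {
    "maj": ["happy", "bright"],
    "min": ["sad"],
    "dim": ["tense", "sad"],
    "7": ["bright", "tense"],
}


def chord_valence(quality: str) -> float:
    """Average the valence of all mood tags attached to one chord quality."""
    return mean(TAG_VALENCE[tag] for tag in CHORD_TAGS[quality])


def progression_valence(qualities: list[str]) -> float:
    """Unweighted average of per-chord valence over a progression (an assumption)."""
    if not qualities:
        raise ValueError("empty progression")
    return mean(chord_valence(q) for q in qualities)


if __name__ == "__main__":
    # A four-chord progression described only by chord quality.
    print(progression_valence(["maj", "min", "7", "maj"]))
```

A score like this could then be bucketed into a discrete label and supplied to a sequence-to-sequence model as a condition, but how the paper encodes its conditions is not stated in this summary.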
Taxonomy
Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Softmax · Layer Normalization · Byte Pair Encoding · Adam
