Sequence Generation using Deep Recurrent Networks and Embeddings: A study case in music
Sebastian Garcia-Valencia, Alejandro Betancourt, Juan G., Lalinde-Pulido

TL;DR
This paper explores the use of deep recurrent neural networks with embeddings for automatic music sequence generation, analyzing different memory mechanisms and their impact on musical quality.
Contribution
It introduces a novel approach combining music theory concepts with neural network embeddings to enhance melody generation quality.
Findings
Memory mechanisms significantly affect music generation quality
Embeddings improve semantic coherence in generated melodies
Quantitative metrics effectively evaluate musical tonality
Abstract
Automatic generation of sequences has been a highly explored field in the last years. In particular, natural language processing and automatic music composition have gained importance due to the recent advances in machine learning and Neural Networks with intrinsic memory mechanisms such as Recurrent Neural Networks. This paper evaluates different types of memory mechanisms (memory cells) and analyses their performance in the field of music composition. The proposed approach considers music theory concepts such as transposition, and uses data transformations (embeddings) to introduce semantic meaning and improve the quality of the generated melodies. A set of quantitative metrics is presented to evaluate the performance of the proposed architecture automatically, measuring the tonality of the musical compositions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis
