Sequence Generation using Deep Recurrent Networks and Embeddings: A study case in music

Sebastian Garcia-Valencia; Alejandro Betancourt; Juan G. Lalinde-Pulido

arXiv:2012.01231·cs.SD·December 3, 2020·1 cites

TL;DR

This paper explores the use of deep recurrent neural networks with embeddings for automatic music sequence generation, analyzing different memory mechanisms and their impact on musical quality.

Contribution

It introduces a novel approach combining music theory concepts with neural network embeddings to enhance melody generation quality.

Findings

01. Memory mechanisms significantly affect music generation quality

02. Embeddings improve semantic coherence in generated melodies

03. Quantitative metrics effectively evaluate musical tonality

Abstract

Automatic sequence generation has been widely explored in recent years. In particular, natural language processing and automatic music composition have gained importance thanks to recent advances in machine learning and in neural networks with intrinsic memory mechanisms, such as recurrent neural networks. This paper evaluates different types of memory mechanisms (memory cells) and analyses their performance in music composition. The proposed approach draws on music theory concepts such as transposition and uses data transformations (embeddings) to introduce semantic meaning and improve the quality of the generated melodies. A set of quantitative metrics is presented to evaluate the proposed architecture automatically by measuring the tonality of the musical compositions.
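To make two of the ideas in the abstract concrete, here is a minimal sketch of transposition and of one possible tonality measure. It assumes a monophonic melody stored as a list of MIDI pitch numbers. The helper names (`transpose`, `all_transpositions`, `tonality_score`) and the "best-fitting major scale" score are illustrative assumptions, not the metrics or the embedding defined in the paper.

```python
from typing import List

# Semitone offsets of a major scale measured from its tonic.
MAJOR_SCALE = (0, 2, 4, 5, 7, 9, 11)


def transpose(melody: List[int], semitones: int) -> List[int]:
    """Shift every MIDI pitch by the same number of semitones."""
    return [pitch + semitones for pitch in melody]


def all_transpositions(melody: List[int]) -> List[List[int]]:
    """Return the melody shifted up by 0..11 semitones (one copy per key).

    Assumption: used here as a simple data-augmentation step; the paper
    may apply transposition differently.
    """
    return [transpose(melody, shift) for shift in range(12)]


def tonality_score(melody: List[int]) -> float:
    """Largest fraction of notes that fit a single major scale.

    Hypothetical metric: tries all 12 tonics and keeps the best fit.
    A score of 1.0 means every note belongs to one major key.
    """
    if not melody:
        return 0.0
    best = 0
    for tonic in range(12):
        scale = {(tonic + step) % 12 for step in MAJOR_SCALE}
        in_key = sum(1 for pitch in melody if pitch % 12 in scale)
        best = max(best, in_key)
    return best / len(melody)


# Usage: a C major fragment stays fully tonal in every transposition.
c_major_fragment = [60, 62, 64, 65, 67]
scores = [tonality_score(m) for m in all_transpositions(c_major_fragment)]
```

Because the score searches over all twelve tonics, it is unchanged by transposition, which is why a transposed copy of a tonal melody scores the same as the original.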

Code & Models

Repositories

sebasgverde/mono-midi-transposition-dataset

Taxonomy

Topics: Music and Audio Processing · Music Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis