Using a Bi-directional LSTM Model with Attention Mechanism trained on MIDI Data for Generating Unique Music
Ashish Ranjan, Varun Nagesh Jolly Behera, Motahar Reza

TL;DR
This paper presents a bi-directional LSTM with attention mechanism trained on MIDI data to generate music that mimics the style of the training data, allowing for customizable parameters like tempo and instruments.
Contribution
It introduces a novel neural network model combining bi-directional LSTM and attention for music generation from MIDI data, enabling style mimicry and parameter customization.
Findings
Generated music follows the training style.
Model allows post-generation parameter adjustments.
Effective in capturing musical themes.
Abstract
Generating music is an interesting and challenging problem in the field of machine learning. Mimicking human creativity has been popular in recent years, especially in the field of computer vision and image processing. With the advent of GANs, it is possible to generate new similar images, based on trained data. But this cannot be done for music similarly, as music has an extra temporal dimension. So it is necessary to understand how music is represented in digital form. When building models that perform this generative task, the learning and generation part is done in some high-level representation such as MIDI (Musical Instrument Digital Interface) or scores. This paper proposes a bi-directional LSTM (Long short-term memory) model with attention mechanism capable of generating similar type of music based on MIDI data. The music generated by the model follows the theme/style of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis
MethodsTanh Activation · Sigmoid Activation · Long Short-Term Memory
