Wykorzystanie sztucznej inteligencji do generowania tre\'sci muzycznych

Mateusz Dorobek

arXiv:1912.10815·cs.SD·December 24, 2019

Wykorzystanie sztucznej inteligencji do generowania tre\'sci muzycznych

Mateusz Dorobek

PDF

TL;DR

This thesis presents a novel method for generating short musical phrases using a DCGAN trained on MIDI data converted into piano roll images, achieving musically interesting results.

Contribution

It introduces a new approach of translating MIDI data into images for GAN training, enhancing music generation quality and diversity.

Findings

01

Generated musical phrases contain interesting rhythmic and harmonic structures.

02

The image-based approach improves the quality of generated music.

03

The method shows promising results compared to existing solutions.

Abstract

This thesis is presenting a method for generating short musical phrases using a deep convolutional generative adversarial network (DCGAN). To train neural network were used datasets of classical and jazz music MIDI recordings. Our approach introduces translating the MIDI data into graphical images in a piano roll format suitable for the network input size, using the RGB channels as additional information carriers for improved performance. The network has learned to generate images that are indistinguishable from the input data and, when translated back to MIDI and played back, include several musically interesting rhythmic and harmonic structures. The results of the conducted experiments are described and discussed, with conclusions for further work and a short comparison with selected existing solutions.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.