Musika! Fast Infinite Waveform Music Generation
Marco Pasini, Jan Schl\"uter

TL;DR
Musika is a fast, efficient music generation system that uses adversarial autoencoders and GANs to produce high-quality, arbitrarily long music sequences in real-time on consumer hardware, enabling new interactive musical applications.
Contribution
Introduces Musika, a novel music generation framework that trains on limited data and generates music faster than real-time using a compact spectrogram representation and GANs.
Findings
Able to train on hundreds of hours of music with a single GPU
Generates music faster than real-time on consumer CPUs
Produces high-quality, stylistically coherent music samples
Abstract
Fast and user-controllable music generation could enable novel ways of composing or performing music. However, state-of-the-art music generation systems require large amounts of data and computational resources for training, and are slow at inference. This makes them impractical for real-time interactive use. In this work, we introduce Musika, a music generation system that can be trained on hundreds of hours of music using a single consumer GPU, and that allows for much faster than real-time generation of music of arbitrary length on a consumer CPU. We achieve this by first learning a compact invertible representation of spectrogram magnitudes and phases with adversarial autoencoders, then training a Generative Adversarial Network (GAN) on this representation for a particular music domain. A latent coordinate system enables generating arbitrarily long sequences of excerpts in parallel,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗marcop/musika_technomodel· ♡ 1♡ 1
- 🤗marcop/musika_miscmodel· ♡ 1♡ 1
- 🤗marcop/musika_misc_smallmodel· ♡ 3♡ 3
- 🤗marcop/musika_aemodel· ♡ 5♡ 5
- 🤗musika/musika_technomodel· ♡ 1♡ 1
- 🤗musika/musika_miscmodel· ♡ 1♡ 1
- 🤗Broccaloo/musika-s3rl-happy-hardcoremodel· ♡ 2♡ 2
- 🤗musika/musika-s3rl-happy-hardcoremodel· ♡ 5♡ 5
- 🤗musika/musika-halvany_oszi_rozsamodel
- 🤗musika/musika-irish-jigsmodel· ♡ 2♡ 2
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Music and Audio Processing · Generative Adversarial Networks and Image Synthesis
