Setting the rhythm scene: deep learning-based drum loop generation from arbitrary language cues
Ignacio J. Tripodi

TL;DR
This paper introduces a deep learning method that generates drum patterns based on language cues, aiding music creation and live performance with novel techniques for extracting consensus drum tracks from songs.
Contribution
It presents a new approach for generating mood-based drum patterns from arbitrary language cues and a method for extracting consensus drum tracks from existing music.
Findings
Generated drum patterns embody the mood of language cues
Novel method for extracting consensus drum tracks from songs
Potential applications in electronic music and audiovisual production
Abstract
Generative artificial intelligence models can be a valuable aid to music composition and live performance, both to aid the professional musician and to help democratize the music creation process for hobbyists. Here we present a novel method that, given an English word or phrase, generates 2 compasses of a 4-piece drum pattern that embodies the "mood" of the given language cue, or that could be used for an audiovisual scene described by the language cue. We envision this tool as composition aid for electronic music and audiovisual soundtrack production, or an improvisation tool for live performance. In order to produce the training samples for this model, besides manual annotation of the "scene" or "mood" terms, we have designed a novel method to extract the consensus drum track of any song. This consists of a 2-bar, 4-piece drum pattern that represents the main percussive motif of a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Neuroscience and Music Perception
