LaunchpadGPT: Language Model as Music Visualization Designer on Launchpad
Siting Xu, Yolo Yunlong Tang, Feng Zheng

TL;DR
LaunchpadGPT is a novel language model-based system that automatically generates music visualization designs for Launchpad, aiding beginners and enhancing creative possibilities in music performance.
Contribution
It introduces a new approach using a trained language model to generate Launchpad lighting effects from music, improving accessibility and visualization quality.
Findings
Outperforms random generation methods in visualization quality
Capable of producing diverse and synchronized lighting effects
Potential for broader music visualization applications
Abstract
Launchpad is a musical instrument that allows users to create and perform music by pressing illuminated buttons. To assist and inspire the design of the Launchpad light effect, and provide a more accessible approach for beginners to create music visualization with this instrument, we proposed the LaunchpadGPT model to generate music visualization designs on Launchpad automatically. Based on the language model with excellent generation ability, our proposed LaunchpadGPT takes an audio piece of music as input and outputs the lighting effects of Launchpad-playing in the form of a video (Launchpad-playing video). We collect Launchpad-playing videos and process them to obtain music and corresponding video frame of Launchpad-playing as prompt-completion pairs, to train the language model. The experiment result shows the proposed method can create better music visualization than random…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Motion and Animation · Music and Audio Processing · Generative Adversarial Networks and Image Synthesis
