GANterpretations

Pablo Samuel Castro

arXiv:2011.05158·cs.SD·November 11, 2020

TL;DR

This paper proposes a method that uses GANs to automatically generate videos to accompany audio recordings, with the visuals aligned to spectral properties of the recording. It gives musicians a new form of multi-modal creative expression and a medium for visual narratives guided by a musical performance.

Contribution

It presents an approach for using GANs to generate audio-guided videos in which the generated visuals are aligned to spectral properties of the recording.

Findings

01. Enables automatic video generation from audio recordings.

02. Facilitates multi-modal creative expression for musicians.

03. Supports visual storytelling aligned with audio performance.

Abstract

Since the introduction of Generative Adversarial Networks (GANs) [Goodfellow et al., 2014] there has been a regular stream of both technical advances (e.g., Arjovsky et al. [2017]) and creative uses of these generative models (e.g., [Karras et al., 2019, Zhu et al., 2017, Jin et al., 2017]). In this work we propose an approach for using the power of GANs to automatically generate videos to accompany audio recordings by aligning to spectral properties of the recording. This allows musicians to explore new forms of multi-modal creative expression, where musical performance can induce an AI-generated musical video that is guided by said performance, as well as a medium for creating a visual narrative to follow a storyline (similar to what was proposed by Frosst and Kereliuk [2019]).
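
The abstract does not spell out how the video is aligned to the recording, so the following is only a minimal sketch of one plausible reading, not the author's implementation. It assumes that spectral change (here, spectral flux) drives how fast a video moves between two GAN latent vectors. The function names `spectral_flux`, `interpolation_weights`, and `latent_path`, the Hann window, and the linear interpolation are all hypothetical choices made for illustration; the GAN itself is left out, and each row of the returned path would be fed to a generator to render one frame.

```python
import numpy as np


def spectral_flux(audio, sample_rate, fps, window=1024):
    """Per-video-frame spectral flux (hypothetical helper, not from the paper).

    Returns one value per video frame measuring how much the magnitude
    spectrum grew relative to the previous frame.
    """
    hop = sample_rate // fps  # audio samples per video frame
    n_frames = (len(audio) - window) // hop + 1
    taper = np.hanning(window)
    mags = np.stack([
        np.abs(np.fft.rfft(audio[i * hop:i * hop + window] * taper))
        for i in range(n_frames)
    ])
    # Keep only increases in magnitude, summed over frequency bins.
    rise = np.maximum(np.diff(mags, axis=0), 0.0).sum(axis=1)
    # The first frame has no predecessor, so its flux is defined as zero.
    return np.concatenate([[0.0], rise])


def interpolation_weights(flux):
    """Map cumulative flux to non-decreasing weights in [0, 1]."""
    cumulative = np.cumsum(flux)
    total = cumulative[-1]
    if total <= 0:
        # No spectral change at all: fall back to a uniform sweep.
        return np.linspace(0.0, 1.0, len(flux))
    return cumulative / total


def latent_path(z_start, z_end, weights):
    """Linearly interpolate between two latent vectors, one row per frame."""
    w = np.asarray(weights)[:, None]
    return (1.0 - w) * z_start + w * z_end
```

Under these assumptions, quiet or static passages leave the weights flat, so the image holds still, while passages with large spectral change advance the interpolation quickly. Other design choices, such as spherical interpolation or chaining several latent vectors, would fit the same structure.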

Code & Models

Repositories

psc-g/ganterpretation · tf · Official

Taxonomy

Topics: Music and Audio Processing · Music Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis