Example-Based Framework for Perceptually Guided Audio Texture Generation
Purnima Kamath, Chitralekha Gupta, Lonce Wyse, Suranga Nanayakkara

TL;DR
This paper introduces an example-based method to control audio texture generation using StyleGAN without labeled datasets, enabling semantic attribute manipulation and transfer in generated audio textures.
Contribution
It presents a novel framework that infers guidance vectors for semantic control in unconditionally trained StyleGANs using minimal synthetic examples.
Findings
Successfully controls semantic attributes in audio textures
Enables semantic attribute transfer between audio samples
Works without large labeled datasets
Abstract
Controllable generation using StyleGANs is usually achieved by training the model using labeled data. For audio textures, however, there is currently a lack of large semantically labeled datasets. Therefore, to control generation, we develop a method for semantic control over an unconditionally trained StyleGAN in the absence of such labeled datasets. In this paper, we propose an example-based framework to determine guidance vectors for audio texture generation based on user-defined semantic attributes. Our approach leverages the semantically disentangled latent space of an unconditionally trained StyleGAN. By using a few synthetic examples to indicate the presence or absence of a semantic attribute, we infer the guidance vectors in the latent space of the StyleGAN to control that attribute during generation. Our results show that our framework can find user-defined and perceptually…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Music Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis
MethodsAdaptive Instance Normalization · Dense Connections · HuMan(Expedia)||How do I get a human at Expedia? · R1 Regularization · Feedforward Network · Convolution · StyleGAN
