On the Joint Minimization of Regularization Loss Functions in Deep Variational Bayesian Methods for Attribute-Controlled Symbolic Music Generation
Matteo Petten\'o, Alessandro Ilic Mezza, Alberto Bernardini

TL;DR
This paper investigates balancing regularization loss functions in deep variational Bayesian models to improve attribute-controlled symbolic music generation, addressing challenges in controllability and latent space regularization.
Contribution
It introduces attribute transformations to better jointly minimize regularization objectives, enhancing controllability and regularization in music generation models.
Findings
Existing approaches struggle to jointly minimize regularization objectives.
Suitable attribute transformations improve controllability and regularization.
Balancing KLD and AR is crucial for effective attribute control.
Abstract
Explicit latent variable models provide a flexible yet powerful framework for data synthesis, enabling controlled manipulation of generative factors. With latent variables drawn from a tractable probability density function that can be further constrained, these models enable continuous and semantically rich exploration of the output space by navigating their latent spaces. Structured latent representations are typically obtained through the joint minimization of regularization loss functions. In variational information bottleneck models, reconstruction loss and Kullback-Leibler Divergence (KLD) are often linearly combined with an auxiliary Attribute-Regularization (AR) loss. However, balancing KLD and AR turns out to be a very delicate matter. When KLD dominates over AR, generative models tend to lack controllability; when AR dominates over KLD, the stochastic encoder is encouraged to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic Technology and Sound Studies · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks
