On tuning consistent annealed sampling for denoising score matching
Joan Serr\`a, Santiago Pascual, Jordi Pons

TL;DR
This paper analyzes and improves the hyper-parameter tuning of consistent annealed sampling in score-based generative models, enhancing sampling efficiency and flexibility for image and audio synthesis.
Contribution
It provides a detailed overview of sampling schemes, studies hyper-parameter boundaries, and proposes a formulation that simplifies tuning with fewer or variable steps.
Findings
Hyper-parameter boundaries for consistent annealed sampling are characterized.
A new formulation facilitates hyper-parameter tuning with fewer steps.
Connections between sampling schemes are elucidated.
Abstract
Score-based generative models provide state-of-the-art quality for image and audio synthesis. Sampling from these models is performed iteratively, typically employing a discretized series of noise levels and a predefined scheme. In this note, we first overview three common sampling schemes for models trained with denoising score matching. Next, we focus on one of them, consistent annealed sampling, and study its hyper-parameter boundaries. We then highlight a possible formulation of such hyper-parameter that explicitly considers those boundaries and facilitates tuning when using few or a variable number of steps. Finally, we highlight some connections of the formulation with other sampling schemes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks
