Distilling Normalizing Flows
Steven Walton, Valeriy Klyukin, Maksim Artemev, Denis Derkach, Nikita Orlov, Humphrey Shi

TL;DR
This paper introduces novel knowledge distillation techniques for normalizing flows, enabling smaller models to achieve higher sampling quality and better density estimation, thus improving efficiency and performance.
Contribution
It presents new knowledge distillation methods tailored for compositional normalizing flows, enhancing their sampling quality and density estimation capabilities.
Findings
Smaller student flows outperform non-distilled models.
Distillation increases throughput proportional to model size.
Knowledge transfer within intermediate layers is effective.
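The last finding, on transferring knowledge at intermediate layers, can be illustrated with a toy example. What follows is only a minimal sketch under stated assumptions, not the paper's actual loss: the scalar "layers", the fixed-stride alignment between teacher and student depths, the squared-error penalty, and every name in the code (`run_flow`, `intermediate_distillation_loss`, `teacher`, `good_student`, `poor_student`) are hypothetical choices made for illustration.

```python
def run_flow(layers, x):
    """Apply each layer in turn and record every intermediate output."""
    outputs = []
    for layer in layers:
        x = layer(x)
        outputs.append(x)
    return outputs


def intermediate_distillation_loss(teacher_layers, student_layers, batch):
    """Mean squared error between student and teacher intermediate outputs.

    Assumption: the teacher depth is a multiple of the student depth, so
    student layer i is compared with the teacher output after
    (i + 1) * stride teacher layers.
    """
    assert len(teacher_layers) % len(student_layers) == 0
    stride = len(teacher_layers) // len(student_layers)
    total, count = 0.0, 0
    for x in batch:
        teacher_out = run_flow(teacher_layers, x)
        student_out = run_flow(student_layers, x)
        for i, s in enumerate(student_out):
            t = teacher_out[(i + 1) * stride - 1]
            total += (s - t) ** 2
            count += 1
    return total / count


# Hypothetical 4-layer teacher built from scalar bijections.
teacher = [
    lambda x: 2.0 * x,
    lambda x: x + 1.0,
    lambda x: 0.5 * x,
    lambda x: x - 3.0,
]
# A 2-layer student whose layers each reproduce two teacher layers.
good_student = [lambda x: 2.0 * x + 1.0, lambda x: 0.5 * x - 3.0]
# A 2-layer student that ignores the teacher's shifts.
poor_student = [lambda x: 2.0 * x, lambda x: 0.5 * x]

batch = [0.0, 1.0, -2.0]
good_loss = intermediate_distillation_loss(teacher, good_student, batch)
poor_loss = intermediate_distillation_loss(teacher, poor_student, batch)
```

In this toy setup the student that matches the teacher at each aligned depth incurs zero loss, while the mismatched student is penalized at both checkpoints. In a real flow the layers would be learned, multi-dimensional bijections and the loss would be minimized by gradient descent.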
Abstract
Explicit density learners are becoming an increasingly popular technique for generative models because of their ability to better model probability distributions. They have advantages over Generative Adversarial Networks due to their ability to perform density estimation and exact latent-variable inference. These properties bring many benefits, including the ability to simply interpolate, calculate sample likelihood, and analyze the probability distribution. The downside of these models is that they are often more difficult to train and have lower sampling quality. Normalizing flows are explicit density models that use composable bijective functions to turn an intractable probability function into a tractable one. In this work, we present novel knowledge distillation techniques to increase sampling quality and density estimation of smaller student normalizing flows. We seek to study the…
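To make the abstract's description of normalizing flows concrete, here is a minimal sketch of the standard change-of-variables rule, log p_x(x) = log p_z(f(x)) + log|det J_f(x)|, applied to a composition of bijections. It is an illustration under assumptions, not the paper's architecture: the one-dimensional `Affine` layer, the standard normal base density, and all function names are hypothetical.

```python
import math


class Affine:
    """Hypothetical 1-D bijection z = a * x + b, with a != 0."""

    def __init__(self, a, b):
        self.a, self.b = a, b

    def forward(self, x):
        # Return the transformed value and log|dz/dx| for this layer.
        return self.a * x + self.b, math.log(abs(self.a))

    def inverse(self, z):
        return (z - self.b) / self.a


def standard_normal_logpdf(z):
    """Log density of the assumed base distribution N(0, 1)."""
    return -0.5 * (z * z + math.log(2.0 * math.pi))


def flow_log_prob(layers, x):
    """Exact log density of x: base log density plus summed log-dets."""
    log_det = 0.0
    for layer in layers:
        x, ld = layer.forward(x)
        log_det += ld
    return standard_normal_logpdf(x) + log_det


def flow_decode(layers, z):
    """Map a latent value back to data space by inverting layers in reverse."""
    for layer in reversed(layers):
        z = layer.inverse(z)
    return z


# Composite map: z = 0.5 * (2x + 1) - 3 = x - 2.5, so the log-dets cancel.
layers = [Affine(2.0, 1.0), Affine(0.5, -3.0)]
log_p = flow_log_prob(layers, 2.5)
x_from_latent = flow_decode(layers, 0.0)
```

Because every layer is invertible and reports its own log-determinant, both the exact likelihood (`flow_log_prob`) and exact latent inference or sampling (`flow_decode`) follow from simple composition, which is the tractability property the abstract refers to.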
Taxonomy
Topics: Generative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Statistics Education and Methodologies
Methods: Normalizing Flows · Knowledge Distillation
