Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders
Jiaming Chu, Tao Wang, Lei Jin

TL;DR
This paper reveals that VAE encoders distilled at low resolutions can generalize to higher resolutions, improving high-resolution image reconstruction without training on high-res data, by learning resolution-consistent latent manifolds.
Contribution
It uncovers a counter-intuitive phenomenon where low-resolution distilled VAEs perform better at higher resolutions and analyzes the underlying latent space behavior.
Findings
Distilled VAEs at low resolution generalize to higher resolutions.
Resolution remapping improves reconstruction metrics significantly.
Latent distributions align more closely with the teacher at higher resolutions.
Abstract
Variational Autoencoder (VAE) encoders play a critical role in modern generative models, yet their computational cost often motivates the use of knowledge distillation or quantification to obtain compact alternatives. Existing studies typically believe that the model work better on the samples closed to their training data distribution than unseen data distribution. In this work, we report a counter-intuitive phenomenon in VAE encoder distillation: a compact encoder distilled only at low resolutions exhibits poor reconstruction performance at its native resolution, but achieves dramatically improved results when evaluated at higher, unseen input resolutions. Despite never being trained beyond resolution, the distilled encoder generalizes effectively to resolution inputs, partially inheriting the teacher model's resolution preference.We further analyze latent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Domain Adaptation and Few-Shot Learning
