Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders

Jiaming Chu; Tao Wang; Lei Jin

arXiv:2603.14536·cs.CV·March 17, 2026

Distilling Latent Manifolds: Resolution Extrapolation by Variational Autoencoders

Jiaming Chu, Tao Wang, Lei Jin

PDF

Open Access

TL;DR

This paper reveals that VAE encoders distilled at low resolutions can generalize to higher resolutions, improving high-resolution image reconstruction without training on high-res data, by learning resolution-consistent latent manifolds.

Contribution

It uncovers a counter-intuitive phenomenon where low-resolution distilled VAEs perform better at higher resolutions and analyzes the underlying latent space behavior.

Findings

01

Distilled VAEs at low resolution generalize to higher resolutions.

02

Resolution remapping improves reconstruction metrics significantly.

03

Latent distributions align more closely with the teacher at higher resolutions.

Abstract

Variational Autoencoder (VAE) encoders play a critical role in modern generative models, yet their computational cost often motivates the use of knowledge distillation or quantification to obtain compact alternatives. Existing studies typically believe that the model work better on the samples closed to their training data distribution than unseen data distribution. In this work, we report a counter-intuitive phenomenon in VAE encoder distillation: a compact encoder distilled only at low resolutions exhibits poor reconstruction performance at its native resolution, but achieves dramatically improved results when evaluated at higher, unseen input resolutions. Despite never being trained beyond $25 6^{2}$ resolution, the distilled encoder generalizes effectively to $51 2^{2}$ resolution inputs, partially inheriting the teacher model's resolution preference.We further analyze latent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Domain Adaptation and Few-Shot Learning