PCA-VAE: Differentiable Subspace Quantization without Codebook Collapse

Hao Lu; Onur C. Koyun; Yongxin Guo; Zhengjie Zhu; Abbas Alili; Metin Nafi Gurcan

arXiv:2602.18904·cs.LG·February 24, 2026

PCA-VAE: Differentiable Subspace Quantization without Codebook Collapse

Hao Lu, Onur C. Koyun, Yongxin Guo, Zhengjie Zhu, Abbas Alili, Metin Nafi Gurcan

PDF

Open Access

TL;DR

PCA-VAE introduces a fully differentiable, PCA-based latent quantization method that surpasses traditional VQ in quality, interpretability, and efficiency, avoiding codebook collapse and complex regularizations.

Contribution

It presents a simple, mathematically grounded PCA-based alternative to vector quantization for autoencoders, improving stability and interpretability without codebooks or regularization.

Findings

01

Outperforms VQ-GAN and SimVQ in reconstruction quality on CelebAHQ.

02

Uses 10-100x fewer latent bits than traditional VQ methods.

03

Produces interpretable latent dimensions such as pose and lighting.

Abstract

Vector-quantized autoencoders deliver high-fidelity latents but suffer inherent flaws: the quantizer is non-differentiable, requires straight-through hacks, and is prone to collapse. We address these issues at the root by replacing VQ with a simple, principled, and fully differentiable alternative: an online PCA bottleneck trained via Oja's rule. The resulting model, PCA-VAE, learns an orthogonal, variance-ordered latent basis without codebooks, commitment losses, or lookup noise. Despite its simplicity, PCA-VAE exceeds VQ-GAN and SimVQ in reconstruction quality on CelebAHQ while using 10-100x fewer latent bits. It also produces naturally interpretable dimensions (e.g., pose, lighting, gender cues) without adversarial regularization or disentanglement objectives. These results suggest that PCA is a viable replacement for VQ: mathematically grounded, stable, bit-efficient, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications