Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models
Abdelhakim Ziani (MICS), Andras Horvath (UNITO), Paolo Ballarini (MICS)

TL;DR
This paper identifies a fundamental limitation of Lipschitz-constrained VAEs in modeling heavy-tailed distributions and proposes replacing the Gaussian decoder with a Markov chain-based Phase-Type distribution to effectively generate heavy tails.
Contribution
The paper introduces a novel decoder based on Markov chain Phase-Type distributions, overcoming the heavy-tail limitations of traditional Lipschitz generative models.
Findings
PH-based model reduces tail Kolmogorov-Smirnov distance by up to 6 times
Extreme quantile error decreases by up to 10 times with PH decoder
Theoretical and empirical evidence shows improved heavy-tail modeling
Abstract
Heavy-tailed distributions are prevalent in performance evaluation, network traffic, and risk modeling. This behavior poses a fundamental challenge for modern deep generative models. Standard Variational Autoencoders (VAEs) employ Gaussian decoder likelihoods and Lipschitz-constrained neural networks, a combination that is structurally incapable of producing heavy-tailed outputs: the Gaussian tail decays exponentially, and Lipschitz continuity prevents the decoder from amplifying rare events from the latent space input to sufficiently overcome this decay. We provide both a theoretical characterization of this limitation and a controlled empirical demonstration using synthetic Pareto data across a grid of tail indices {2, 3, 5, 30} and dimensions d {1, 5, 10}. As a solution, we replace the Gaussian decoder with a Phase-Type (PH) distribution based on Markov chains,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
