Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models

Abdelhakim Ziani (MICS); Andras Horvath (UNITO); Paolo Ballarini (MICS)

arXiv:2605.18931·stat.ML·May 20, 2026

Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models

Abdelhakim Ziani (MICS), Andras Horvath (UNITO), Paolo Ballarini (MICS)

PDF

TL;DR

This paper identifies a fundamental limitation of Lipschitz-constrained VAEs in modeling heavy-tailed distributions and proposes replacing the Gaussian decoder with a Markov chain-based Phase-Type distribution to effectively generate heavy tails.

Contribution

The paper introduces a novel decoder based on Markov chain Phase-Type distributions, overcoming the heavy-tail limitations of traditional Lipschitz generative models.

Findings

01

PH-based model reduces tail Kolmogorov-Smirnov distance by up to 6 times

02

Extreme quantile error decreases by up to 10 times with PH decoder

03

Theoretical and empirical evidence shows improved heavy-tail modeling

Abstract

Heavy-tailed distributions are prevalent in performance evaluation, network traffic, and risk modeling. This behavior poses a fundamental challenge for modern deep generative models. Standard Variational Autoencoders (VAEs) employ Gaussian decoder likelihoods and Lipschitz-constrained neural networks, a combination that is structurally incapable of producing heavy-tailed outputs: the Gaussian tail decays exponentially, and Lipschitz continuity prevents the decoder from amplifying rare events from the latent space input to sufficiently overcome this decay. We provide both a theoretical characterization of this limitation and a controlled empirical demonstration using synthetic Pareto data across a grid of tail indices $α$ $\in$ {2, 3, 5, 30} and dimensions d $\in$ {1, 5, 10}. As a solution, we replace the Gaussian decoder with a Phase-Type (PH) distribution based on Markov chains,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.