
TL;DR
This paper introduces a simple data transformation technique called tail annealing, which enables flow models to better handle heavy-tailed data by compressing tails into a manageable range, improving performance on benchmarks.
Contribution
The authors propose a novel tail annealing method using a soft-log transform combined with a Hill diagnostic, enhancing flow matching for heavy-tailed distributions without complex architectural changes.
Findings
Log-FM outperforms specialized baselines on heavy-tailed benchmarks.
Zero severe divergences observed across extensive experiments.
Transform effectively maps Pareto tails to exponential, aiding flow modeling.
Abstract
Standard generative models struggle with heavy-tailed data: Lipschitz architectures cannot produce power-law tails from Gaussian noise, and interpolating between heavy-tailed data and Gaussians is ill-posed. We propose a simple fix: apply the soft-log transform coordinate-wise to data before training, then exponentiate samples after generation. A Hill diagnostic decides per-coordinate whether to transform, leaving light-tailed margins untouched at no added complexity. This compresses heavy tails into a range where standard flow matching succeeds, without heavy-tailed base distributions or architectural modifications. We provide theoretical intuition for why this works: the log-transform maps Pareto tails to exponentials, and the induced dynamics implement a form of tail annealing via power transformations. On a 144-configuration…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
