Tail Annealing for Heavy-Tailed Flow Matching

Jean Pachebat

arXiv:2605.20068·stat.ML·May 20, 2026

Tail Annealing for Heavy-Tailed Flow Matching

Jean Pachebat

PDF

TL;DR

This paper introduces a simple data transformation technique called tail annealing, which enables flow models to better handle heavy-tailed data by compressing tails into a manageable range, improving performance on benchmarks.

Contribution

The authors propose a novel tail annealing method using a soft-log transform combined with a Hill diagnostic, enhancing flow matching for heavy-tailed distributions without complex architectural changes.

Findings

01

Log-FM outperforms specialized baselines on heavy-tailed benchmarks.

02

Zero severe divergences observed across extensive experiments.

03

Transform effectively maps Pareto tails to exponential, aiding flow modeling.

Abstract

Standard generative models struggle with heavy-tailed data: Lipschitz architectures cannot produce power-law tails from Gaussian noise, and interpolating between heavy-tailed data and Gaussians is ill-posed. We propose a simple fix: apply the soft-log transform $ϕ (x) = sign (x) \cdot lo g (1 + ∣ x ∣)$ coordinate-wise to data before training, then exponentiate samples after generation. A Hill diagnostic decides per-coordinate whether to transform, leaving light-tailed margins untouched at no added complexity. This compresses heavy tails into a range where standard flow matching succeeds, without heavy-tailed base distributions or architectural modifications. We provide theoretical intuition for why this works: the log-transform maps Pareto tails to exponentials, and the induced dynamics implement a form of tail annealing via power transformations. On a 144-configuration…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.