Wasserstein-p Central Limit Theorem Rates: From Local Dependence to Markov Chains
Yixuan Zhang, Qiaomin Xie

TL;DR
This paper establishes optimal non-asymptotic Wasserstein-$p$ CLT rates for dependent data, including locally dependent sequences and Markov chains, improving previous bounds and enabling new applications.
Contribution
It provides the first optimal $W_1$ CLT rate for locally dependent sequences and multivariate U-statistics, and introduces new technical tools for dependent data analysis.
Findings
Achieved $ ext{O}(n^{-1/2})$ rate in $W_1$ for dependent sequences.
Established $W_p$ CLT rates for $p eq 1$ under mild moments.
Proved geometric tail bounds for Markov chain regeneration times.
Abstract
Non-asymptotic central limit theorem (CLT) rates play a central role in modern machine learning and operations research. In this paper, we study CLT rates for multivariate dependent data in Wasserstein- () distance, for general . We focus on two fundamental dependence structures that commonly arise in practice: locally dependent sequences and geometrically ergodic Markov chains. In both settings, we establish the first optimal rate in , as well as the first () CLT rates under mild moment assumptions, substantially improving the best previously known bounds in these dependent-data regimes. As an application of our optimal rate for locally dependent sequences, we further obtain the first optimal -CLT rate for multivariate -statistics. On the technical side, we derive a tractable auxiliary bound for Gaussian…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
