Fast Mixing of Data Augmentation Algorithms: Bayesian Probit, Logit, and Lasso Regression
Holden Lee, Kexin Zhang

TL;DR
This paper establishes polynomial upper bounds on the mixing times of data augmentation algorithms for Bayesian Probit, Logit, and Lasso regression, providing theoretical guarantees for their convergence rates.
Contribution
It introduces a modified conductance-based method to analyze mixing times, deriving the first non-asymptotic bounds for these important Bayesian DA algorithms.
Findings
ProbitDA and LogitDA require O(n log(log η/ε)) steps for convergence.
LassoDA requires O(d^2(d log d + n log n)^2 log(η/ε)) steps.
Results apply to large n and d, including imbalanced data scenarios.
Abstract
We propose using a modified conductance-based method to study the mixing time of an important class of two-block Gibbs samplers, the data augmentation (DA) algorithm. %, which is of prominent interest in both theoretical and empirical research. Using this method, we prove the first non-asymptotic polynomial upper bounds on mixing times of three important DA algorithms: DA algorithms for Bayesian Probit regression (Albert and Chib, 1993, ProbitDA) and Bayesian Logit regression (Polson, Scott, and Windle, 2013, LogitDA), and Bayesian Lasso Regression (Park and Casella, 2008, Rajaratnam et al., 2015, LassoDA). Concretely, for ProbitDA and LogitDA, we demonstrate a tight bound that explicitly depends on the design matrix and prior covariance matrix. Under the assumption that data are independently generated from either a sub-Gaussian or log-concave distribution and properly scaled, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
