Mixing of the No-U-Turn Sampler and the Geometry of Gaussian Concentration
Nawaf Bou-Rabee, Stefan Oberd\"orster

TL;DR
This paper proves that the No-U-Turn Sampler mixes rapidly in high-dimensional Gaussian settings, with mixing time scaling as the quarter power of the dimension, leveraging geometric measure concentration and addressing path length adaptation issues.
Contribution
It establishes a theoretical mixing time bound for NUTS in Gaussian concentration regions and analyzes its geometric and acceptance properties, revealing a new issue with path length adaptation.
Findings
NUTS mixes in $d^{1/4}$ time in high dimensions.
Concentration of measure leads to uniformity in NUTS transitions.
Identifies and addresses a looping issue in NUTS' path length adaptation.
Abstract
We prove that the mixing time of the No-U-Turn Sampler (NUTS), when initialized in the concentration region of the canonical Gaussian measure, scales as , up to logarithmic factors, where is the dimension. This scaling is expected to be sharp. This result is based on a coupling argument that leverages the geometric structure of the target distribution. Specifically, concentration of measure results in a striking uniformity in NUTS' locally adapted transitions, which holds with high probability. This uniformity is formalized by interpreting NUTS as an accept/reject Markov chain, where the mixing properties for the more uniform accept chain are analytically tractable. Additionally, our analysis uncovers a previously unnoticed issue with the path length adaptation procedure of NUTS, specifically related to looping behavior, which we address in detail.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Object Detection Techniques
