Balancing Gaussian vectors in high dimension
Paxton Turner, Raghu Meka, Philippe Rigollet

TL;DR
This paper investigates the discrepancy of high-dimensional Gaussian matrices, providing sharp probabilistic bounds and developing an efficient algorithm for low-discrepancy matrix balancing in high dimensions.
Contribution
It offers the first sharp probabilistic bounds for Gaussian discrepancy in high dimensions and introduces a polynomial-time algorithm for low discrepancy in large random matrices.
Findings
Discrepancy of Gaussian matrices is $ heta(\sqrt{n} 2^{-n/m})$ with high probability.
An efficient randomized algorithm achieves discrepancy $e^{- ext{polylog}(n)/m}$ for matrices with i.i.d. Lipschitz continuous entries.
The algorithm matches known guarantees in one dimension and improves bounds in higher dimensions.
Abstract
Motivated by problems in controlled experiments, we study the discrepancy of random matrices with continuous entries where the number of columns is much larger than the number of rows . Our first result shows that if , a matrix with i.i.d. standard Gaussian entries has discrepancy with high probability. This provides sharp guarantees for Gaussian discrepancy in a regime that had not been considered before in the existing literature. Our results also apply to a more general family of random matrices with continuous i.i.d entries, assuming that . The proof is non-constructive and is an application of the second moment method. Our second result is algorithmic and applies to random matrices whose entries are i.i.d. and have a Lipschitz density. We present a randomized polynomial-time algorithm that achieves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Distributed and Parallel Computing Systems · Algorithms and Data Compression
