The Littlewood-Offord Problem for Markov Chains
Shravas Rao

TL;DR
This paper extends classical Littlewood-Offord probability bounds to cases where the signs are generated by Markov chains, incorporating spectral gap factors, and introduces a pseudorandom generator for the problem.
Contribution
It generalizes known Littlewood-Offord bounds to Markov chain sign sequences and develops a pseudorandom generator using these techniques.
Findings
Extended bounds to Markov chain sign sequences with spectral gap dependence
Established bounds for integer-valued vectors with distinct entries
Constructed a pseudorandom generator for the Littlewood-Offord problem
Abstract
The celebrated Littlewood-Offord problem asks for an upper bound on the probability that the random variable lies in the Euclidean unit ball, where are independent Rademacher random variables and are fixed vectors of at least unit length.We extend many known results to the case that the are obtained from a Markov chain, including the general bounds first shown by Erd\H{o}s in the scalar case and Kleitman in the vector case, and also under the restriction that the are distinct integers due to S\'ark\"ozy and Szemeredi. In all extensions, the upper bound includes an extra factor depending on the spectral gap. We also construct a pseudorandom generator for the Littlewood-Offord problem using similar techniques.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities · Markov Chains and Monte Carlo Methods · Random Matrices and Applications
The Littlewood-Offord Problem for Markov Chains
Shravas Rao
Abstract.
The celebrated Littlewood-Offord problem asks for an upper bound on the probability that the random variable lies in the Euclidean unit ball, where are independent Rademacher random variables and are fixed vectors of at least unit length. We extend many known results to the case that the are obtained from a Markov chain, including the general bounds first shown by Erdős in the scalar case and Kleitman in the vector case, and also under the restriction that the are distinct integers due to Sárközy and Szemeredi. In all extensions, the upper bound includes an extra factor depending on the spectral gap. We also construct a pseudorandom generator for the Littlewood-Offord problem using similar techniques.
This material is based upon work supported by the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE-1342536.
1. Introduction
Let be fixed vectors of Euclidean length at least , and let be independent Rademacher random variables, so that for all . The celebrated Littlewood-Offord problem [LO43] asks for an upper bound on the probability,
[TABLE]
for an open Euclidean ball with radius . This question was first investigated by Littlewood and Offord for the case and [LO43]. A tight bound of when is even, with the worst case being when the vectors are equal, was found by Erdős for the case using a clever combinatorial argument [Erd45]. Such bounds can be contrasted with concentration inequalities like the Hoeffding inequality in the scalar case and the Khintchine-Kahane inequality in the vector case, both of which give an upper bound on the probability for positive . In contrast, an upper bound on Eq. (1) can be considered a form of anti-concentration, that is showing that the random sum is unlikely to be in .
In the case that the are -dimensional vectors, a tight bound up to constant factors of was found by Kleitman [Kle70], and was improved by series of work [Sal83, Sal85, FF88, TV12]. In the scalar case, under the restriction that are distinct integers, an upper bound of was found by Sárközy and Szemeredi [SS65].
In this work, we investigate the case in which are not independent, but are obtained from a stationary reversible Markov chain with state space and transition matrix , and functions , using .
Let be the stationary distribution for the Markov chain, and let be the associated averaging operator defined by , so that for , where is the vector whose entries are all . Like many results on Markov chains, our generalizations will be in terms of the quantity
[TABLE]
If the are independent, that is , it follows that . Often, if is small, the corresponding Markov chain behaves almost as if it were independent. In particular, there exists a Berry-Esseen theorem for Markov chains [Man96] and various concentration inequalities for Markov chain [Gil98, Lez98, LP04]. In all of these cases, there is an extra factor in the bounds in terms of which disappears if .
We show that the Littlewood-Offord problem can also be generalized to Markov chains with an extra dependence on , for all dimensions. We additionally consider the one-dimensional case when the scalars are distinct integers. In all cases, the proof is based off a Fourier-analytic argument due to Halász [Hal77].
The random variables in all cases are defined in the same way, which we state below.
Setting 1.1**.**
Let be a stationary reversible Markov chain with state space , transition matrix , stationary probability measure , and averaging operator so that is distributed according to . Let , and let be such that for every . Then consider the random variables .
We obtain the following theorem that upper bounds the probability that the random sum is concentrated on any unit ball. In the case that the are one-dimensional, the bound is tight up to a factor of in . Note that the bound depends on the dimension, while in the independent case, there is no dependence on the dimension.
Theorem 1.2**.**
Assume the setting of 1.1. Let and for some universal constant . For every set of vectors of Euclidean length at least ,
[TABLE]
for some universal constant .
In the one-dimensional case, we also consider the restriction that are distinct integers.
Theorem 1.3**.**
Assume the setting of 1.1. Then for every set of distinct integers and ,
[TABLE]
for some universal constant .
Finally, we consider a different setting, where rather than choosing independently, we choose these uniformly at random from a subset of that we can construct explicitly.
Theorem 1.4**.**
For every , there exists an explicit set of cardinality at most for some universal constant such that the following holds. For every and and chosen uniformly at random from
[TABLE]
for some universal constant independent of .
One interpretation of Theorem 1.4 is that one can obtain similar results as in the Littlewood-Offord problem in one dimension using much less randomness, and in particular, using bits of randomness rather than .
This setting was also considered in [KKL17], in which the authors were able to construct an explicit set of cardinality , from which a random sample satisfies
[TABLE]
for any constant bounded above by . Sampling from the set in Theorem 1.4 guarantees a stronger bound on the probability that the sum lands in any interval, while requiring more randomness when .
1.1. Future Work
It would be interesting to remove the dependence on the dimension in Theorem 1.2, which does not appear in the tightest bounds for independent random variables.
The setting studied by Sárközy and Szemeredi, in which the the are distinct positive integers and the random variables are independent, was the first in a series of work investigating under what conditions Eq. (1) can be bounded more strongly. We call a set a generalized arithmetic progression (GAP) of rank if it can be expressed as
[TABLE]
for some , and . In a series of works starting with [TV09] and improved by [TV10, NV11], it was shown that when , if Eq. (1) is bounded above by for all unit-balls , then the set must be mostly contained in some GAP of rank-, where depends on . It would be interesting to see if such an analogue holds when the random variables are chosen from a Markov chain.
It would also be interesting to improve Theorem 1.4 by constructing explicit sets of cardinality smaller than that achieve similar properties.
2. Preliminaries
Given vectors (typically will be a distribution over ), we define the -norm by
[TABLE]
Additionally, we let the -operator norm of a matrix be defined as
[TABLE]
Finally, we will use in place of when is the vector whose entries are all . Note that in this case, is not a distribution.
For a vector , we let be the diagonal matrix where .
Let be a stochastic matrix, and let be a distribution for which is reversible, that is, . We let be the averaging operator on . Note that is also stochastic and reversible on .
3. The Littlewood-Offord problem for independent random variables
As warm up, we present the bound in the independent case for -dimensional vectors, or scalars. These calculations will be used later in the proofs of Theorems 1.2, 1.3, and 1.4,. This bound was first proved by Erdős [Erd45] who used a clever combinatorial argument that applies Sperner’s theorem. The proof we present is in spirit, due to Halász [Hal77] and is based on techniques from Fourier analysis.
We start by presenting the following concentration inequality due to Esséen [Ess66], which will allow us to upper-bound probabilities. This inequality is in the spirit of Fourier inversion, but written in a way that can be more readily applied for our purposes.
Theorem 3.1** (Esséen concentration inequality).**
Let be a random variable taking a finite number of values. For ,
[TABLE]
The following bound is implicit in the proof of Proposition 7.18 in [TV06] and will be used to further bound the quantities obtained from Theorem 3.1
Claim 3.2**.**
Let be such that for all . Then
[TABLE]
for some constant .
We now prove the bound in the independent case.
Theorem 3.3**.**
Let be non-zero, and let be independent random variables uniform over the set . Then for all ,
[TABLE]
for some constant independent of .
- Proof:
By Theorem 3.1, the left-hand side can be bounded above by
[TABLE]
for some constants and . The first equality follows from the independence of the , the next equality follows from the fact that is uniform over for all , and the subsequent inequality follows from Claim 3.2.
4. The Littlewood-Offord Problem for Random Variables from a Markov chain
Now we consider the case that are obtained from a Markov chain. The proof follows very closely the proof for independent random variables in Proposition 7.18 in [TV06] which itself is due to Halász [Hal77].
In order to handle the extra dependencies from the Markov chain, we will use the following technical lemma, which is a straightforward adaptation of a Lemma from [NRR17]. We include a proof in Appendix A.
Lemma 4.1**.**
Let be an integer, be -dimensional vectors such that , , and . For , let and define to be . Then,
[TABLE]
Before proving Theorem 1.2, we first prove the following that will allow us to upper-bound negative moments of binomial random variables.
Claim 4.2**.**
Let be a binomial random variable with trials, each with success probability . Then for all positive integers ,
[TABLE]
- Proof:
Note that because for all non-negative , the right-hand side is bounded above by , where the term inside the expected value can be written as
[TABLE]
The claim follows by noting that for .
We start by considering the case of -dimensional vectors, or scalars. We also consider the case in which at most one-half of the have length less than . This will allow us to generalize to higher dimensions. We note that in the case of independent random variables the corresponding statement follows from the usual Littlewood-Offord problem, by conditioning on the such that , for just an increase in the constant factor in the bound.
Lemma 4.3**.**
Assume the setting of 1.1. Then for every such that and ,
[TABLE]
for some universal constant .
- Proof:
By Theorem 3.1,
[TABLE]
for some constant . Note that
[TABLE]
Let , let be the vector defined by for , and let . For , let be the set of indices such that , and also includes if and includes if . Then the right-hand side of Eq. (5) is bounded above by
[TABLE]
where the inequality follows by Lemma 4.1 and evaluating .
Let be the set of indices such that is greater than . When , the corresponding product disappears. When , we can apply Claim 3.2. Thus, the right-hand side of Eq. (4) can be bounded above by
[TABLE]
Let be defined as
[TABLE]
so that for all . Let be a random vector from so that for each
[TABLE]
By the definition of and , the right-hand side of Eq. (6) is bounded above by,
[TABLE]
We conclude with the following argument. Let where denotes a binomial random variable with trials, each with success probability . It follows that is dominated by , and thus
[TABLE]
where the second inequality follows by Jensen’s inequality. Finally, by Claim 4.2, the right-hand side of Eq. (7) is bounded above by as desired.
Before proving Theorem 1.2, we prove the following bound on random unit vectors.
Claim 4.4**.**
Let be a random unit vector uniform over the -dimensional sphere. Then there exists a constant such that
[TABLE]
- Proof:
We start by noting that the probability density function of at is proportional to , which is also the probability density of the beta distribution, shifted so that the domain is . The probability density function at all points is bounded above by
[TABLE]
for some constants and , where the inequality follows from Stirling’s approximation (see [Jam15]). The claim follows by letting .
We now use Lemma 4.3 to prove Theorem 1.2 as follows.
- Proof of Theorem 1.2:
Let be a random rotation uniform over the Haar measure of the special orthogonal group. Then it is enough to consider the random variable . Additionally, the left-hand side in the statement of the theorem is bounded above by
[TABLE]
This is because if the absolute value of the first coordinate of the random vector is greater than , so is the Euclidean norm.
By Claim 4.4, for any fixed , it holds that for at least half of the for some constant . By Lemma 4.3, we have that Eq. (8) is bounded above by
[TABLE]
as desired.
Remark 4.5*.*
In the case of one dimension, Theorem 1.2 is tight up to a factor of . To see this, consider the transition matrix on two states defined by
[TABLE]
with and , and stationary distribution uniform over both states. Such a Markov chain can be interpreted as first choosing a state at random, and then at each subsequent step choosing a new state uniformly at random with probability , or switching states with probability . We can associate with this walk a sequence of numbers, obtained as follows. Whenever a state is chosen at random, we add a new entry in the sequence starting at , and increase this entry every time the state is switched. Then conditioned on this sequence, is distributed as where is the number of entries in the sequence that are odd. Thus, if is considered as a random variable,
[TABLE]
If we assume that is large, then the probability that any given step in the walk is the start of a entry that will eventually be of odd length is approximately , and thus, is approximately distributed like , and thus
[TABLE]
5. Extension to distinct ’s
Theorem 3.3, the bound obtained in the independent case, is tight when . It is reasonable to ask if one can obtain better bounds on the probability under certain restrictions of . In particular, when the are distinct integers, Sárközy and Szemeredi [SS65] showed that for all and for some constant
[TABLE]
which is a factor smaller than Theorem 3.3.
Like Erdős’s proof of Theorem 3.3, the proof of the above by Sárközy and Szemeredi uses a clever combinatorial argument. However, Halász’s Fourier-analytic argument can also be used to prove the above. We prove a similar bound in the case of Markov chains.
Our proof is based on the techniques used in [TV06] for the same problem, in which the Fourier-analytic argument is over the group for some large enough , rather than over the integers or over the real numbers. The following claim is implicit in Corollary 7.16 in [TV06] and will be used in our computation.
Claim 5.1**.**
If are distinct positive integers, then there exists a prime such that for all , and
[TABLE]
We use Claim 5.1 to prove Theorem 1.3 which is a Markov chain version of Eq. (9).
- Proof of Theorem 1.3:
Let be the prime in Claim 5.1. Note that by Fourier inversion,
[TABLE]
Let for all , and let be the vector defined by . Then the absolute value of the expectation inside the right-hand side of Eq. (10) is bounded above by
[TABLE]
by Lemma 4.1, where for each , we define to be the set of indices such that , or if or if . Thus by Claim 5.1, we can upper bound on the right-hand side of Eq. (10) by
[TABLE]
where the inequality also holds in the case that .
As in the proof of Theorem 1.2, let be defined as
[TABLE]
so that for all , and let be a random vector from so that for each
[TABLE]
By the definition of , we have
[TABLE]
As before, let . Then because is dominated by ,
[TABLE]
where again the second inequality follows by Jensen’s inequality. Finally, Claim 4.2 can be used to upper-bound the right-hand side of Eq. (11).
6. A Pseudorandom Generator for the Littlewood-Offord Problem
In this section we prove Theorem 1.4. As stated in the introduction, this theorem can be interpreted as proving the existence of a pseudorandom generator for the Littlewood-Offord problem.
We start by describing the construction of . Our construction will be based on expander graphs which we define as follows. Given a -regular graph , let be the normalized adjacency matrix of and let be the matrix whose entries are all . We say that a family of -regular graphs is a family of expanders if for all graphs in the family,
[TABLE]
for some constant bounded away from , where is the vector whose entries are all . Note that when is -regular, the stationary distribution is , and the averaging operator is . Thus, is also the spectral gap of the Markov chain that is a simple random walk on . It is well known that there exist infinite families of expander graphs of constant degree independent of the number of vertices (see for example, [LPS88] and [Mar88]).
Let be a -regular graph from such a family so that for some constant independent of . We let our set be the set of concatenations of the labels of walks of length on , and thus has cardinality for some constant independent of and .
- Proof of Theorem 1.4:
Let be the uniform measure on and let be as defined above. Then by Theorem 3.1,
[TABLE]
For each , let and let be the vector defined by
[TABLE]
and let . Then is bounded above by,
[TABLE]
where the inequality follows by Lemma 4.1, and for each , we define to be the set of indices such that , or if or if .
Note that is the Fourier transform at of the random variable where each coordinate of is uniformly random over the set . This brings us back to the original setting of completely independent random variables, and by Eq. (2), it follows that
[TABLE]
Thus by inserting the above in Eq. (13) we obtain and upper-bound on the right-hand side of Eq. (12) of
[TABLE]
where the inequality follows from Claim 3.2, We proceed by using the same argument as in Lemma 4.3 starting from Eq. (6), which gives an upper bound of as desired. Finally, we obtain a construction of the desired size by letting .
Appendix A Proof of Lemma 4.1
We prove Lemma 4.1, which as mentioned previously, is a straightforward adaptation of the proof of a Lemma from [NRR17]. Before getting to the proof, we first state the following two claims.
Claim A.1**.**
For all , matrices , and distributions over ,
[TABLE]
- Proof:
Notice that for any vector , . The claim follows by noting that and by induction.
Claim A.2**.**
Let be so that for all and , let , and let . Then,
[TABLE]
- Proof:
By Jensen’s inequality, the right-hand side is bounded above by
[TABLE]
and the claim follows by the definition of operator norm and the fact that .
Claim A.3**.**
Let be a distribution, and let be the associated averaging operator. Then for any ,
[TABLE]
- Proof:
[TABLE]
- Proof of Lemma 4.1:
For , let and . Let if , and otherwise. Then using the triangle inequality, the left-hand side of (4.1) is at most
[TABLE]
where the equality follows from Claim A.3 and the fact that .
Fix an and let be the indices for which the th coordinate of is [math]. Then by Claim A.1,
[TABLE]
The claim now follows by applying Claim A.2.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Cro 11] E. Croot. Fourier proof of the classical littlewood-offord inequality, 2011.
- 2[Erd 45] P. Erdös. On a lemma of Littlewood and Offord. Bull. Amer. Math. Soc. , 51:898–902, 1945. ISSN 0002-9904.
- 3[Ess 66] C. G. Esseen. On the Kolmogorov-Rogozin inequality for the concentration function. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete , 5:210–216, 1966. doi: 10.1007/BF 00533057 .
- 4[FF 88] P. Frankl and Z. Füredi. Solution of the Littlewood-Offord problem in high dimensions. Ann. of Math. (2) , 128(2):259–270, 1988. ISSN 0003-486X.
- 5[Gil 98] D. Gillman. A Chernoff bound for random walks on expander graphs. SIAM J. Comput. , 27(4):1203–1220, 1998. ISSN 0097-5397. doi: 10.1137/S 0097539794268765 .
- 6[Hal 77] G. Halász. Estimates for the concentration function of combinatorial number theory and probability. Period. Math. Hungar. , 8(3-4):197–211, 1977. ISSN 0031-5303.
- 7[Jam 15] G. J. O. Jameson. A simple proof of Stirling’s formula for the gamma function. Math. Gaz. , 99(544):68–74, 2015. ISSN 0025-5572. doi: 10.1017/mag.2014.9 .
- 8[KKL 17] V. Kabanets, D. M. Kane, and Z. Lu. A polynomial restriction lemma with applications. In STOC’17—Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing , pages 615–628. ACM, New York, 2017.
