The Littlewood-Offord Problem for Markov Chains

Shravas Rao

arXiv:1904.13019·math.CO·May 1, 2019

The Littlewood-Offord Problem for Markov Chains

Shravas Rao

PDF

Open Access

TL;DR

This paper extends classical Littlewood-Offord probability bounds to cases where the signs are generated by Markov chains, incorporating spectral gap factors, and introduces a pseudorandom generator for the problem.

Contribution

It generalizes known Littlewood-Offord bounds to Markov chain sign sequences and develops a pseudorandom generator using these techniques.

Findings

01

Extended bounds to Markov chain sign sequences with spectral gap dependence

02

Established bounds for integer-valued vectors with distinct entries

03

Constructed a pseudorandom generator for the Littlewood-Offord problem

Abstract

The celebrated Littlewood-Offord problem asks for an upper bound on the probability that the random variable $ϵ_{1} v_{1} + \dots + ϵ_{n} v_{n}$ lies in the Euclidean unit ball, where $ϵ_{1}, \dots, ϵ_{n} \in {- 1, 1}$ are independent Rademacher random variables and $v_{1}, \dots, v_{n} \in R^{d}$ are fixed vectors of at least unit length.We extend many known results to the case that the $ϵ_{i}$ are obtained from a Markov chain, including the general bounds first shown by Erd\H{o}s in the scalar case and Kleitman in the vector case, and also under the restriction that the $v_{i}$ are distinct integers due to S\'ark\"ozy and Szemeredi. In all extensions, the upper bound includes an extra factor depending on the spectral gap. We also construct a pseudorandom generator for the Littlewood-Offord problem using similar techniques.

Equations115

\mbox Pr [ε_{1} v_{1} + \dots + ε_{n} v_{n} \in B]

\mbox Pr [ε_{1} v_{1} + \dots + ε_{n} v_{n} \in B]

λ = ∥ A - E_{μ} ∥_{L_{2} (μ) \to L_{2} (μ)} .

λ = ∥ A - E_{μ} ∥_{L_{2} (μ) \to L_{2} (μ)} .

\mbox Pr [∥ f_{1} (Y_{1}) v_{1} + f_{2} (Y_{2}) v_{2} + \dots + f_{n} (Y_{n}) v_{n} - x_{0} ∥_{ℓ_{2}} \leq R] \leq \frac{C \cdot R d}{( 1 - λ ) n} .

\mbox Pr [∥ f_{1} (Y_{1}) v_{1} + f_{2} (Y_{2}) v_{2} + \dots + f_{n} (Y_{n}) v_{n} - x_{0} ∥_{ℓ_{2}} \leq R] \leq \frac{C \cdot R d}{( 1 - λ ) n} .

\mbox Pr [f_{1} (Y_{1}) v_{1} + f_{2} (Y_{2}) v_{2} + \dots + f_{n} (Y_{n}) v_{n} = x_{0}] \leq \frac{C}{( 1 - λ ) ^{3} n ^{3/2}}

\mbox Pr [f_{1} (Y_{1}) v_{1} + f_{2} (Y_{2}) v_{2} + \dots + f_{n} (Y_{n}) v_{n} = x_{0}] \leq \frac{C}{( 1 - λ ) ^{3} n ^{3/2}}

\mbox Pr [∣ ε_{1} v_{1} + ε_{2} v_{2} + \dots + ε_{n} v_{n} - x_{0} ∣ \leq 1] \leq \frac{C}{n} .

\mbox Pr [∣ ε_{1} v_{1} + ε_{2} v_{2} + \dots + ε_{n} v_{n} - x_{0} ∣ \leq 1] \leq \frac{C}{n} .

\mbox Pr [f_{1} (Y_{1}) v_{1} + f_{2} (Y_{2}) v_{2} + \dots + f_{n} (Y_{n}) v_{n} = x_{0}] \leq \frac{lo g ( n ) ^{C_{1} / c}}{n} .

\mbox Pr [f_{1} (Y_{1}) v_{1} + f_{2} (Y_{2}) v_{2} + \dots + f_{n} (Y_{n}) v_{n} = x_{0}] \leq \frac{lo g ( n ) ^{C_{1} / c}}{n} .

Q = {v_{0} + x_{1} v_{1} + x_{2} v_{2} + \dots + x_{r} v_{r} : x_{i} \in Z, M_{i} \leq x_{i} \leq M_{i}^{'}}

Q = {v_{0} + x_{1} v_{1} + x_{2} v_{2} + \dots + x_{r} v_{r} : x_{i} \in Z, M_{i} \leq x_{i} \leq M_{i}^{'}}

∥ v ∥_{L_{p} (μ)}^{p} = i = 1 \sum N ∣ v_{i} ∣^{p} μ_{i} .

∥ v ∥_{L_{p} (μ)}^{p} = i = 1 \sum N ∣ v_{i} ∣^{p} μ_{i} .

∥ A ∥_{L_{p} (μ) \to L_{q} (μ)} = v : ∥ v ∥_{L_{p} (μ)} = 1 max ∥ A v ∥_{L_{q} (μ)} .

∥ A ∥_{L_{p} (μ) \to L_{q} (μ)} = v : ∥ v ∥_{L_{p} (μ)} = 1 max ∥ A v ∥_{L_{q} (μ)} .

x_{0} \in R^{d} sup \mbox Pr [∥ X - x_{0} ∥_{ℓ_{2}} \leq R] = O (\frac{R}{d} + \frac{d}{ε})^{d} \int_{ξ \in R^{d} : ∥ ξ ∥_{ℓ_{2}} \leq ε} ∣ E [exp (2 π i ⟨ ξ, X ⟩)] ∣ d ξ .

x_{0} \in R^{d} sup \mbox Pr [∥ X - x_{0} ∥_{ℓ_{2}} \leq R] = O (\frac{R}{d} + \frac{d}{ε})^{d} \int_{ξ \in R^{d} : ∥ ξ ∥_{ℓ_{2}} \leq ε} ∣ E [exp (2 π i ⟨ ξ, X ⟩)] ∣ d ξ .

\int_{- 1}^{1} j \in k \prod ∣ cos (2 π ξ v_{j}) ∣ d ξ \leq \frac{C}{∣ k ∣},

\int_{- 1}^{1} j \in k \prod ∣ cos (2 π ξ v_{j}) ∣ d ξ \leq \frac{C}{∣ k ∣},

\mbox Pr [∣ ε_{1} v_{1} + \dots + ε_{n} v_{n} - x_{0} ∣ \leq 1] \leq \frac{C}{n} .

\mbox Pr [∣ ε_{1} v_{1} + \dots + ε_{n} v_{n} - x_{0} ∣ \leq 1] \leq \frac{C}{n} .

C_{1} \int_{- 1}^{1} ∣ E [exp (2 π i ξ (ε_{1} v_{1} + \dots + ε_{n} v_{n}))] ∣ d ξ

C_{1} \int_{- 1}^{1} ∣ E [exp (2 π i ξ (ε_{1} v_{1} + \dots + ε_{n} v_{n}))] ∣ d ξ

= C_{1} \int_{- 1}^{1} j = 1 \prod n ∣ cos (2 π ξ v_{j}) ∣ d ξ

\leq \frac{C _{2}}{n}

∥ U_{1} (T_{1} + (1 - λ) E_{μ}) U_{2} (T_{2} + (1 - λ) E_{μ}) U_{3} \dots U_{k} (T_{k} + (1 - λ) E_{μ}) U_{k + 1} 1 ∥_{L_{1} (μ)} \leq

∥ U_{1} (T_{1} + (1 - λ) E_{μ}) U_{2} (T_{2} + (1 - λ) E_{μ}) U_{3} \dots U_{k} (T_{k} + (1 - λ) E_{μ}) U_{k + 1} 1 ∥_{L_{1} (μ)} \leq

s \in {0, 1}^{k} \sum j : s_{j} = 1 \prod ∥ T_{j} ∥_{L_{2} (μ) \to L_{2} (μ)} j : s_{j} = 0 \prod (1 - λ) j \in t (s) \prod ∣ ⟨ u_{j}, μ ⟩ ∣ .

E [\frac{1}{( x + 1 ) ^{d}}] \leq \frac{d ^{d}}{n ^{d} p ^{d}} .

E [\frac{1}{( x + 1 ) ^{d}}] \leq \frac{d ^{d}}{n ^{d} p ^{d}} .

i = 0 \sum n (i n) p^{i} (1 - p)^{n - i} \frac{i !}{( i + d )!}

i = 0 \sum n (i n) p^{i} (1 - p)^{n - i} \frac{i !}{( i + d )!}

= i = 0 \sum n (i + d n + d) p^{i + d} (1 - p)^{n - i} \frac{n !}{( n + d )! p ^{d}}

\leq \frac{n !}{( n + d )! p ^{d}} .

\mbox Pr [∣ f_{1} (Y_{1}) v_{1} + f_{2} (Y_{2}) v_{2} + \dots + f_{n} (Y_{n}) v_{n} - x_{0} ∣ \leq 1] \leq \frac{C}{( 1 - λ ) n} .

\mbox Pr [∣ f_{1} (Y_{1}) v_{1} + f_{2} (Y_{2}) v_{2} + \dots + f_{n} (Y_{n}) v_{n} - x_{0} ∣ \leq 1] \leq \frac{C}{( 1 - λ ) n} .

\mbox Pr [∣ f_{1} (Y_{1}) v_{1} + \dots + f_{n} (Y_{n}) v_{n} - x_{0} ∣ \leq 1] \leq C_{1} \int_{- 1}^{1} ∣ E [exp (2 π i ξ (f_{1} (Y_{1}) v_{1} + \dots + f_{n} (Y_{n}) v_{n}))] ∣ d ξ

\mbox Pr [∣ f_{1} (Y_{1}) v_{1} + \dots + f_{n} (Y_{n}) v_{n} - x_{0} ∣ \leq 1] \leq C_{1} \int_{- 1}^{1} ∣ E [exp (2 π i ξ (f_{1} (Y_{1}) v_{1} + \dots + f_{n} (Y_{n}) v_{n}))] ∣ d ξ

E [exp (2 π i ξ (f_{1} (Y_{1}) v_{1} + \dots + f_{n} (Y_{n}) v_{n}))] = E [j = 1 \prod n exp (2 π i ξ f_{j} (Y_{j}) v_{i})] .

E [exp (2 π i ξ (f_{1} (Y_{1}) v_{1} + \dots + f_{n} (Y_{n}) v_{n}))] = E [j = 1 \prod n exp (2 π i ξ f_{j} (Y_{j}) v_{i})] .

∥ U_{1} (T_{1} + (1 - λ) E_{μ}) U_{2} (T_{2} + (1 - λ) E_{μ}) U_{3} \dots U_{n - 1} (T_{n - 1} + (1 - λ) E_{μ}) U_{n} 1 ∥_{L_{1} (μ)} \leq s \in {0, 1}^{n - 1} \sum j : s_{j} = 1 \prod λ j : s_{j} = 0 \prod (1 - λ) j \in t (s) \prod ∣ cos (2 π ξ v_{j}) ∣,

∥ U_{1} (T_{1} + (1 - λ) E_{μ}) U_{2} (T_{2} + (1 - λ) E_{μ}) U_{3} \dots U_{n - 1} (T_{n - 1} + (1 - λ) E_{μ}) U_{n} 1 ∥_{L_{1} (μ)} \leq s \in {0, 1}^{n - 1} \sum j : s_{j} = 1 \prod λ j : s_{j} = 0 \prod (1 - λ) j \in t (s) \prod ∣ cos (2 π ξ v_{j}) ∣,

C_{1} s \in {0, 1}^{n - 1} \sum j : s_{j} = 1 \prod λ j : s_{j} = 0 \prod (1 - λ) \frac{C _{2}}{∣ t ^{'} ( s ) ∣ + 1} .

C_{1} s \in {0, 1}^{n - 1} \sum j : s_{j} = 1 \prod λ j : s_{j} = 0 \prod (1 - λ) \frac{C _{2}}{∣ t ^{'} ( s ) ∣ + 1} .

r = ∣ {j : s_{j} = s_{j + 1} = 0 and ∣ v_{j} ∣ \geq 1} ∣,

r = ∣ {j : s_{j} = s_{j + 1} = 0 and ∣ v_{j} ∣ \geq 1} ∣,

\mbox Pr [s = s] = j : s_{j} = 1 \prod λ j : s_{j} = 0 \prod (1 - λ) .

\mbox Pr [s = s] = j : s_{j} = 1 \prod λ j : s_{j} = 0 \prod (1 - λ) .

C_{1} E [\frac{C _{2}}{r ( s ) + 1}] .

C_{1} E [\frac{C _{2}}{r ( s ) + 1}] .

E [\frac{C}{r ( s ) + 1}] \leq E [\frac{C}{r ^{'}}] \leq (E [\frac{C ^{2}}{r ^{'}}])^{1/2},

E [\frac{C}{r ( s ) + 1}] \leq E [\frac{C}{r ^{'}}] \leq (E [\frac{C ^{2}}{r ^{'}}])^{1/2},

\mbox Pr [∣ v_{1} ∣ \geq \frac{1}{C d}] \geq \frac{1}{2}

\mbox Pr [∣ v_{1} ∣ \geq \frac{1}{C d}] \geq \frac{1}{2}

\frac{1}{2 ^{d - 3}} \cdot \frac{Γ ( d - 1 )}{Γ (( d - 1 ) /2 ) ^{2}} \leq \frac{1}{2 ^{d - 3}} \cdot \frac{C _{1} ( d - 1 ) ^{d - 3/2} e ^{- d + 2}}{C _{1}^{2} (( d - 1 ) /2 ) ^{d - 2} e ^{- d + 1}} \leq C_{2} d - 1

\frac{1}{2 ^{d - 3}} \cdot \frac{Γ ( d - 1 )}{Γ (( d - 1 ) /2 ) ^{2}} \leq \frac{1}{2 ^{d - 3}} \cdot \frac{C _{1} ( d - 1 ) ^{d - 3/2} e ^{- d + 2}}{C _{1}^{2} (( d - 1 ) /2 ) ^{d - 2} e ^{- d + 1}} \leq C_{2} d - 1

\mbox Pr [∣ (A f_{1} (Y_{1}) v_{1} + \dots + A f_{n} (Y_{n}) v_{n} - A x_{0})_{1} ∣ \leq R] .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPoint processes and geometric inequalities · Markov Chains and Monte Carlo Methods · Random Matrices and Applications

Full text

The Littlewood-Offord Problem for Markov Chains

Shravas Rao

Abstract.

The celebrated Littlewood-Offord problem asks for an upper bound on the probability that the random variable $\varepsilon_{1}v_{1}+\cdots+\varepsilon_{n}v_{n}$ lies in the Euclidean unit ball, where $\varepsilon_{1},\ldots,\varepsilon_{n}\in\{-1,1\}$ are independent Rademacher random variables and $v_{1},\ldots,v_{n}\in\mathbb{R}^{d}$ are fixed vectors of at least unit length. We extend many known results to the case that the $\varepsilon_{i}$ are obtained from a Markov chain, including the general bounds first shown by Erdős in the scalar case and Kleitman in the vector case, and also under the restriction that the $v_{i}$ are distinct integers due to Sárközy and Szemeredi. In all extensions, the upper bound includes an extra factor depending on the spectral gap. We also construct a pseudorandom generator for the Littlewood-Offord problem using similar techniques.

This material is based upon work supported by the National Science Foundation Graduate Research Fellowship Program under Grant No. DGE-1342536.

1. Introduction

Let $v_{1},\ldots,v_{n}\in\mathbb{R}^{d}$ be fixed vectors of Euclidean length at least $1$ , and let $\varepsilon_{1},\ldots,\varepsilon_{n}$ be independent Rademacher random variables, so that $\mbox{\rm Pr}[\varepsilon_{i}=1]=\mbox{\rm Pr}[\varepsilon_{i}=-1]=1/2$ for all $i$ . The celebrated Littlewood-Offord problem [LO43] asks for an upper bound on the probability,

[TABLE]

for an open Euclidean ball $B$ with radius $1$ . This question was first investigated by Littlewood and Offord for the case $d=1$ and $d=2$ [LO43]. A tight bound of $\binom{n}{n/2}/2^{n}=\Theta(1/\sqrt{n})$ when $n$ is even, with the worst case being when the vectors are equal, was found by Erdős for the case $d=1$ using a clever combinatorial argument [Erd45]. Such bounds can be contrasted with concentration inequalities like the Hoeffding inequality in the scalar case and the Khintchine-Kahane inequality in the vector case, both of which give an upper bound on the probability $\mbox{\rm Pr}[\|\varepsilon_{1}v_{1}+\cdots+\varepsilon_{n}v_{n}\|\geq k\sqrt{n}]$ for positive $k$ . In contrast, an upper bound on Eq. (1) can be considered a form of anti-concentration, that is showing that the random sum is unlikely to be in $B$ .

In the case that the $v_{i}$ are $d$ -dimensional vectors, a tight bound up to constant factors of $C/\sqrt{n}$ was found by Kleitman [Kle70], and was improved by series of work [Sal83, Sal85, FF88, TV12]. In the scalar case, under the restriction that $v_{1},\ldots,v_{n}$ are distinct integers, an upper bound of $n^{-3/2}$ was found by Sárközy and Szemeredi [SS65].

In this work, we investigate the case in which $\varepsilon_{1},\ldots,\varepsilon_{n}$ are not independent, but are obtained from a stationary reversible Markov chain $\{Y_{i}\}_{i=1}^{\infty}$ with state space $[N]$ and transition matrix $A$ , and functions $f_{1},\ldots,f_{n}:[N]\rightarrow\{-1,1\}$ , using $\varepsilon_{i}=f_{i}(Y_{i})$ .

Let $\mu$ be the stationary distribution for the Markov chain, and let $E_{\mu}$ be the associated averaging operator defined by $(E_{\mu})_{ij}=\mu_{j}$ , so that for $v\in\mathbb{R}^{N}$ , $E_{\mu}v=\mathbb{E}_{\mu}[v]\mathbf{1}$ where $\mathbf{1}$ is the vector whose entries are all $1$ . Like many results on Markov chains, our generalizations will be in terms of the quantity

[TABLE]

If the $Y_{i}$ are independent, that is $A=E_{\mu}$ , it follows that $\lambda=0$ . Often, if $\lambda$ is small, the corresponding Markov chain behaves almost as if it were independent. In particular, there exists a Berry-Esseen theorem for Markov chains [Man96] and various concentration inequalities for Markov chain [Gil98, Lez98, LP04]. In all of these cases, there is an extra factor in the bounds in terms of $\lambda$ which disappears if $\lambda=0$ .

We show that the Littlewood-Offord problem can also be generalized to Markov chains with an extra dependence on $\lambda$ , for all dimensions. We additionally consider the one-dimensional case when the scalars are distinct integers. In all cases, the proof is based off a Fourier-analytic argument due to Halász [Hal77].

The random variables in all cases are defined in the same way, which we state below.

Setting 1.1.

Let $\{Y_{i}\}_{i=1}^{\infty}$ be a stationary reversible Markov chain with state space $[N]$ , transition matrix $A$ , stationary probability measure $\mu$ , and averaging operator $E_{\mu}$ so that $Y_{1}$ is distributed according to $\mu$ . Let $\lambda=\|A-E_{\mu}\|_{L_{2}(\mu)\rightarrow L_{2}(\mu)}$ , and let $f_{1},\ldots,f_{n}:[N]\rightarrow\{-1,1\}$ be such that $\mathbb{E}[f_{i}(Y_{i})]=0$ for every $i$ . Then consider the random variables $f_{1}(Y_{1}),f_{2}(Y_{2}),\ldots,f_{n}(Y_{n})$ .

We obtain the following theorem that upper bounds the probability that the random sum is concentrated on any unit ball. In the case that the $v_{i}$ are one-dimensional, the bound is tight up to a factor of $\sqrt{(1-\lambda)/(1+\lambda)}$ in $\lambda$ . Note that the bound depends on the dimension, while in the independent case, there is no dependence on the dimension.

Theorem 1.2.

Assume the setting of 1.1. Let $x_{0}\in\mathbb{R}^{d}$ and $R\geq\frac{1}{C\sqrt{d}}$ for some universal constant $C^{\prime}$ . For every set of vectors $v_{1},\ldots,v_{n}\in\mathbb{R}^{d}$ of Euclidean length at least $1$ ,

[TABLE]

for some universal constant $C$ .

In the one-dimensional case, we also consider the restriction that $v_{1},\ldots,v_{n}$ are distinct integers.

Theorem 1.3.

Assume the setting of 1.1. Then for every set of distinct integers $v_{1},\ldots,v_{n}\geq 1$ and $x_{0}\in\mathbb{Z}$ ,

[TABLE]

for some universal constant $C$ .

Finally, we consider a different setting, where rather than choosing $\varepsilon_{1},\ldots,\varepsilon_{n}$ independently, we choose these uniformly at random from a subset $D$ of $\{-1,1\}^{n}$ that we can construct explicitly.

Theorem 1.4.

For every $n$ , there exists an explicit set $D\subseteq\{-1,1\}^{n}$ of cardinality at most $2^{C_{1}\sqrt{n}}$ for some universal constant $C_{1}$ such that the following holds. For every $v_{1},\ldots,v_{n}\geq 1$ and $x_{0}\in\mathbb{R}$ and $\varepsilon$ chosen uniformly at random from $D$

[TABLE]

for some universal constant $C$ independent of $n$ .

One interpretation of Theorem 1.4 is that one can obtain similar results as in the Littlewood-Offord problem in one dimension using much less randomness, and in particular, using $C_{1}\sqrt{n}$ bits of randomness rather than $n$ .

This setting was also considered in [KKL17], in which the authors were able to construct an explicit set of cardinality $n2^{n^{c}}$ , from which a random sample satisfies

[TABLE]

for any constant $c$ bounded above by $1$ . Sampling from the set in Theorem 1.4 guarantees a stronger bound on the probability that the sum lands in any interval, while requiring more randomness when $c<1/2$ .

1.1. Future Work

It would be interesting to remove the dependence on the dimension in Theorem 1.2, which does not appear in the tightest bounds for independent random variables.

The setting studied by Sárközy and Szemeredi, in which the the $v_{i}$ are distinct positive integers and the random variables are independent, was the first in a series of work investigating under what conditions Eq. (1) can be bounded more strongly. We call a set $Q\subseteq\mathbb{R}^{d}$ a generalized arithmetic progression (GAP) of rank $r$ if it can be expressed as

[TABLE]

for some $v_{0},\ldots,v_{r}\in\mathbb{R}^{d}$ , $M_{1},\ldots,M_{r}\in\mathbb{Z}$ and $M_{1}^{\prime},\ldots,M_{r}^{\prime}\in\mathbb{Z}$ . In a series of works starting with [TV09] and improved by [TV10, NV11], it was shown that when $d=1$ , if Eq. (1) is bounded above by $n^{-C}$ for all unit-balls $B$ , then the set $\{v_{1},\ldots,v_{n}\}$ must be mostly contained in some GAP of rank- $r$ , where $r$ depends on $C$ . It would be interesting to see if such an analogue holds when the random variables are chosen from a Markov chain.

It would also be interesting to improve Theorem 1.4 by constructing explicit sets of cardinality smaller than $2^{C_{1}\sqrt{n}}$ that achieve similar properties.

2. Preliminaries

Given vectors $v,\mu\in\mathbb{R}^{N}$ (typically $\mu$ will be a distribution over $[N]$ ), we define the $L_{p}(\mu)$ -norm by

[TABLE]

Additionally, we let the ${L_{p}(\mu)}\rightarrow{L_{q}(\mu)}$ -operator norm of a matrix $A\in\mathbb{R}^{N\times N}$ be defined as

[TABLE]

Finally, we will use $\ell_{p}$ in place of $L_{p}(\mu)$ when $\mu$ is the vector whose entries are all $1$ . Note that in this case, $\mu$ is not a distribution.

For a vector $v$ , we let $\operatorname{diag}(v)$ be the diagonal matrix where $\operatorname{diag}(v)_{i,i}=v_{i}$ .

Let $A$ be a stochastic matrix, and let $\mu$ be a distribution for which $A$ is reversible, that is, $\mu_{i}A_{ij}=\mu_{j}A_{ji}$ . We let $(E_{\mu})_{ij}=\mu_{j}$ be the averaging operator on $L_{\infty}(\mu)\rightarrow L_{\infty}(\mu)$ . Note that $E_{\mu}$ is also stochastic and reversible on $\mu$ .

3. The Littlewood-Offord problem for independent random variables

As warm up, we present the bound in the independent case for $1$ -dimensional vectors, or scalars. These calculations will be used later in the proofs of Theorems 1.2, 1.3, and 1.4,. This bound was first proved by Erdős [Erd45] who used a clever combinatorial argument that applies Sperner’s theorem. The proof we present is in spirit, due to Halász [Hal77] and is based on techniques from Fourier analysis.

We start by presenting the following concentration inequality due to Esséen [Ess66], which will allow us to upper-bound probabilities. This inequality is in the spirit of Fourier inversion, but written in a way that can be more readily applied for our purposes.

Theorem 3.1 (Esséen concentration inequality).

Let $X\in\mathbb{R}^{d}$ be a random variable taking a finite number of values. For $R,\varepsilon>0$ ,

[TABLE]

The following bound is implicit in the proof of Proposition 7.18 in [TV06] and will be used to further bound the quantities obtained from Theorem 3.1

Claim 3.2.

Let $v_{1},\ldots,v_{k}\in\mathbb{R}$ be such that $|v_{j}|\geq 1$ for all $j$ . Then

[TABLE]

for some constant $C$ .

We now prove the bound in the independent case.

Theorem 3.3.

Let $v_{1},\ldots,v_{n}\in\mathbb{R}$ be non-zero, and let $\varepsilon_{1},\ldots,\varepsilon_{n}$ be independent random variables uniform over the set $\{-1,1\}$ . Then for all $x_{0}\in\mathbb{R}$ ,

[TABLE]

for some constant $C$ independent of $n$ .

Proof:

By Theorem 3.1, the left-hand side can be bounded above by

[TABLE]

for some constants $C_{1}$ and $C_{2}$ . The first equality follows from the independence of the $\varepsilon_{j}$ , the next equality follows from the fact that $\varepsilon_{j}$ is uniform over $\{-1,1\}$ for all $j$ , and the subsequent inequality follows from Claim 3.2. $\Box$

4. The Littlewood-Offord Problem for Random Variables from a Markov chain

Now we consider the case that $\varepsilon_{1},\ldots,\varepsilon_{n}$ are obtained from a Markov chain. The proof follows very closely the proof for independent random variables in Proposition 7.18 in [TV06] which itself is due to Halász [Hal77].

In order to handle the extra dependencies from the Markov chain, we will use the following technical lemma, which is a straightforward adaptation of a Lemma from [NRR17]. We include a proof in Appendix A.

Lemma 4.1.

Let $k\geq 1$ be an integer, $u_{1},\ldots,u_{k+1}\in\mathbb{C}^{N}$ be $N$ -dimensional vectors such that $\|u_{i}\|_{L_{\infty}(\mu)}\leq 1$ , $U_{i}=\operatorname{diag}(u_{i})$ , and $T_{1},\ldots,T_{k}\in\mathbb{R}^{N\times N}$ . For $s\in\{0,1\}^{k}$ , let $\overline{s}:=(0,s,0)\in\{0,1\}^{k+2}$ and define $t(s)\subseteq[n]$ to be $t(s):=\{i\ :\ \overline{s}_{i}=\overline{s}_{i-1}=0\}$ . Then,

[TABLE]

Before proving Theorem 1.2, we first prove the following that will allow us to upper-bound negative moments of binomial random variables.

Claim 4.2.

Let $x=B(n,p)$ be a binomial random variable with $n$ trials, each with success probability $p>0$ . Then for all positive integers $d$ ,

[TABLE]

Proof:

Note that because $d(i+1)\geq i+d$ for all non-negative $i$ , the right-hand side is bounded above by $d^{d}\mathbb{E}\left[\frac{x!}{(x+d)!}\right]$ , where the term inside the expected value can be written as

[TABLE]

The claim follows by noting that $n\leq n+i$ for $1\leq i\leq d$ . $\Box$

We start by considering the case of $1$ -dimensional vectors, or scalars. We also consider the case in which at most one-half of the $v_{i}$ have length less than $1$ . This will allow us to generalize to higher dimensions. We note that in the case of independent random variables the corresponding statement follows from the usual Littlewood-Offord problem, by conditioning on the $\varepsilon_{i}$ such that $|v_{i}|<1$ , for just an increase in the constant factor in the bound.

Lemma 4.3.

Assume the setting of 1.1. Then for every $v_{1},\ldots,v_{n}\in\mathbb{R}$ such that $|\{i:|v_{i}|\geq 1\}|\geq n/2$ and $x_{0}\in\mathbb{R}$ ,

[TABLE]

for some universal constant $C$ .

Proof:

By Theorem 3.1,

[TABLE]

for some constant $C_{1}$ . Note that

[TABLE]

Let $T_{j}=A-(1-\lambda)E_{\mu}$ , let $u_{j}$ be the vector defined by $u_{j}(y)=\exp(2\pi i\xi f_{j}(y)v_{j})$ for $y\in[N]$ , and let $U_{j}=\operatorname{diag}(u_{j})$ . For $s\in\{0,1\}^{n-1}$ , let $t(s)$ be the set of indices $j$ such that $s_{j-1}=s_{j}=0$ , and also includes $1$ if $s_{1}=0$ and includes $n$ if $s_{n-1}=0$ . Then the right-hand side of Eq. (5) is bounded above by

[TABLE]

where the inequality follows by Lemma 4.1 and evaluating $|\langle\mu,u\rangle|$ .

Let $t^{\prime}(s)$ be the set of indices $j\in t(s)$ such that $|v_{j}|$ is greater than $1$ . When $|t^{\prime}(s)|=0$ , the corresponding product disappears. When $|t^{\prime}(s)|>0$ , we can apply Claim 3.2. Thus, the right-hand side of Eq. (4) can be bounded above by

[TABLE]

Let $r:\{0,1\}^{n-1}\rightarrow[n-1]$ be defined as

[TABLE]

so that $r(s)\leq|t^{\prime}(s)|$ for all $s\in\{0,1\}^{n-1}$ . Let $\mathbf{s}$ be a random vector from $\{0,1\}^{n-1}$ so that for each $s\in\{0,1\}^{n-1}$

[TABLE]

By the definition of $r$ and $\mathbf{s}$ , the right-hand side of Eq. (6) is bounded above by,

[TABLE]

We conclude with the following argument. Let $r^{\prime}=B(\lfloor n/4\rfloor-1,(1-\lambda)^{2})+1$ where $B(n,p)$ denotes a binomial random variable with $n$ trials, each with success probability $p$ . It follows that $r^{\prime}$ is dominated by $r(\mathbf{s})+1$ , and thus

[TABLE]

where the second inequality follows by Jensen’s inequality. Finally, by Claim 4.2, the right-hand side of Eq. (7) is bounded above by $C\left((1-\lambda)\sqrt{\lfloor n/4\rfloor}\right)^{-1}$ as desired. $\Box$

Before proving Theorem 1.2, we prove the following bound on random unit vectors.

Claim 4.4.

Let $v\in\mathbb{R}^{d}$ be a random unit vector uniform over the $d-1$ -dimensional sphere. Then there exists a constant $C$ such that

[TABLE]

Proof:

We start by noting that the probability density function of $v_{1}$ at $t$ is proportional to $(1-t^{2})^{(d-3)/2}$ , which is also the probability density of the beta distribution, shifted so that the domain is $[-1,1]$ . The probability density function at all points is bounded above by

[TABLE]

for some constants $C_{1}$ and $C_{2}$ , where the inequality follows from Stirling’s approximation (see [Jam15]). The claim follows by letting $C=C_{2}/4$ . $\Box$

We now use Lemma 4.3 to prove Theorem 1.2 as follows.

Proof of Theorem 1.2:

Let $A\in\mathcal{SO}(d)$ be a random rotation uniform over the Haar measure of the special orthogonal group. Then it is enough to consider the random variable $\|Af_{1}(Y_{1})v_{1}+\cdots+Af_{n}(Y_{n})v_{n}-Ax_{0}\|_{\ell_{2}}$ . Additionally, the left-hand side in the statement of the theorem is bounded above by

[TABLE]

This is because if the absolute value of the first coordinate of the random vector is greater than $R$ , so is the Euclidean norm.

By Claim 4.4, for any fixed $d$ , it holds that $|f_{i}(Y_{i})v_{i}|\geq 1/(C^{\prime}\sqrt{d})$ for at least half of the $i$ for some constant $C^{\prime}$ . By Lemma 4.3, we have that Eq. (8) is bounded above by

[TABLE]

as desired. $\Box$

*Remark 4.5**.*

In the case of one dimension, Theorem 1.2 is tight up to a factor of $\sqrt{(1-\lambda)/(1+\lambda)}$ . To see this, consider the transition matrix on two states defined by

[TABLE]

with $f(1)=1$ and $f(2)=-1$ , and stationary distribution uniform over both states. Such a Markov chain can be interpreted as first choosing a state at random, and then at each subsequent step choosing a new state uniformly at random with probability $1-\lambda$ , or switching states with probability $\lambda$ . We can associate with this walk a sequence of numbers, $(X_{1},X_{2},\ldots)$ obtained as follows. Whenever a state is chosen at random, we add a new entry in the sequence starting at $1$ , and increase this entry every time the state is switched. Then conditioned on this sequence, $f(Y_{1})+f(Y_{2})+\cdots+f(Y_{n})$ is distributed as $\varepsilon_{1}+\varepsilon_{2}+\cdots+\varepsilon_{\mathbf{n}}$ where $\mathbf{n}$ is the number of entries in the sequence that are odd. Thus, if $\mathbf{n}$ is considered as a random variable,

[TABLE]

If we assume that $n$ is large, then the probability that any given step in the walk is the start of a entry that will eventually be of odd length is approximately $1/(1+\lambda)$ , and thus, $\mathbf{n}$ is approximately distributed like $B(n,(1-\lambda)/(1+\lambda))$ , and thus

[TABLE]

5. Extension to distinct $v_{i}$ ’s

Theorem 3.3, the bound obtained in the independent case, is tight when $v_{1}=\cdots=v_{n}=1$ . It is reasonable to ask if one can obtain better bounds on the probability $\mbox{\rm Pr}[\varepsilon_{1}v_{1}+\cdots+\varepsilon_{n}v_{n}\in B]$ under certain restrictions of $v_{1},\ldots,v_{n}$ . In particular, when the $v_{i}$ are distinct integers, Sárközy and Szemeredi [SS65] showed that for all $x_{0}$ and for some constant $C$

[TABLE]

which is a factor $n$ smaller than Theorem 3.3.

Like Erdős’s proof of Theorem 3.3, the proof of the above by Sárközy and Szemeredi uses a clever combinatorial argument. However, Halász’s Fourier-analytic argument can also be used to prove the above. We prove a similar bound in the case of Markov chains.

Our proof is based on the techniques used in [TV06] for the same problem, in which the Fourier-analytic argument is over the group $\mathbb{Z}_{p}$ for some large enough $p$ , rather than over the integers or over the real numbers. The following claim is implicit in Corollary 7.16 in [TV06] and will be used in our computation.

Claim 5.1.

If $v_{1},\ldots,v_{n}$ are distinct positive integers, then there exists a prime $p$ such that $p\geq v_{i}$ for all $i$ , and

[TABLE]

We use Claim 5.1 to prove Theorem 1.3 which is a Markov chain version of Eq. (9).

Proof of Theorem 1.3:

Let $p$ be the prime in Claim 5.1. Note that by Fourier inversion,

[TABLE]

Let $T_{j}=A-(1-\lambda)E_{\mu}$ for all $j$ , and let $u_{i}$ be the vector defined by $u_{j}(y)=\exp(2\pi i(\xi\cdot f_{j}(y)v_{j})/N)$ . Then the absolute value of the expectation inside the right-hand side of Eq. (10) is bounded above by

[TABLE]

by Lemma 4.1, where for each $s\in\{0,1\}^{n-1}$ , we define $t(s)$ to be the set of indices $j$ such that $s_{j-1}=s_{j}=0$ , or $s_{j}=0$ if $j=1$ or $s_{j-1}=0$ if $j={k+1}$ . Thus by Claim 5.1, we can upper bound on the right-hand side of Eq. (10) by

[TABLE]

where the inequality also holds in the case that $|t(s)|=0$ .

As in the proof of Theorem 1.2, let $r:\{0,1\}^{n-1}\rightarrow[n-1]$ be defined as

[TABLE]

so that $r(s)\leq|t(s)|$ for all $s\in\{0,1\}^{n-1}$ , and let $\mathbf{s}$ be a random vector from $\{0,1\}^{n-1}$ so that for each $s\in\{0,1\}^{n-1}$

[TABLE]

By the definition of $r(\mathbf{s})$ , we have

[TABLE]

As before, let $r^{\prime}=B(\lfloor(n/2\rfloor-1,(1-\lambda)^{2})+1$ . Then because $r^{\prime}$ is dominated by $r(s)$ ,

[TABLE]

where again the second inequality follows by Jensen’s inequality. Finally, Claim 4.2 can be used to upper-bound the right-hand side of Eq. (11). $\Box$

6. A Pseudorandom Generator for the Littlewood-Offord Problem

In this section we prove Theorem 1.4. As stated in the introduction, this theorem can be interpreted as proving the existence of a pseudorandom generator for the Littlewood-Offord problem.

We start by describing the construction of $D$ . Our construction will be based on expander graphs which we define as follows. Given a $d$ -regular graph $G=(V,E)$ , let $A$ be the normalized adjacency matrix of $G$ and let $J$ be the matrix whose entries are all $1/|V|$ . We say that a family of $d$ -regular graphs $\mathcal{G}$ is a family of expanders if for all graphs $G$ in the family,

[TABLE]

for some constant $\lambda$ bounded away from $1$ , where $\mu$ is the vector whose entries are all $1/|V|$ . Note that when $G=(V,E)$ is $d$ -regular, the stationary distribution is $\mu$ , and the averaging operator is $J$ . Thus, $1-\|A-J\|_{L_{2}(\mu)\rightarrow L_{2}(\mu)}$ is also the spectral gap of the Markov chain that is a simple random walk on $G$ . It is well known that there exist infinite families of expander graphs of constant degree $d$ independent of the number of vertices (see for example, [LPS88] and [Mar88]).

Let $G=(\{-1,1\}^{k},E)$ be a $d$ -regular graph from such a family so that $\|A-J\|_{L_{2}(\mu)\rightarrow L_{2}(\mu)}\leq\lambda$ for some constant $\lambda$ independent of $k$ . We let our set $D$ be the set of concatenations of the labels of walks of length $n/k$ on $G$ , and thus $D$ has cardinality $2^{k+C_{1}n/k}$ for some constant $C_{1}$ independent of $n$ and $k$ .

Proof of Theorem 1.4:

Let $\mu$ be the uniform measure on $\{-1,1\}^{k}$ and let $D$ be as defined above. Then by Theorem 3.1,

[TABLE]

For each $j\in[n/k]$ , let $T_{j}=A-(1-\lambda)J$ and let $u_{j}\in\mathbb{R}^{\{-1,1\}^{k}}$ be the vector defined by

[TABLE]

and let $U_{j}=\operatorname{diag}(u_{j})$ . Then $|\mathbb{E}[\exp(2\pi i\xi(\varepsilon_{1}v_{1}+\cdots+\varepsilon_{n}v_{n}))]|$ is bounded above by,

[TABLE]

where the inequality follows by Lemma 4.1, and for each $s\in\{0,1\}^{n/k-1}$ , we define $t(s)$ to be the set of indices $j$ such that $s_{j-1}=s_{j}=0$ , or $s_{j}=0$ if $j=1$ or $s_{j-1}=0$ if $j={n/k}$ .

Note that $\langle u_{j},\mu\rangle$ is the Fourier transform at $\xi$ of the random variable $w_{(j-1)k+1}v_{(j-1)k+1}+\cdots+w_{jk}v_{jk}$ where each coordinate of $w$ is uniformly random over the set $\{-1,1\}$ . This brings us back to the original setting of completely independent random variables, and by Eq. (2), it follows that

[TABLE]

Thus by inserting the above in Eq. (13) we obtain and upper-bound on the right-hand side of Eq. (12) of

[TABLE]

where the inequality follows from Claim 3.2, We proceed by using the same argument as in Lemma 4.3 starting from Eq. (6), which gives an upper bound of $C/\sqrt{k\cdot(n/k)}=C/\sqrt{n}$ as desired. Finally, we obtain a construction of the desired size by letting $k=\sqrt{n}$ . $\Box$

Appendix A Proof of Lemma 4.1

We prove Lemma 4.1, which as mentioned previously, is a straightforward adaptation of the proof of a Lemma from [NRR17]. Before getting to the proof, we first state the following two claims.

Claim A.1.

For all $k\geq 1$ , matrices $R_{1},\ldots,R_{k}\in\mathbb{R}^{N\times N}$ , and distributions $\mu$ over $[N]$ ,

[TABLE]

Proof:

Notice that for any vector $v$ , $E_{\mu}v=\mathbb{E}_{\mu}[v]\mathbf{1}$ . The claim follows by noting that $\mathbb{E}_{\mu}[v]\leq\|v\|_{L_{1}(\mu)}$ and by induction. $\Box$

Claim A.2.

Let $u_{1},\ldots,u_{k+1}\in\mathbb{C}^{N}$ be so that $|(u_{j})_{i}|\leq 1$ for all $i$ and $j$ , let $U_{j}=\operatorname{diag}(u_{j})$ , and let $T_{1},\ldots,T_{k}\in\mathbb{R}^{N\times N}$ . Then,

[TABLE]

Proof:

By Jensen’s inequality, the right-hand side is bounded above by

[TABLE]

and the claim follows by the definition of operator norm and the fact that $\|U_{i}\|_{L_{2}(\mu)\rightarrow L_{2}(\mu)}=\|u_{i}\|_{L_{\infty}(\mu)}$ . $\Box$

Claim A.3.

Let $\mu\in R^{N}$ be a distribution, and let $E_{\mu}$ be the associated averaging operator. Then for any $u\in\mathbb{C}^{N}$ ,

[TABLE]

Proof:

[TABLE]

$\Box$

Proof of Lemma 4.1:

For $j=1,\ldots,k$ , let $T_{j,0}=(1-\lambda)E_{\mu}$ and $T_{j,1}=T_{j}$ . Let $U_{i}^{\prime}=U_{i}$ if $i\in t(s)$ , and $U_{i}^{\prime}=I$ otherwise. Then using the triangle inequality, the left-hand side of (4.1) is at most

[TABLE]

where the equality follows from Claim A.3 and the fact that $E_{\mu}^{2}=E_{\mu}$ .

Fix an $s\in\{0,1\}^{n}$ and let $r_{1},\ldots,r_{\ell}\in t(s)$ be the indices for which the ${r_{i}}$ th coordinate of $s$ is [math]. Then by Claim A.1,

[TABLE]

The claim now follows by applying Claim A.2. $\Box$

Bibliography24

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[Cro 11] E. Croot. Fourier proof of the classical littlewood-offord inequality, 2011.
2[Erd 45] P. Erdös. On a lemma of Littlewood and Offord. Bull. Amer. Math. Soc. , 51:898–902, 1945. ISSN 0002-9904.
3[Ess 66] C. G. Esseen. On the Kolmogorov-Rogozin inequality for the concentration function. Z. Wahrscheinlichkeitstheorie und Verw. Gebiete , 5:210–216, 1966. doi: 10.1007/BF 00533057 .
4[FF 88] P. Frankl and Z. Füredi. Solution of the Littlewood-Offord problem in high dimensions. Ann. of Math. (2) , 128(2):259–270, 1988. ISSN 0003-486X.
5[Gil 98] D. Gillman. A Chernoff bound for random walks on expander graphs. SIAM J. Comput. , 27(4):1203–1220, 1998. ISSN 0097-5397. doi: 10.1137/S 0097539794268765 .
6[Hal 77] G. Halász. Estimates for the concentration function of combinatorial number theory and probability. Period. Math. Hungar. , 8(3-4):197–211, 1977. ISSN 0031-5303.
7[Jam 15] G. J. O. Jameson. A simple proof of Stirling’s formula for the gamma function. Math. Gaz. , 99(544):68–74, 2015. ISSN 0025-5572. doi: 10.1017/mag.2014.9 .
8[KKL 17] V. Kabanets, D. M. Kane, and Z. Lu. A polynomial restriction lemma with applications. In STOC’17—Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing , pages 615–628. ACM, New York, 2017.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

The Littlewood-Offord Problem for Markov Chains

Abstract.

1. Introduction

Setting 1.1**.**

Theorem 1.2**.**

Theorem 1.3**.**

Theorem 1.4**.**

1.1. Future Work

2. Preliminaries

3. The Littlewood-Offord problem for independent random variables

Theorem 3.1** (Esséen concentration inequality).**

Claim 3.2**.**

Theorem 3.3**.**

4. The Littlewood-Offord Problem for Random Variables from a Markov chain

Lemma 4.1**.**

Claim 4.2**.**

Lemma 4.3**.**

Claim 4.4**.**

Remark 4.5*.*

5. Extension to distinct viv_{i}vi​’s

Claim 5.1**.**

6. A Pseudorandom Generator for the Littlewood-Offord Problem

Appendix A Proof of Lemma 4.1

Claim A.1**.**

Claim A.2**.**

Claim A.3**.**

Setting 1.1.

Theorem 1.2.

Theorem 1.3.

Theorem 1.4.

Theorem 3.1 (Esséen concentration inequality).

Claim 3.2.

Theorem 3.3.

Lemma 4.1.

Claim 4.2.

Lemma 4.3.

Claim 4.4.

*Remark 4.5**.*

5. Extension to distinct $v_{i}$ ’s

Claim 5.1.

Claim A.1.

Claim A.2.

Claim A.3.