Random Matrices from Linear Codes and Wigner's Semicircle Law II

Chin Hei Chan; Maosheng Xiong

arXiv:1907.00323·math.PR·March 10, 2020·IWSDA

Random Matrices from Linear Codes and Wigner's Semicircle Law II

Chin Hei Chan, Maosheng Xiong

PDF

Open Access

TL;DR

This paper proves that random matrices derived from linear codes over finite fields converge to Wigner's semicircle law, with a convergence rate depending on the code length, under the condition that the dual distance is at least 5.

Contribution

It establishes that a dual distance of at least 5 guarantees spectral convergence to Wigner's law with a specific convergence rate, extending previous results.

Findings

01

Spectral distribution converges to Wigner's semicircle law as code length increases.

02

Convergence rate is of order $n^{-eta}$ for some $0<eta<1$.

03

Dual distance ≥ 5 is sufficient for convergence.

Abstract

Recently we considered a class of random matrices obtained by choosing distinct codewords at random from linear codes over finite fields and proved that under some natural algebraic conditions their empirical spectral distribution converges to Wigner's semicircle law as the length of the codes goes to infinity. One of the conditions is that the dual distance of the codes is at least 5. In this paper, employing more advanced techniques related to Stieltjes transform, we show that the dual distance being at least 5 is sufficient to ensure the convergence, and the convergence rate is of the form $n^{- β}$ for some $0 < β < 1$ , where $n$ is the length of the code.

Equations240

G_{C_{i}} = \frac{1}{n _{i}} Φ_{C_{i}} Φ_{C_{i}}^{*},

G_{C_{i}} = \frac{1}{n _{i}} Φ_{C_{i}} Φ_{C_{i}}^{*},

M_{C_{i}} = \frac{n _{i}}{p _{i}} (G_{C_{i}} - I_{p_{i}}) .

M_{C_{i}} = \frac{n _{i}}{p _{i}} (G_{C_{i}} - I_{p_{i}}) .

μ_{A} = \frac{1}{n} j = 1 \sum n δ_{λ_{j}},

μ_{A} = \frac{1}{n} j = 1 \sum n δ_{λ_{j}},

F_{A} (x) := \int_{- \infty}^{x} μ_{A} (d x) .

F_{A} (x) := \int_{- \infty}^{x} μ_{A} (d x) .

μ_{n_{i}} (I) \to ϱ_{SC} (I) in Probability,

μ_{n_{i}} (I) \to ϱ_{SC} (I) in Probability,

d ϱ_{SC} (x) = \frac{1}{2 π} 4 - x^{2} \mathbbm 1_{[- 2, 2]} d x,

d ϱ_{SC} (x) = \frac{1}{2 π} 4 - x^{2} \mathbbm 1_{[- 2, 2]} d x,

∣ ⟨ v, v^{'} ⟩ ∣ \leq c n_{i}, \mbox f or an y v \neq = v^{'} \in ψ (C_{i}) .

∣ ⟨ v, v^{'} ⟩ ∣ \leq c n_{i}, \mbox f or an y v \neq = v^{'} \in ψ (C_{i}) .

wt (c) - \frac{n _{i}}{2} \leq \frac{c}{2} n_{i}, \forall c \in C_{i} ∖ {0},

wt (c) - \frac{n _{i}}{2} \leq \frac{c}{2} n_{i}, \forall c \in C_{i} ∖ {0},

c^{- 1} n^{γ_{1}} \leq p \leq c n^{γ_{2}} .

c^{- 1} n^{γ_{1}} \leq p \leq c n^{γ_{2}} .

∣ μ_{n} (I) - ϱ_{SC} (I) ∣ ≺ n^{- β}

∣ μ_{n} (I) - ϱ_{SC} (I) ∣ ≺ n^{- β}

β := min {\frac{γ _{1}}{4}, \frac{1 - γ _{2}}{8}} .

β := min {\frac{γ _{1}}{4}, \frac{1 - γ _{2}}{8}} .

I sup P [∣ μ_{n} (I) - ϱ_{SC} (I) ∣ > n^{- β + ε}] \leq n^{- D} .

I sup P [∣ μ_{n} (I) - ϱ_{SC} (I) ∣ > n^{- β + ε}] \leq n^{- D} .

ψ (a) = ζ^{Tr (a)}, \forall a \in F_{q},

ψ (a) = ζ^{Tr (a)}, \forall a \in F_{q},

\frac{1}{q}\sum_{x\in\mathbb{F}_{q}}\psi(ax)=\left\{\begin{array}[]{lll}1&:&\mbox{ if }a=0;\\ 0&:&\mbox{ if }a\in\mathbb{F}_{q}\setminus\{0\}.\end{array}\right.

\frac{1}{q}\sum_{x\in\mathbb{F}_{q}}\psi(ax)=\left\{\begin{array}[]{lll}1&:&\mbox{ if }a=0;\\ 0&:&\mbox{ if }a\in\mathbb{F}_{q}\setminus\{0\}.\end{array}\right.

# C^{⊥} = q^{n - k} \leq \frac{q ^{n}}{1 + n ( q - 1 ) + ( 2 n ) ( q - 1 ) ^{2}} = O (\frac{q ^{n}}{n ^{2}}),

# C^{⊥} = q^{n - k} \leq \frac{q ^{n}}{1 + n ( q - 1 ) + ( 2 n ) ( q - 1 ) ^{2}} = O (\frac{q ^{n}}{n ^{2}}),

\frac{n ^{2}}{q ^{k}} = O (1) .

\frac{n ^{2}}{q ^{k}} = O (1) .

\frac{1}{\#\mathcal{C}}\sum_{\mathbf{c}\in\mathcal{C}}\psi(\mathbf{a}\cdot\mathbf{c})=\left\{\begin{array}[]{lll}1&:&\mbox{ if }\mathbf{a}\in\mathcal{C}^{\bot},\\ 0&:&\mbox{ if }\mathbf{a}\notin\mathcal{C}^{\bot}.\end{array}\right.

\frac{1}{\#\mathcal{C}}\sum_{\mathbf{c}\in\mathcal{C}}\psi(\mathbf{a}\cdot\mathbf{c})=\left\{\begin{array}[]{lll}1&:&\mbox{ if }\mathbf{a}\in\mathcal{C}^{\bot},\\ 0&:&\mbox{ if }\mathbf{a}\notin\mathcal{C}^{\bot}.\end{array}\right.

s (z) := \int_{- \infty}^{\infty} \frac{d F ( x )}{x - z} = \int_{- \infty}^{\infty} \frac{μ ( d x )}{x - z},

s (z) := \int_{- \infty}^{\infty} \frac{d F ( x )}{x - z} = \int_{- \infty}^{\infty} \frac{μ ( d x )}{x - z},

\frac{d s ( z )}{d z} \leq \int_{- \infty}^{\infty} \frac{μ ( d x )}{∣ x - z ∣ ^{2}} \leq \frac{1}{η ^{2}},

\frac{d s ( z )}{d z} \leq \int_{- \infty}^{\infty} \frac{μ ( d x )}{∣ x - z ∣ ^{2}} \leq \frac{1}{η ^{2}},

μ ((x_{1}, x_{2}]) = F (x_{2}) - F (x_{1}) = η \to 0^{+} lim \frac{1}{π} \int_{x_{1}}^{x_{2}} ℑ (s (E + i η)) d E;

μ ((x_{1}, x_{2}]) = F (x_{2}) - F (x_{1}) = η \to 0^{+} lim \frac{1}{π} \int_{x_{1}}^{x_{2}} ℑ (s (E + i η)) d E;

G := G (z) = (M - z I_{p})^{- 1},

G := G (z) = (M - z I_{p})^{- 1},

G^{(T)} := G^{(T)} (z) = (M^{(T)} - z I_{p})^{- 1} .

G^{(T)} := G^{(T)} (z) = (M^{(T)} - z I_{p})^{- 1} .

\frac{1}{G _{ℓℓ}^{(T)}} = M_{ℓℓ} - z - m_{ℓ}^{*} G^{(T ℓ)} m_{ℓ},

\frac{1}{G _{ℓℓ}^{(T)}} = M_{ℓℓ} - z - m_{ℓ}^{*} G^{(T ℓ)} m_{ℓ},

∣ Tr G^{(T)} (z) - Tr G (z) ∣ \leq C η^{- 1},

∣ Tr G^{(T)} (z) - Tr G (z) ∣ \leq C η^{- 1},

s_{SC} (z) = \frac{- z + z ^{2} - 4}{2} .

s_{SC} (z) = \frac{- z + z ^{2} - 4}{2} .

u (z) = \frac{1}{- z - u ( z )}

u (z) = \frac{1}{- z - u ( z )}

f : E_{1} \times \dots E_{p} \to R

f : E_{1} \times \dots E_{p} \to R

c_{k} := sup ∣ f (x_{1}, \dots, x_{k - 1}, y, x_{k + 1}, \dots, x_{p}) - f (x_{1}, \dots, x_{k - 1}, z, x_{k + 1}, \dots, x_{p}) ∣,

c_{k} := sup ∣ f (x_{1}, \dots, x_{k - 1}, y, x_{k + 1}, \dots, x_{p}) - f (x_{1}, \dots, x_{k - 1}, z, x_{k + 1}, \dots, x_{p}) ∣,

P (∣ Y - E Y ∣ \geq ε) \leq 2 exp (- \frac{2 ε ^{2}}{c _{1}^{2} + \dots + c _{p}^{2}}) .

P (∣ Y - E Y ∣ \geq ε) \leq 2 exp (- \frac{2 ε ^{2}}{c _{1}^{2} + \dots + c _{p}^{2}}) .

P (∣ s (z) - E s (z) ∣ \geq ε) \leq 2 exp (- \frac{p η ^{2} ε ^{2}}{8}) .

P (∣ s (z) - E s (z) ∣ \geq ε) \leq 2 exp (- \frac{p η ^{2} ε ^{2}}{8}) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRandom Matrices and Applications · Coding theory and cryptography · Advanced Algebra and Geometry

Full text

Random Matrices from Linear Codes and the Convergence to Wigner’s Semicircle Law

Chin Hei Chan and Maosheng Xiong

Abstract

Recently we considered a class of random matrices obtained by choosing distinct codewords at random from linear codes over finite fields and proved that under some natural algebraic conditions their empirical spectral distribution converges to Wigner’s semicircle law as the length of the codes goes to infinity. One of the conditions is that the dual distance of the codes is at least 5. In this paper, employing more advanced techniques related to Stieltjes transform, we show that the dual distance being at least 5 is sufficient to ensure the convergence, and the convergence rate is of the form $n^{-\beta}$ for some $0<\beta<1$ , where $n$ is the length of the code.

Index Terms:

Group randomness, linear code, dual distance, empirical spectral measure, random matrix theory, Wigner’s semicircle law.

I Introduction

Random matrix theory is the study of matrices whose entries are random variables. Of particular interest is the study of eigenvalue statistics of random matrices such as the empirical spectral measure. It has been broadly investigated in a wide variety of areas, including statistics [25], number theory [17], economics [18], theoretical physics [24] and communication theory [23].

Most of the matrix models considered in the literature were matrices whose entries have independent structures. In a series of work ([2, 3, 26]), initiated in [1], the authors studied a class of matrices formed by choosing codewords at random from linear codes over finite fields and ultimately proved the convergence of the empirical spectral distribution of their Gram matrices to the Marchenko-Pastur law under the condition that the minimum Hamming distance of the dual codes is at least 5. This is the first result relating the randomness of matrices from linear codes to the algebraic properties of the underlying dual codes, and can be interpreted as a joint randomness test for sequences from linear codes. It implies in particular that sequences from linear codes with desired properties behave like random sequences from the view point of random matrix theory. This is called a “group randomness” property in [1] and may have many applications (see [20, 21] from a different perspective).

Recently we considered a distinct normalization of matrices obtained in a similar fashion from linear codes and proved the convergence of the empirical spectral distribution to the Wigner’s semicircle law under some natural algebraic conditions of the underlying codes (see [10]). This is also a group randomness property of linear codes. In this paper we explore this new phenomenon much further.

I-A Statement of Main Results

To describe our results more precisely, we need some notation. Let $\mathscr{C}=\{\mathcal{C}_{i}:i\geq 1\}$ be a family of linear codes of length $n_{i}$ and dimension $k_{i}$ over the finite field $\mathbb{F}_{q}$ of $q$ elements ( $\mathcal{C}_{i}$ is called an $[n_{i},k_{i}]_{q}$ code for short), where $q$ is a prime power. The most interesting case is binary linear codes, corresponding to $q=2$ . Denote by $\mathcal{C}_{i}^{\bot}$ the dual code of $\mathcal{C}_{i}$ and $d_{i}^{\bot}$ the Hamming distance of $\mathcal{C}_{i}^{\bot}$ . $d_{i}^{\bot}$ is also called the dual distance of $\mathcal{C}_{i}$ .

The standard additive character of $\mathbb{F}_{q}$ extends component-wise to a natural mapping $\psi:\mathbb{F}_{q}^{n_{i}}\to(\mathbb{C}^{*})^{n_{i}}$ . For each $i$ , we choose $p_{i}$ distinct codewords from $\mathcal{C}_{i}$ and apply the mapping $\psi$ . Endowing with uniform probability on the choice of the $p_{i}$ codewords, this forms a probability space. Put the $p_{i}$ distinct sequences as the rows of a $p_{i}\times n_{i}$ random matrix $\Phi_{\mathcal{C}_{i}}$ . Denote

[TABLE]

where $\Phi_{\mathcal{C}_{i}}^{*}$ is the conjugate transpose of the matrix $\Phi_{\mathcal{C}_{i}}$ and define

[TABLE]

Here $I_{p_{i}}$ is the $p_{i}\times p_{i}$ identity matrix.

For any $n\times n$ matrix $\mathbf{A}$ with eigenvalues $\lambda_{1},\ldots,\lambda_{n}$ , the spectral measure of $\mathbf{A}$ is defined by

[TABLE]

where $\delta_{\lambda}$ is the Dirac measure at the point $\lambda$ . The empirical spectral distribution of $\mathbf{A}$ is defined by

[TABLE]

Our first main result is as follows:

Theorem 1.

Suppose $p_{i},\frac{n_{i}}{p_{i}}\to\infty$ simultaneously as $i\to\infty$ . If $d^{\bot}_{i}\geq 5$ for any $i$ , then as $i\to\infty$ , we have

[TABLE]

and the convergence is uniform for all intervals $\mathcal{I}\subset\mathbb{R}$ . Here $\mu_{n_{i}}$ is the spectral measure of the matrix $M_{\mathcal{C}_{i}}$ and $\varrho_{\mathrm{SC}}$ is the probability measure of the semicircle law whose density function is given by

[TABLE]

and $\mathbbm{1}_{[-2,2]}$ is the indicator function of the interval $[-2,2]$ .

We remark that originally in [10] the same convergence (3) was proved with an extra condition that there is a fixed constant $c>0$ independent of $i$ such that

[TABLE]

The condition (5) is natural as explained in [10], and when $q=2$ , it is equivalent to

[TABLE]

where $\mathrm{wt}(\mathbf{c})$ is the Hamming weight of the codeword $\mathbf{c}$ . It is interesting that this extra condition can be dropped. Now the result of Theorem 1 has the same strength as that of [26] where the condition $d_{i}^{\bot}\geq 5$ alone is sufficient to ensure the convergence. It shall be noted that similar to [26], the condition $d_{i}^{\bot}\geq 5$ in Theorem 1 is optimal because if $d_{i}^{\bot}=4\,\forall i$ , then Conclusion (3) is false for first-order binary Reed-Muller codes which have dual distance $4$ .

Our second main result shows that the rate of convergence (3) is fast with respect to the length of the codes.

Theorem 2.

Let $\mathcal{C}$ be an $[n,k]_{q}$ code with dual distance $d^{\bot}\geq 5$ . For fixed constants $\gamma_{1},\gamma_{2}\in(0,1)$ and $c\geq 1$ , suppose $p$ and $n$ satisfy

[TABLE]

Then

[TABLE]

uniformly for all intervals $\mathcal{I}\subset\mathbb{R}$ , where $\beta>0$ is given by

[TABLE]

We remark that the symbol “ $\prec$ ” in (6) is a standard “stochastic domination” notation in probability theory (see [8] for details), which means that for any $\varepsilon>0$ and any $D>0$ , there is a quantity $N(\varepsilon,D,c,\gamma_{1},\gamma_{2})$ , such that whenever $n\geq N(\varepsilon,D,c,\gamma_{1},\gamma_{2})$ , we have

[TABLE]

Here $\mathbb{P}$ is the probability within the space of picking $p$ distinct codewords from $\mathcal{C}$ and the supremum is taken over all intervals $\mathcal{I}\subset\mathbb{R}$ . Since $\varepsilon,D$ and $N(\varepsilon,D,c,\gamma_{1},\gamma_{2})$ do not depend on $\mathcal{C}$ , the supremum can be taken over all linear codes $\mathcal{C}$ of length $n$ over $\mathbb{F}_{q}$ with $d^{\bot}\geq 5$ .

We also remark that $d^{\bot}\geq 5$ is a very mild restriction on linear codes $\mathcal{C}$ , and there is an abundance of binary codes that satisfy this condition, for example, the Gold codes ([15]), some families of BCH codes (see [13, 14]) and many families of cyclic and linear codes studied in the literature (see for example [12, 22]). Such binary linear codes can also be generated by almost perfect nonlinear (APN) functions [9, 19], a special class of functions with important applications in cryptography.

I-B Simulations

We illustrate Theorems 1 and 2 by numerical experiments. We focus on binary Gold codes augmented by the all-1 vector. It is known that binary Gold codes have length $n=2^{m}-1$ , dimension $2m$ and dual distance 5. The augmented binary Gold codes has length $n$ , dimension $2m+1$ and dual distance at least 5. Because of the presence of the all-1 vector, the condition (5) is not satisfied. For each triple $(m,n,p)$ in the set $\{(5,31,8),(7,127,20),(9,511,35),(11,2047,50)\}$ , we randomly pick $p$ codewords from the augmented binary Gold code of length $n=2^{m}-1$ and form the corresponding matrix, from which we use Sage to compute the eigenvalues and plot the empirical spectral distribution along with Wigner’s distribution (see Figures 1 to 4 below). We do the above 10 times for each such triple $(m,n,p)$ and at each time, we find that the plots are almost the same as before: they are all very close to Wigner’s semicircle law and as the length $n$ increases, they become less and less distinguishable.

In order to illustrate more clearly the shape of the eigenvalue distribution, we also plot a density graph, which is shown in Figure 5. This is based on picking $p=100$ codewords from a binary Gold code of length $n=32767=2^{15}-1$ .

From (7) it is easy to see that $\beta\leq 1/12$ and the upper bound is achieved when $\gamma_{1}=\gamma_{2}=1/3$ . It might be possible to improve this value $\beta$ and hence obtain a better convergence rate. From the simulation results, however, it is not clear to us what the optimal $\beta$ that one may expect is.

I-C Techniques and relation to previous work

This paper strengthens [10, Theorem 2] on two fronts: in Theorem 1 we obtain the same convergence by removing the extra condition (5), and in Theorem 2 we obtain a strong and explicit convergence rate with respect to the length of the code, and the results were supported by computer simulations.

The main technique we use in this paper is the Stieltjes transform, a well-developed and standard tool in random matrix theory, and the method is essentially complex analysis. From the view point of random matrix theory, in [6, 7, 27] the authors have used Stieltjes transform to study similar matrix models with success, however, our matrices, arising from general linear codes over finite fields with dual distance 5, possess characteristics significantly different from [6, 7, 27]. With applications in mind, say, to generate pseudo-random matrices efficiently via linear codes, our matrices are more natural and interesting. None of the methods in previous works seem to apply directly to our setting. Instead we adopt methods from [4, 5, 8] and use a combination of ideas to obtain our final results.

Related to this paper, the authors in [11] have used Stieltjes transform to obtain a strong convergence rate which is similar in nature to Theorem 2 of this paper, hence extending the work [26], and some of the arguments are similar.

The paper is organized as follows. In Section II we introduce Stieltjes transform and related formulas and lemmas which will play important roles later. The main ideas of proving Theorems 1 and 2 share some similarity but technically speaking, they are quite involved, with the latter being even more so. To streamline the idea of the proofs, we assume a major technical statement (Theorem 5) from which we prove Theorems 1 and 2 in Sections III and IV respectively. Finally we prove the required Theorem 5 in Section V.

II Preliminaries

II-A Linear codes over $\mathbb{F}_{q}$ of dual distance at least 5

The standard additive character $\psi:\mathbb{F}_{q}\to\mathbb{C}^{*}$ is given by

[TABLE]

where $\mathrm{Tr}$ is the absolute trace mapping from $\mathbb{F}_{q}$ to its prime subfield $\mathbb{F}_{r}$ of order $r$ and $\zeta=\exp(2\pi\sqrt{-1}/r)$ is a (complex) primitive $r$ -th root of unity. In particular when $q=r=2$ , then $\zeta=-1$ and $\psi(a)=(-1)^{a}$ for $a\in\mathbb{F}_{2}$ . It is known that $\psi$ satisfies the following orthogonality relation:

[TABLE]

Let $\mathcal{C}$ be an $[n,k]_{q}$ linear code with dual distance $d^{\bot}\geq 5$ . By the sphere-packing bound [16, Theorem 1.12.1], we have

[TABLE]

here the implied constant in the big O-notation depends only on $q$ . From this we can obtain

[TABLE]

Since $\mathcal{C}$ is linear, the orthogonal relation (10) further implies that for any $\mathbf{a}\in\mathbb{F}_{q}^{n}$ , we have

[TABLE]

Here $\mathbf{a}\cdot\mathbf{c}$ is the usual inner product between the vectors $\mathbf{a}$ and $\mathbf{c}$ in $\mathbb{F}_{q}^{n}$ .

II-B Stieltjes Transform

In this section we recall some basic knowledge of Stieltjes transform. Interested readers may refer to [5, Chapter B.2] for more details. Stieltjes transform can be defined for any real function of bounded variation. For the case of interest to us, however, we confine ourselves to functions arising from probability theory.

Let $\mu$ be a probability measure and let $F$ be the corresponding cumulative distribution function. The Stieltjes transform of $F$ or $\mu$ is defined by

[TABLE]

where $z$ is a complex variable taking values in $\mathbb{C}^{+}:=\{z\in\mathbb{C}:\Im z>0\}$ , the upper half complex plane. Here $\Im z$ is the imaginary part of $z$ .

It is known that $s(z)$ is well-defined for all $z\in\mathbb{C}^{+}$ and is well-behaved, satisfying the following properties:

(i).

$s(z)\in\mathbb{C}^{+}$ for any $z\in\mathbb{C}^{+}$ ;

(ii).

$s(z)$ is analytic in $\mathbb{C}^{+}$ and

[TABLE]

where $\eta=\Im z>0$ ;

(iii).

the probability measure $\mu$ can be recovered from the Stieltjes transform $s(z)$ via the inverse formula (see [5]):

[TABLE]

(iv).

the convergence of Stieltjes transforms is equivalent to the convergence of the underlying probability measures (see for example [5, Theorem B.9]).

II-C Resolvent Identities and Formulas for Green function entries

Let $M$ be a Hermitian $p\times p$ matrix whose $(j,k)$ -th entry is $M_{jk}$ . Denote by $G$ the Green function of $M$ , that is,

[TABLE]

where $z\in\mathbb{C}^{+}$ . The $(j,k)$ -th entry of $G$ is $G_{jk}$ .

Given any subset $T\subset[1\mathrel{{.}\,{.}}\nobreak p]:=\{1,2,\cdots,p\}$ , let $M^{(T)}$ be the $p\times p$ matrix whose $(j,k)$ -th entry is given by $(M^{(T)})_{jk}:=\mathbbm{1}_{j,k\notin T}M_{jk}$ . In addition, let $G^{(T)}$ be the Green function of $M^{(T)}$ , that is,

[TABLE]

When $T$ is a singleton, say $\{\ell\}$ , it is common to further abbreviate the notation $G^{(\{\ell\})}$ as $G^{(\ell)}$ , and similar for other matrices.

Let $\mathbf{m}_{\ell}$ denote the $\ell$ -th column of $M$ . For $z\in\mathbb{C}^{+}$ and any $\ell\in[1\mathrel{{.}\,{.}}\nobreak p]\setminus T$ , we have the Schur complement formula (see [5, 8])

[TABLE]

where $G^{(T\ell)}:=G^{(T\cup\{\ell\})}$ and $\mathbf{m}_{\ell}^{*}$ is the conjugate transpose of $\mathbf{m}_{\ell}$ .

We also have the following eigenvalue interlacing property (see [5, 8])

[TABLE]

where $z=E+\mathrm{i}\eta\in\mathbb{C}^{+}$ , ${\bf Tr}$ is the trace function, and $C$ is a constant depending only on the set $T$ .

II-D Stieltjes Transform of the Semicircle Law

The Stieltjes transform $s_{\mathrm{SC}}$ of the semicircle distribution given in (4) can be computed as (see [5])

[TABLE]

Here and throughout this paper, we always pick the complex square root $\sqrt{\cdot}$ to be the one with positive imaginary part.

It is well-known that $s_{\mathrm{SC}}(z)$ is the unique function that satisfies the equation

[TABLE]

such that $\Im u(z)>0$ whenever $\eta:=\Im z>0$ .

II-E Convergence of Stieltjes Transform in Probability

In order to bound the convergence rate of a random Stieltjes transform in probability, we need the following well-known McDiarmid’s lemma from probability theory (see [8, Lemma F.3]).

Lemma 3 (McDiarmid).

Let $X_{1},\cdots,X_{p}$ be independent random variables taking values in the spaces $E_{1},\cdots,E_{p}$ respectively. Let

[TABLE]

be a measurable function and define the random variable $Y=f(X_{1},\cdots,X_{p})$ . Define, for each $k\in[1\mathrel{{.}\,{.}}\nobreak p]$ ,

[TABLE]

where the supremum is taken over all $x_{j}\in E_{j}$ for $j\neq k$ and $y,z\in E_{k}$ . Then for any $\varepsilon>0$ , we have

[TABLE]

We will need the following concentration inequality. We remark that a very similar concentration inequality was proved (see [8, Lemma F.4]). Here for the sake of completeness, we provide a detailed proof.

Lemma 4.

Let $\mathcal{M}$ be a $p\times n$ random matrix with independent rows, define $S=(n/p)^{1/2}(\mathcal{M}\mathcal{M}^{*}-I_{p})$ . Let $s(z)$ be the Stieltjes transform of the empirical spectral distribution of $S$ . Then for any $\varepsilon>0$ and $z=E+i\eta\in\mathbb{C}^{+}$ ,

[TABLE]

Proof of Lemma 4.

Applying Lemma 3, we take $X_{j}$ to be the $j$ -th row of $\mathcal{M}$ and the function $f$ to be the Stieltjes transform $s$ . Note that the $(j,k)$ -th entry of $S$ is a linear function of the inner product of the $j$ -th and $k$ -th rows of $\mathcal{M}$ . Hence changing one row of $\mathcal{M}$ only gives an additive perturbation of $S$ of rank at most two. Applying the resolvent identity [8, (2.3)], we see that the Green function is also only affected by an additive perturbation by a matrix of rank at most two and operator norm at most $2\eta^{-1}$ . Therefore the quantities $c_{k}$ in (19) can be bounded by

[TABLE]

Then the required result follows directly from inserting the above bound to (20). ∎

III Proof of Theorem 1

Throughout the paper, let $\mathcal{C}$ be an $[n,k]_{q}$ linear code over $\mathbb{F}_{q}$ . We always assume that its dual distance satisfies $d^{\bot}\geq 5$ . Denote $N=q^{k}$ . The standard additive character on $\mathbb{F}_{q}$ extends component-wise to a natural mapping $\psi:\mathbb{F}_{q}^{n}\to\mathbb{C}^{n}$ . Define $\mathcal{D}=\psi(\mathcal{C})$ .

III-A Problem set-up

Theorems 1 and 2 are for random matrices in the probability space $\Omega_{p,I}$ of choosing $p$ distinct elements uniformly from $\mathcal{D}$ . Denote by $\mathcal{D}^{p}$ the probability space of choosing $p$ elements from $\mathcal{D}$ independently and uniformly. Because $d^{\bot}\geq 5$ , from (11) we have

[TABLE]

as $n,p\to\infty$ . Thus to prove Theorems 1 and 2, it is equivalent to consider the larger probability space $\mathcal{D}^{p}$ . This will simplify the proofs.

Now let $\Phi_{n}$ be a $p\times n$ random matrix whose rows are picked from $\mathcal{D}$ uniformly and independently. Denote by $\mathbb{E}$ the expectation with respect to the probability space $\mathcal{D}^{p}$ . We may assume that $p:=p(n)$ is a function of $n$ such that $p,n/p\to\infty$ as $n\to\infty$ .

Let

[TABLE]

Let $\mu_{n}$ be the empirical spectral measure of $M_{n}$ and let $s_{M_{n}}(z)$ be its Stieltjes transform, that is,

[TABLE]

Here $\lambda_{1},\cdots,\lambda_{p}$ are the eigenvalues of the matrix $M_{n}$ , and $G:=G(z)$ is the Green function of $M_{n}$ given by

[TABLE]

Note that the Stieltjes transform $s_{M_{n}}(z)$ is itself a random variable in the space $\mathcal{D}^{p}$ . We define

[TABLE]

Throughout the paper, the complex value $z\in\mathbb{C}^{+}$ is always written as

[TABLE]

For a fixed constant $\tau\in(0,1)$ , we define

[TABLE]

Now we assume a result about the expected Stieltjes transform $s_{n}(z)$ .

Theorem 5.

For any $z\in\Gamma_{\tau}$ , we write

[TABLE]

Then we have

[TABLE]

We emphasize here that this is one of the major technical results in this paper and the proof is a little complicated. This is the only result in the paper that is directly related to the properties of linear codes. It requires $d^{\bot}\geq 5$ but not the extra condition (5) used in [10]. To streamline the presentation, here we assume Theorem 5, then Theorem 1 can be proved easily. The proof of Theorem 5 is postponed to Section V.

III-B Proof of Theorem 1

By properties of the Stieltjes transform (see [5, Theorem B.9]), to prove Theorem 1, it is equivalent to prove the following statement: For any $\varepsilon>0$ , we have

[TABLE]

We prove Statement (25) in several steps.

First, we fix an arbitrary value $z\in\mathbb{C}^{+}$ . The quadratic equation (24) has two solutions

[TABLE]

As $n\to\infty$ , from Theorem 5 we have $\Delta(z)\to 0$ , so $z-\Delta\in\mathbb{C}^{+}$ for large enough $n$ . Since $s_{n}(z),s_{\mathrm{SC}}(z)\in\mathbb{C}^{+}$ , we see that

[TABLE]

Then by the continuity of $s_{\mathrm{SC}}$ and by taking $n\to\infty$ , we obtain

[TABLE]

Moreover, by Lemma 4, for any fixed $\varepsilon>0$ , as $n\to\infty$ , we have

[TABLE]

This and (27) immediately imply

[TABLE]

Noting that (28) holds for any fixed $z\in\mathbb{C}^{+}$ and any $\varepsilon>0$ , so to prove (25), in the next step we need to show that the convergence is “uniform” for all $z\in\mathbb{C}^{+}$ . To do this, we adopt a simple lattice argument.

For any $\tau,\varepsilon\in(0,1)$ , define the sets

[TABLE]

and

[TABLE]

It is easy to see that $\mathbf{L}_{\tau,\varepsilon}\neq\emptyset$ and

[TABLE]

For any fixed $z\in\mathbb{C}^{+}$ , define $\Xi_{n,\varepsilon}(z)$ to be the event

[TABLE]

By (28), for any $\delta>0$ , there is an $N(z,\tau,\varepsilon,\delta)$ such that

[TABLE]

Here the set $\Xi_{n,\frac{\varepsilon}{2}}(z)^{\bf c}$ denotes the complement of the event $\Xi_{n,\frac{\varepsilon}{2}}(z)$ . Then for any $n$ such that

[TABLE]

we have

[TABLE]

Finally we consider the event $\bigcap_{z\in\mathbf{L}_{\tau,\varepsilon}}\Xi_{n,\frac{\varepsilon}{2}}(z)$ , that is,

[TABLE]

Recall from (13) that the Stieltjes transforms $s_{M_{n}}(z)$ and $s_{\mathrm{SC}}(z)$ are both $\tau^{-2}$ -Lipschitz on the set $\Gamma_{\tau}^{\prime}$ , and for any $z\in\Gamma_{\tau}^{\prime}$ , we can find one $z^{\prime}\in\mathbf{L}_{\tau,\varepsilon}$ such that

[TABLE]

So for this $z\in\Gamma_{\tau}^{\prime}$ we have

[TABLE]

This means that

[TABLE]

Therefore

[TABLE]

for any $n>N(\tau,\varepsilon,\delta)$ .

Hence for any $\tau,\varepsilon\in(0,1)$ , we have

[TABLE]

Taking the limit $\tau\to 0^{+}$ , we obtain the desired Statement (25). This completes the proof of Theorem 1.

IV Proof of Theorem 2

Now for fixed constants $c>1$ and $\gamma_{1},\gamma_{2}\in(0,1)$ , let us assume

[TABLE]

Similar in proving Theorem 1 in the previous section, here we assume Theorem 5. Then the main idea of proving Theorem 2 is to provide a refined and quantitative version of Statement (25), so in each step of the proofs, we need to keep track of all the varying parameters as $n\to\infty$ .

First, the upper bound for $\Delta(z)$ in Theorem 5 can be simplified as

[TABLE]

where the constant $\beta>0$ is explicitly given in (7).

Let us define

[TABLE]

From now on, $C_{c,\tau}$ denotes some positive constant depending only on $c$ and $\tau$ whose value may vary at each occurrence. We can estimate the difference $|s_{n}(z)-s_{\mathrm{SC}}(z)|$ as follows.

Lemma 6.

For any $z\in\mathbf{S}_{\tau}$ , we have

[TABLE]

Proof of Lemma 6.

First, for large enough $n$ , noting that

[TABLE]

we see that Equation (26) holds for all $z\in\mathbf{S}_{\tau}$ . More precisely, we have

[TABLE]

By using the fact $\left|\frac{\mathrm{d}s_{\mathrm{SC}}(z)}{\mathrm{d}z}\right|\leq\eta^{-1}$ which can be easily checked from (17), we conclude that

[TABLE]

Then Lemma 6 is proved. ∎

Next we estimate the term $|s_{M_{n}}(z)-s_{\mathrm{SC}}(z)|$ . An $n$ -dependent event $\Xi$ is said to hold with high probability if for any $D>0$ , there is a quantity $N=N(D)>0$ such that $\mathbb{P}(\Xi)\geq 1-n^{-D}$ for any $n>N$ .

Theorem 7.

We have, with high probability,

[TABLE]

Proof of Theorem 7.

By the concentration inequality given in Lemma 4, we have

[TABLE]

Noting that the inequality (29) holds for any fixed $z\in\mathbf{S}_{\tau}$ . In order to prove Theorem 7, we need an upper bound which is uniform for all $z\in\mathbf{S}_{\tau}$ . We apply a lattice argument again.

Let

[TABLE]

Note that the set $\mathbf{L}_{\tau}\neq\emptyset$ and

[TABLE]

Also, for any $z\in\mathbf{S}_{\tau}$ and $\varepsilon>0$ , define $\mathcal{E}_{n,\varepsilon}(z)$ to be the event

[TABLE]

and $\mathcal{E}_{n,\varepsilon}(z)^{\bf c}$ the complement. Then (29) can be rewritten as

[TABLE]

So we have

[TABLE]

for any $D>0$ and $n>N(c,\gamma_{1},\gamma_{2},\tau,D)$ .

Finally we consider the event $\bigcap_{z\in\mathbf{L}_{\tau}}\mathcal{E}_{n,\frac{\tau}{2}}(z)$ , that is,

[TABLE]

Noting that for any $z\in\mathbf{S}_{\tau}$ , there is $z^{\prime}\in\mathbf{L}_{\tau}$ such that

[TABLE]

and that $s_{M_{n}}(z)$ and $s_{n}(z)$ are both $n^{2\beta}$ -Lipschitz on $\mathbf{S}_{\tau}$ , we obtain, for any $z\in\mathbf{S}_{\tau}$ ,

[TABLE]

This means that

[TABLE]

Hence by (30) we have

[TABLE]

for all $n>N(c,\gamma_{1},\gamma_{2},\tau,D)$ .

Combining the above inequality with Lemma 6 completes the proof of Theorem 7. ∎

Proof of Theorem 2.

As a standard application of the Helffer-Sjöstrand formula via complex analysis, Theorem 2 can be derived directly from Theorem 7. This is quite well-known, and the computation is routine. Interested readers may refer to [8, Section 8] for a very similar analysis. We omit the details. ∎

V Proof of Theorem 5

In this section we give a detailed proof of Theorem 5, where the condition that $d^{\bot}\geq 5$ plays an important role.

Recall from the beginning of Section III that $\mathcal{C}$ is a linear code of length $n$ over $\mathbb{F}_{q}$ with $d^{\bot}\geq 5$ , $\psi$ is the standard additive character on $\mathbb{F}_{q}$ , extended component-wisely to $\mathbb{F}_{q}^{n}$ , $\mathcal{D}=\psi(\mathcal{C})$ , and $\Phi_{n}$ is a $p\times n$ random matrix whose rows are selected uniformly and independently from $\mathcal{D}$ . This makes $\mathcal{D}^{p}$ a probability space, on which we use $\mathbb{E}$ to denote the expectation. Let $\mathcal{G}_{n}$ and $M_{n}$ be defined as in (21). Since all the entries of $\Phi_{n}$ are roots of unity, the diagonal entries of $M_{n}$ are all zero.

Let $x_{jk}$ be the $(j,k)$ -th entry of $\Phi_{n}$ . The following properties of $x_{jk}$ , while very simple, depend crucially on the condition that $d^{\bot}\geq 5$ .

Lemma 8.

For any $\ell\in[1\mathrel{{.}\,{.}}\nobreak p]$ , we have

(a) $\mathbb{E}(x_{\ell j}\overline{x}_{\ell k})=0$ if $j\neq k$ ;

(b) $\mathbb{E}(x_{\ell j}x_{\ell t}\overline{x}_{\ell k}\overline{x}_{\ell s})=0$ if the indices $j,t,k,s$ do not come in pairs; If the indices come in pairs, then $|\mathbb{E}(x_{\ell j}x_{\ell t}\overline{x}_{\ell k}\overline{x}_{\ell s})|\leq 1$ .

Proof of Lemma 8.

(a) It is easy to see that

[TABLE]

where $\mathbf{c}=(c_{1},c_{2},\cdots,c_{n})\in\mathcal{C}$ and $\mathbf{a}_{1}=(0,\cdots,0,1,0\cdots,0,-1,0\cdots,0)\in\mathbb{F}_{q}^{n}$ . Here in $\mathbf{a}_{1}$ the 1 and $-1$ appear at the $j$ -th and $k$ -th entries respectively. Since $d^{\bot}\geq 5$ , we have $\mathbf{a}_{1}\notin\mathcal{C}^{\bot}$ , and the desired result follows directly from (12).

(b) It is easy to see that

[TABLE]

where the vector $\mathbf{a}_{2}\in\mathbb{F}_{q}^{n}$ is formed from the all-zero vector by adding $1$ s to the $j$ -th and $t$ -th entries and then adding $-1$ s from the $k$ -th and $s$ -th entries. If the indices $j,t,k,s$ do not come in pairs, then $0\neq\mathrm{wt}(\mathbf{a}_{2})\leq 4$ . Since $d^{\bot}\geq 5$ , we have $\mathbb{E}(x_{\ell j}x_{\ell t}\overline{x}_{\ell k}\overline{x}_{\ell s})=0$ by (12). The second statement of (b) is trivial since $|x_{ij}|=1$ for any $i,j$ . ∎

For any $\ell\in[1\mathrel{{.}\,{.}}\nobreak p]$ , let $\Phi_{n}^{(\ell)}$ be the $p\times n$ matrix obtained from $\Phi_{n}$ by changing the whole $\ell$ -th row to 0. Define

[TABLE]

Denote by $\omega(\ell)$ the $\ell$ -th row of $\Phi_{n}$ , and $\mathbf{m}_{\ell}$ the $\ell$ -th column of $M_{n}$ . It is easy to see that

[TABLE]

Let

[TABLE]

be the Green functions of $M_{n}$ and $M_{n}^{(\ell)}$ respectively for the complex variable $z\in\mathbb{C}^{+}$ .

For the Green function $G$ , we start with the resolvent identity (15) for $T=\emptyset$ . Using (31), we can express the third term on the right side of (15) as

[TABLE]

By the identity

[TABLE]

the right hand side can be further expressed as

[TABLE]

where

[TABLE]

Here the indices $j,k$ vary in $[1\mathrel{{.}\,{.}}\nobreak n]$ and $a_{jk}$ ’s are the $(jk)$ -th entry of the $n\times n$ matrix $(a_{jk})$ given by

[TABLE]

Hence the resolvent identity (15) yields

[TABLE]

Expanding the second term on the right, we obtain

[TABLE]

where

[TABLE]

V-A Estimates of $Z_{\ell}$ and $Y_{\ell}$

The random variables $Z_{\ell}$ and $Y_{\ell}$ depend on the complex value $z=E+\mathrm{i}\eta\in\mathbb{C}^{+}$ . For any fixed constant $\tau>0$ , recall $\Gamma_{\tau}$ defined in (23). Throughout this section we always assume $z\in\Gamma_{\tau}$ .

Lemma 9.

Let $z\in\Gamma_{\tau}$ . Then for any $\ell\in[1\mathrel{{.}\,{.}}\nobreak p]$ , we have

(a) $\mathbb{E}^{(\ell)}Z_{\ell}=\mathbb{E}Z_{\ell}=0$ . Here $\mathbb{E}^{(\ell)}$ is the conditional expectation given $\{x_{jk}:j\neq\ell\}$ ;

(b) $\mathbb{E}|Z_{\ell}|^{2}=O_{\tau}(p^{-1}\eta^{-2})$ .

Proof of Lemma 9.

(a) Since the rows of $\Phi_{n}$ are independent, the entries $a_{jk}$ as defined in (33) are independent with $x_{\ell j}$ and $x_{\ell k}$ . Hence from the definition of $Z_{\ell}$ in (32) and statement (a) of Lemma 8, we have

[TABLE]

The proof of the result on $\mathbb{E}Z_{\ell}$ is similar by replacing $a_{jk}$ with $\mathbb{E}a_{jk}$ .

(b) Expanding $|Z_{\ell}|^{2}$ and taking expectation $\mathbb{E}$ inside, noting that the rows of $\Phi_{n}$ are independent, we have

[TABLE]

Since $d^{\bot}\geq 5$ , by using statement (b) of Lemma 8, we find

[TABLE]

where $C$ is an absolute constant which may be different in each appearance. Using the definition of $(a_{jk})$ in (33) we have

[TABLE]

Expanding the terms on the right, we can easily obtain

[TABLE]

Here $\lambda_{j}^{(\ell)}\in\mathbb{R}(1\leq j\leq p)$ are the eigenvalues of $M_{n}^{(\ell)}$ , and $C_{\tau}$ is a positive constant depending only on $\tau$ whose value may vary in each occurrence. ∎

The above estimations lead to the following estimations of $Y_{\ell}$ .

Lemma 10.

Let $z\in\Gamma_{\tau}$ . Then for any $\ell\in[1\mathrel{{.}\,{.}}\nobreak p]$ , we have

(a) $\mathbb{E}Y_{\ell}=O_{\tau}\left(\eta^{-1}(p^{-1}+(p/n)^{\frac{1}{2}})\right)$ ;

(b) $\mathbb{E}|Y_{\ell}|^{2}=O_{\tau}\left(\eta^{-2}(p^{-1}+p/n)\right)$ .

Proof of Lemma 10.

(a) Taking expectation on $Y_{\ell}$ in (35) and noting that $\mathbb{E}Z_{\ell}=0$ , we get

[TABLE]

By the eigenvalue interlacing property in (16) and the trivial bound $|G_{jj}^{(\ell)}|\leq\eta^{-1}$ , we get

[TABLE]

(b) We split $\mathbb{E}|Y_{\ell}|^{2}$ as

[TABLE]

where

[TABLE]

We first estimate $V_{1}$ . Using (a) of Lemma 9, we see that

[TABLE]

Hence by (b) of Lemma 9 we obtain

[TABLE]

Next we estimate $V_{2}$ . Again by Lemma 9 we have

[TABLE]

So we have

[TABLE]

Here we denote $T_{0}:=\emptyset$ and $T_{m}:=[1\mathrel{{.}\,{.}}\nobreak m]$ for any $m\in[1\mathrel{{.}\,{.}}\nobreak p]$ , and for any subset $T\subset[1\mathrel{{.}\,{.}}\nobreak p]$ , we denote $\mathbb{E}^{(T)}$ to be the conditional expectation given $\{x_{jk}:j\notin T\}$ . The second equality follows from applying successively the law of total variance to the rows of $\Phi_{n}$ .

For $m\neq\ell$ , writing $\gamma_{m}:=\mathbb{E}^{(T_{m-1})}{\bf Tr}G^{(\ell)}-\mathbb{E}^{(T_{m})}{\bf Tr}G^{(\ell)}$ , we can easily check that

[TABLE]

where $\sigma_{m}:={\bf Tr}G^{(\ell)}-{\bf Tr}G^{(\ell,m)}$ . By (16) we have $|\gamma_{m}|\leq C\eta^{-1}$ . Hence we obtain

[TABLE]

Plugging the estimates of $\mathbb{E}Y_{\ell}$ in statement (a), $V_{1}$ in (37) and $V_{2}$ above into the equation (36), we obtain the desired estimate of $\mathbb{E}|Y_{\ell}|^{2}$ . ∎

V-B Proof of Theorem 5

We can now complete the proof of Theorem 5.

Proof of Theorem 5.

We write (34) as

[TABLE]

where

[TABLE]

Taking expectations on both sides of (39), we can obtain

[TABLE]

where

[TABLE]

and

[TABLE]

For $A_{\ell}$ , since

[TABLE]

we obtain

[TABLE]

For $\Delta_{\ell}$ , using the fact that $|\alpha_{n}|\geq\eta$ and Lemma 10 we obtain

[TABLE]

for any $z\in\Gamma_{\tau}$ .

Summing for all $\ell\in[1\mathrel{{.}\,{.}}\nobreak p]$ and then dividing $p$ on both sides of (40), it is easy to see that in writing

[TABLE]

the quantity $\Delta(z)$ satisfies the same bound as $\Delta_{\ell}$ above. This completes the proof of Theorem 5. ∎

Acknowledgments

The research of M. Xiong was supported by RGC grant number 16303615 from Hong Kong.

Bibliography27

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] B. Babadi, S. S. Ghassemzadeh and V. Tarokh, “Group randomness properties of pseudo-noise and Gold sequences,” Proc. 12th Can. Workshop Inf. Theory (CWIT) (2011), 42–46.
2[2] B. Babadi and V. Tarokh, “Spectral distribution of random matrices from binary linear block codes,” IEEE Trans. Inf. Theory 57 (2011), no. 6, 3955–3962.
3[3] B. Babadi and V. Tarokh, “Spectral distribution of product of pseudorandom matrices formed from binary block codes”, IEEE Trans. Inform. Theory 59 (2013), no. 2, 970–978.
4[4] Z. Bai, “Convergence Rate of Expected Spectral Distributions of Large Random Matrices. Part I. Wigner Matrices,” The Annals of Probability 21 (1993), no. 2, 625–648.
5[5] Z. Bai and J. W. Silverstein, Spectral Analysis of Large Dimensional Random Matrices , 2nd ed. New York, NY 10013, USA: Springer Series in Statistics, 2010.
6[6] Z. Bai and Y. Yin, “Convergence to the semicircle law,” Ann. Probab. 16 (1988), no. 2, 863–875.
7[7] Z. Bao, “Strong convergence of ESD for the generalized sample covariance matrices when p / n → 0 → 𝑝 𝑛 0 p/n\to 0 ”, Statist. Probab. Lett. 82 (2012), no. 5, 894–901.
8[8] F. Benaych-Georges and A. Knowles, Lectures on the local semicircle law for Wigner matrices , 2016.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Random Matrices from Linear Codes and the Convergence to Wigner’s Semicircle Law

Abstract

Index Terms:

I Introduction

I-A Statement of Main Results

Theorem 1**.**

Theorem 2**.**

I-B Simulations

I-C Techniques and relation to previous work

II Preliminaries

II-A Linear codes over Fq\mathbb{F}_{q}Fq​ of dual distance at least 5

II-B Stieltjes Transform

II-C Resolvent Identities and Formulas for Green function entries

II-D Stieltjes Transform of the Semicircle Law

II-E Convergence of Stieltjes Transform in Probability

Lemma 3** (McDiarmid).**

Lemma 4**.**

Proof of Lemma 4.

III Proof of Theorem 1

III-A Problem set-up

Theorem 5**.**

III-B Proof of Theorem 1

IV Proof of Theorem 2

Lemma 6**.**

Proof of Lemma 6.

Theorem 7**.**

Proof of Theorem 7.

Proof of Theorem 2.

V Proof of Theorem 5

Lemma 8**.**

Proof of Lemma 8.

V-A Estimates of ZℓZ_{\ell}Zℓ​ and YℓY_{\ell}Yℓ​

Lemma 9**.**

Proof of Lemma 9.

Lemma 10**.**

Proof of Lemma 10.

V-B Proof of Theorem 5

Proof of Theorem 5.

Acknowledgments

Theorem 1.

Theorem 2.

II-A Linear codes over $\mathbb{F}_{q}$ of dual distance at least 5

Lemma 3 (McDiarmid).

Lemma 4.

Theorem 5.

Lemma 6.

Theorem 7.

Lemma 8.

V-A Estimates of $Z_{\ell}$ and $Y_{\ell}$

Lemma 9.

Lemma 10.