Estimates of norms of log-concave random matrices with dependent entries

Marta Strzelecka

arXiv:1902.01150·math.PR·February 5, 2025

Estimates of norms of log-concave random matrices with dependent entries

Marta Strzelecka

PDF

TL;DR

This paper provides estimates for the expected operator norms of certain log-concave random matrices with dependent entries, extending previous results for Gaussian matrices and achieving near-optimal bounds.

Contribution

It generalizes existing bounds to matrices with dependent log-concave entries and introduces bounds for matrices with Gaussian mixture entries.

Findings

01

Expected norms are estimated up to logarithmic factors.

02

Results extend to matrices with dependent entries and Gaussian mixtures.

03

Bounds are shown to be near-optimal.

Abstract

We prove estimates for $E ∥ X : ℓ_{p^{'}}^{n} \to ℓ_{q}^{m} ∥$ for $p, q \geq 2$ and any random matrix $X$ having the entries of the form $a_{ij} Y_{ij}$ , where $Y = (Y_{ij})_{1 \leq i \leq m, 1 \leq j \leq n}$ has i.i.d. isotropic log-concave rows. This generalises the result of Gu\'edon, Hinrichs, Litvak, and Prochno for Gaussian matrices with independent entries. Our estimate is optimal up to logarithmic factors. As a byproduct we provide the analogue bound for $m \times n$ random matrices, which entries form an unconditional vector in $R^{mn}$ . We also prove bounds for norms of matrices which entries are certain Gaussian mixtures.

Equations174

\mathbb{E}\|X\|_{2,2}\lesssim\max_{i}\Bigl{(}\sum_{j}\mathbb{E}X_{ij}^{2}\Bigr{)}^{1/2}+\max_{j}\Bigl{(}\sum_{i}\mathbb{E}X_{ij}^{2}\Bigr{)}^{1/2}+\Bigl{(}\sum_{i,j}\mathbb{E}X_{ij}^{4}\Bigr{)}^{1/4}.

\mathbb{E}\|X\|_{2,2}\lesssim\max_{i}\Bigl{(}\sum_{j}\mathbb{E}X_{ij}^{2}\Bigr{)}^{1/2}+\max_{j}\Bigl{(}\sum_{i}\mathbb{E}X_{ij}^{2}\Bigr{)}^{1/2}+\Bigl{(}\sum_{i,j}\mathbb{E}X_{ij}^{4}\Bigr{)}^{1/4}.

\mathbb{E}\|X\|_{p^{\prime},q}\leq C(p,q)\biggl{[}\bigl{(}\log m\bigr{)}^{1/q}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|a_{ij}|^{p}\Bigr{)}^{1/p}+\max_{1\leq j\leq n}\Bigl{(}\sum_{i=1}^{m}|a_{ij}|^{q}\Bigr{)}^{1/q}\\ +\bigl{(}\log m\bigr{)}^{1/q}\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\biggr{]}.

\mathbb{E}\|X\|_{p^{\prime},q}\leq C(p,q)\biggl{[}\bigl{(}\log m\bigr{)}^{1/q}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|a_{ij}|^{p}\Bigr{)}^{1/p}+\max_{1\leq j\leq n}\Bigl{(}\sum_{i=1}^{m}|a_{ij}|^{q}\Bigr{)}^{1/q}\\ +\bigl{(}\log m\bigr{)}^{1/q}\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\biggr{]}.

\mathbb{P}\bigl{(}X\in\lambda K+(1-\lambda)L\bigr{)}\geq\mathbb{P}(X\in K)^{\lambda}\mathbb{P}(X\in L)^{1-\lambda}.

\mathbb{P}\bigl{(}X\in\lambda K+(1-\lambda)L\bigr{)}\geq\mathbb{P}(X\in K)^{\lambda}\mathbb{P}(X\in L)^{1-\lambda}.

\displaystyle\mathbb{E}\|X\|_{p^{\prime},q}\leq C(p,q)\Bigl{[}\bigl{(}\log m\bigr{)}^{1/q}\max_{1\leq i\leq m}\bigl{\|}A_{i}\bigr{\|}_{p}+\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}+\bigl{(}\log m\bigr{)}^{1+\frac{1}{q}}\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\Bigr{]},

\displaystyle\mathbb{E}\|X\|_{p^{\prime},q}\leq C(p,q)\Bigl{[}\bigl{(}\log m\bigr{)}^{1/q}\max_{1\leq i\leq m}\bigl{\|}A_{i}\bigr{\|}_{p}+\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}+\bigl{(}\log m\bigr{)}^{1+\frac{1}{q}}\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\Bigr{]},

\displaystyle\mathbb{E}\|X\|_{p^{\prime},q}=\mathbb{E}\sup_{u\in\ell_{p^{\prime}}^{n}}\|Xu\|_{q}\geq\mathbb{E}\|X^{(j)}\|_{q}=\mathbb{E}\bigl{\|}\bigl{(}|Y_{ij}|A_{ij}\bigr{)}_{i}\bigr{\|}_{q}\geq(2C_{1})^{-1}\|A^{(j)}\|_{q}.

\displaystyle\mathbb{E}\|X\|_{p^{\prime},q}=\mathbb{E}\sup_{u\in\ell_{p^{\prime}}^{n}}\|Xu\|_{q}\geq\mathbb{E}\|X^{(j)}\|_{q}=\mathbb{E}\bigl{\|}\bigl{(}|Y_{ij}|A_{ij}\bigr{)}_{i}\bigr{\|}_{q}\geq(2C_{1})^{-1}\|A^{(j)}\|_{q}.

∥ X ∥_{p^{'}, q} = u \in ℓ_{p^{'}}^{n} sup v \in ℓ_{q^{'}}^{n} sup v^{T} X u \geq ∣ X_{ij} ∣.

∥ X ∥_{p^{'}, q} = u \in ℓ_{p^{'}}^{n} sup v \in ℓ_{q^{'}}^{n} sup v^{T} X u \geq ∣ X_{ij} ∣.

\displaystyle\mathbb{E}\|X\|_{p^{\prime},q}\geq(4\Cr{seminorms}+1)^{-1}\Bigl{[}\max_{1\leq i\leq m}\bigl{\|}A_{i}\bigr{\|}_{p}+\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}+\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\Bigr{]},

\displaystyle\mathbb{E}\|X\|_{p^{\prime},q}\geq(4\Cr{seminorms}+1)^{-1}\Bigl{[}\max_{1\leq i\leq m}\bigl{\|}A_{i}\bigr{\|}_{p}+\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}+\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\Bigr{]},

\mathbb{E}\|X\|_{p^{\prime},q}\leq C(p,q)\biggl{(}(\log m)^{1+\frac{1}{q}}\mathbb{E}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|X_{ij}|^{p}\Bigr{)}^{1/p}+\mathbb{E}\max_{1\leq j\leq n}\Bigl{(}\sum_{i=1}^{m}|X_{ij}|^{q}\Bigr{)}^{1/q}\biggr{)}.

\mathbb{E}\|X\|_{p^{\prime},q}\leq C(p,q)\biggl{(}(\log m)^{1+\frac{1}{q}}\mathbb{E}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|X_{ij}|^{p}\Bigr{)}^{1/p}+\mathbb{E}\max_{1\leq j\leq n}\Bigl{(}\sum_{i=1}^{m}|X_{ij}|^{q}\Bigr{)}^{1/q}\biggr{)}.

\mathbb{E}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|A_{ij}Y_{ij}|^{p}\Bigr{)}^{1/p}+\mathbb{E}\max_{1\leq j\leq n}\Bigl{(}\sum_{i=1}^{m}|A_{ij}Y_{ij}|^{q}\Bigr{)}^{1/q}\\ \leq C\Bigl{(}p^{2}\max_{1\leq i\leq m}\bigl{\|}A_{i}\bigr{\|}_{p}+q^{2}\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}+(p+q)\log(m\vee n)\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|A_{ij}Y_{ij}|\Bigr{)},

\mathbb{E}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|A_{ij}Y_{ij}|^{p}\Bigr{)}^{1/p}+\mathbb{E}\max_{1\leq j\leq n}\Bigl{(}\sum_{i=1}^{m}|A_{ij}Y_{ij}|^{q}\Bigr{)}^{1/q}\\ \leq C\Bigl{(}p^{2}\max_{1\leq i\leq m}\bigl{\|}A_{i}\bigr{\|}_{p}+q^{2}\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}+(p+q)\log(m\vee n)\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|A_{ij}Y_{ij}|\Bigr{)},

\mathbb{E}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|B_{ij}Y_{ij}|^{p}\Bigr{)}^{1/p}\lesssim p^{2}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|B_{ij}|^{p}\Bigr{)}^{1/p}+p\log(m\vee n)\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|B_{ij}Y_{ij}|.

\mathbb{E}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|B_{ij}Y_{ij}|^{p}\Bigr{)}^{1/p}\lesssim p^{2}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|B_{ij}|^{p}\Bigr{)}^{1/p}+p\log(m\vee n)\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|B_{ij}Y_{ij}|.

\mathbb{E}\|X\|_{p^{\prime},q}\leq C(p,q)\biggl{(}(\log m)^{\frac{3}{2}+\frac{1}{q}}\mathbb{E}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|X_{ij}|^{p}\Bigr{)}^{1/p}\\ +\sqrt{\log n}\mathbb{E}\max_{1\leq j\leq n}\Bigl{(}\sum_{i=1}^{m}|X_{ij}|^{q}\Bigr{)}^{1/q}\biggr{)},

\mathbb{E}\|X\|_{p^{\prime},q}\leq C(p,q)\biggl{(}(\log m)^{\frac{3}{2}+\frac{1}{q}}\mathbb{E}\max_{1\leq i\leq m}\Bigl{(}\sum_{j=1}^{n}|X_{ij}|^{p}\Bigr{)}^{1/p}\\ +\sqrt{\log n}\mathbb{E}\max_{1\leq j\leq n}\Bigl{(}\sum_{i=1}^{m}|X_{ij}|^{q}\Bigr{)}^{1/q}\biggr{)},

(E f (Z)^{p})^{1/ p} \leq \Cl se min or m s \frac{p}{q} (E f (Z)^{q})^{1/ q} for p \geq q \geq 1

(E f (Z)^{p})^{1/ p} \leq \Cl se min or m s \frac{p}{q} (E f (Z)^{q})^{1/ q} for p \geq q \geq 1

(\mathbb{E}\|Z\|_{p}^{q})^{1/q}\leq Cp\Bigl{(}\mathbb{E}\|Z\|_{p}+\sigma_{p,X}(q)\Bigr{)}\quad\mbox{ for }q\geq 1,

(\mathbb{E}\|Z\|_{p}^{q})^{1/q}\leq Cp\Bigl{(}\mathbb{E}\|Z\|_{p}+\sigma_{p,X}(q)\Bigr{)}\quad\mbox{ for }q\geq 1,

\sigma_{p,X}(q):=\sup_{t\in B_{p^{\prime}}^{n}}\Bigl{\|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{\|}_{q}

\sigma_{p,X}(q):=\sup_{t\in B_{p^{\prime}}^{n}}\Bigl{\|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{\|}_{q}

\mathbb{P}\Bigl{(}\|Z\|_{p}\geq\Cl{d1}p\bigl{(}u+\mathbb{E}\|Z\|_{p}\bigr{)}\Bigr{)}\leq\Cl{d2}\sup_{t\in B_{p^{\prime}}^{n}}\mathbb{P}\biggl{(}\Bigl{|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{|}\geq u\biggr{)}.

\mathbb{P}\Bigl{(}\|Z\|_{p}\geq\Cl{d1}p\bigl{(}u+\mathbb{E}\|Z\|_{p}\bigr{)}\Bigr{)}\leq\Cl{d2}\sup_{t\in B_{p^{\prime}}^{n}}\mathbb{P}\biggl{(}\Bigl{|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{|}\geq u\biggr{)}.

\displaystyle\mathbb{P}\Biggl{(}\Bigl{|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{|}\geq\frac{1}{2}\biggl{\|}\sum_{i=1}^{n}t_{i}Z_{i}\biggr{\|}_{q}\Biggr{)}

\displaystyle\mathbb{P}\Biggl{(}\Bigl{|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{|}\geq\frac{1}{2}\biggl{\|}\sum_{i=1}^{n}t_{i}Z_{i}\biggr{\|}_{q}\Biggr{)}

\displaystyle\geq(1-2^{-q})^{2}\Biggl{(}\frac{\bigl{\|}\sum_{i=1}^{n}t_{i}Z_{i}\bigr{\|}_{q}}{\bigl{\|}\sum_{i=1}^{n}t_{i}Z_{i}\bigr{\|}_{2q}}\Biggr{)}^{2q}\geq e^{-\Cl{c4}q}.

\sup_{t\in B_{p^{\prime}}^{n}}\mathbb{P}\biggl{(}\Bigl{|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{|}\geq u\biggr{)}\geq e^{-2\Cr{c4}}

\sup_{t\in B_{p^{\prime}}^{n}}\mathbb{P}\biggl{(}\Bigl{|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{|}\geq u\biggr{)}\geq e^{-2\Cr{c4}}

q:=\sup\biggl{\{}r\geq 2\Cr{c4}\colon\ \sup_{t\in B_{p^{\prime}}^{n}}\Bigl{\|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{\|}_{r/\Cr{c4}}\leq 2u\biggr{\}}.

q:=\sup\biggl{\{}r\geq 2\Cr{c4}\colon\ \sup_{t\in B_{p^{\prime}}^{n}}\Bigl{\|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{\|}_{r/\Cr{c4}}\leq 2u\biggr{\}}.

\sup_{t\in B_{p^{\prime}}^{n}}\mathbb{P}\biggl{(}\Bigl{|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{|}\geq u\biggr{)}\geq e^{-q}.

\sup_{t\in B_{p^{\prime}}^{n}}\mathbb{P}\biggl{(}\Bigl{|}\sum_{i=1}^{n}t_{i}Z_{i}\Bigr{|}\geq u\biggr{)}\geq e^{-q}.

\mathbb{P}\bigl{(}S\geq\Cl{c5}p(\mathbb{E}S+u)\bigr{)}\leq\mathbb{P}(S\geq e\|S\|_{q})\leq e^{-q}

\mathbb{P}\bigl{(}S\geq\Cl{c5}p(\mathbb{E}S+u)\bigr{)}\leq\mathbb{P}(S\geq e\|S\|_{q})\leq e^{-q}

u:=\sup_{t\in B_{E}}\Bigl{(}\sum_{i=1}^{m}\mathbb{E}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}\Bigr{)}^{1/q},

u:=\sup_{t\in B_{E}}\Bigl{(}\sum_{i=1}^{m}\mathbb{E}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}\Bigr{)}^{1/q},

v:=\Bigl{(}\lambda^{8}\bigl{(}T_{2}(E^{*})\bigr{)}^{2}\log m\hskip 2.27626pt\mathbb{E}\max_{1\leq i\leq m}\|X_{i}\|_{E^{*}}^{q}\Bigr{)}^{1/q},

v:=\Bigl{(}\lambda^{8}\bigl{(}T_{2}(E^{*})\bigr{)}^{2}\log m\hskip 2.27626pt\mathbb{E}\max_{1\leq i\leq m}\|X_{i}\|_{E^{*}}^{q}\Bigr{)}^{1/q},

\biggl{[}\mathbb{E}\sup_{t\in B_{E}}\biggl{|}\sum_{i=1}^{m}\Bigl{(}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}-\mathbb{E}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}\Bigr{)}\biggr{|}\biggr{]}^{1/q}\leq C(\sqrt{uv}+v)\leq 2C(u+v).

\biggl{[}\mathbb{E}\sup_{t\in B_{E}}\biggl{|}\sum_{i=1}^{m}\Bigl{(}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}-\mathbb{E}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}\Bigr{)}\biggr{|}\biggr{]}^{1/q}\leq C(\sqrt{uv}+v)\leq 2C(u+v).

\displaystyle\Bigl{(}\mathbb{E}\max_{1\leq i\leq m}\|X_{i}\|_{p}^{q}\Bigr{)}^{1/q}\leq C(p,q)\Bigl{[}\max_{1\leq i\leq m}\|A_{i}\|_{p}+\log m\ \mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\Bigr{]},

\displaystyle\Bigl{(}\mathbb{E}\max_{1\leq i\leq m}\|X_{i}\|_{p}^{q}\Bigr{)}^{1/q}\leq C(p,q)\Bigl{[}\max_{1\leq i\leq m}\|A_{i}\|_{p}+\log m\ \mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\Bigr{]},

\sup_{t\in B_{p^{\prime}}^{n}}\biggl{(}\sum_{i=1}^{m}\mathbb{E}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}\biggr{)}^{1/q}\leq\Cr{seminorms}q\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}.

\sup_{t\in B_{p^{\prime}}^{n}}\biggl{(}\sum_{i=1}^{m}\mathbb{E}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}\biggr{)}^{1/q}\leq\Cr{seminorms}q\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}.

\mathbb{E}\max_{1\leq i\leq m}|a_{i}Z_{i}|\geq\frac{1}{\Cl{d3}}\max_{k\leq m}\bigr{(}a_{k}^{*}\min_{i\leq m}\|Z_{i}\|_{\log(k+1)}\bigl{)},

\mathbb{E}\max_{1\leq i\leq m}|a_{i}Z_{i}|\geq\frac{1}{\Cl{d3}}\max_{k\leq m}\bigr{(}a_{k}^{*}\min_{i\leq m}\|Z_{i}\|_{\log(k+1)}\bigl{)},

E ∥ X ∥_{p^{'}, q}

E ∥ X ∥_{p^{'}, q}

\displaystyle\leq\biggl{[}\mathbb{E}\sup_{t\in B_{p^{\prime}}^{n}}\biggl{|}\sum_{i=1}^{m}\Bigl{(}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}-\mathbb{E}\bigl{|}\langle X_{i},t\rangle\bigr{|}^{q}\Bigr{)}\biggr{|}\biggr{]}^{1/q}+\sup_{t\in B_{p^{\prime}}^{n}}\biggl{(}\mathbb{E}\sum_{i=1}^{m}\bigl{|}\langle t,X_{i}\rangle\bigr{|}^{q}\biggr{)}^{1/q}

\leq C \cdot (u + v)

\displaystyle\leq C(p,q)\Bigl{[}\bigl{(}\log m\bigr{)}^{1/q}\max_{1\leq i\leq m}\bigl{\|}A_{i}\bigr{\|}_{p}+\max_{1\leq j\leq n}\bigl{\|}A^{(j)}\bigr{\|}_{q}+\bigl{(}\log m\bigr{)}^{\frac{1}{q}+1}\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|X_{ij}|\Bigr{]}.

E 1 \leq i \leq k max ∣ a_{i} Z_{i} ∣ \geq C^{- 1} 1 \leq i \leq k min ∥ a_{i} Z_{i} ∥_{l o g (k + 1)} \geq C^{- 1} a_{k} 1 \leq i \leq m min ∥ Z_{i} ∥_{l o g (k + 1)} .

E 1 \leq i \leq k max ∣ a_{i} Z_{i} ∣ \geq C^{- 1} 1 \leq i \leq k min ∥ a_{i} Z_{i} ∥_{l o g (k + 1)} \geq C^{- 1} a_{k} 1 \leq i \leq m min ∥ Z_{i} ∥_{l o g (k + 1)} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Estimates of norms of log-concave random matrices with dependent entries

Marta Strzelecka

Institute of Mathematics, University of Warsaw, Banacha 2, 02–097 Warsaw, Poland.

[email protected]

(Date: February 4, 2019)

Abstract.

We prove estimates for $\mathbb{E}\|X:\ell_{p^{\prime}}^{n}\to\ell_{q}^{m}\|$ for $p,q\geq 2$ and any random matrix $X$ having the entries of the form $a_{ij}Y_{ij}$ , where $Y=(Y_{ij})_{1\leq i\leq m,1\leq j\leq n}$ has i.i.d. isotropic log-concave rows. This generalises the result of Guédon, Hinrichs, Litvak, and Prochno for Gaussian matrices with independent entries. Our estimate is optimal up to logarithmic factors. As a byproduct we provide the analogue bound for $m\times n$ random matrices, which entries form an unconditional vector in $\mathbb{R}^{mn}$ . We also prove bounds for norms of matrices which entries are certain Gaussian mixtures.

Key words and phrases:

Random matrices, operator norm, log-concave vectors, unconditional vectors.

2010 Mathematics Subject Classification:

60B20, 46B09, 15B52

The research was supported by the National Science Centre, Poland via the grants 2015/19/N/ST1/02661 and 2018/28/T/ST1/00001.

1. Introduction and main result

A classical result regarding spectra of random matrices is Wigner’s Semicircle Law, which describes the limit of empirical spectral measures of a random matrix with independent centred entries with equal variance. Theorems of this type say nothing about the largest eigenvalue (i.e. the operator norm). However, Seginer proved in [17] that for a random matrix $X$ with i.i.d. symmetric entries $\mathbb{E}\|X\|_{2,2}$ (by $\|A\|_{p,q}$ we denote the operator norm of the matrix $A$ from $\ell_{p}$ to $\ell_{q}$ ) is of the same order as the expectation of the maximum Euclidean norm of rows and columns of $X$ . The same holds true for the structured Gaussian matrices (i.e. when $X_{ij}=a_{ij}g_{ij}$ and $g_{ij}$ are i.i.d. standard Gaussian variables), as was recently shown by Latała, van Handel, and Youssef in [14], and up to a logarithmic factor for any $X$ with independent centred entries, see [16]. The advance of the two latest results is that they do not require that the entries of $X$ are equally distributed (nor that they have equal variances).

Another upper bound for $\mathbb{E}\|X\|_{2,2}$ also does not require equal distributions but only the independence of entries: by [9] we know that

[TABLE]

This bound is dimension free, but in some cases is worse than the one from [16].

Upper bounds for the expectation of other operator norms were investigated in [2] in the case of independent centred entries bounded by $1$ . For $q\geq 2$ and $m\times n$ matrices the authors proved that $\mathbb{E}\|X\|_{2,q}\lesssim\max\{m^{1/q},\sqrt{n}\}$ . In [6] Guédon, Hinrichs, Litvak, and Prochno proved that for a structured Gaussian matrix $X=(a_{ij}X_{ij})_{i\leq m,j\leq n}$ and $p,q\geq 2$ ,

[TABLE]

This estimate is optimal up to logarithmic factors (see Remark 1.2 below). Note that in the case $(p,q)\neq(2,2)$ moment method fails in estimating $\mathbb{E}\|X\|_{p^{\prime},q}$ (as it gives information only on the spectrum of $X$ ).

All the mentioned results require the independence of entries of $X$ . In this article we will see how to generalise the main result of [6] to a wide class of random matrices with independent uncorrelated log-concave rows, following the scheme of proof of the original theorem from [6]. In order to obtain the key estimates for log-concave vectors needed in the proof we use the comparison of weak and strong moments of $\ell_{p}$ -norm of $X$ from [11] and a Sudakov minoration-type bound from [10].

Our estimate is optimal (for fixed $p,q\geq 2$ ) up to a factor depending logarithmically on the dimension. Let us stress that we do not require the rows of $X$ to have independent, but only uncorrelated coordinates (and to be log-concave) — we require the independence only between the rows.

Before we state our main results, let us say a few words about log-concave vectors. We say that a random vector $X$ in $\mathbb{R}^{n}$ is log-concave, if for any compact nonempty sets $K,L\subset\mathbb{R}^{n}$ and $\lambda\in[0,1]$ ,

[TABLE]

The class of log-concave vectors is closed under linear transformations, convolutions and weak limits. By the result of Borell [3] an $n$ -dimensional vector with a full dimensional support is log-concave if and only if it has a log-concave density, i.e. has a density of the form $e^{-h}$ , where $h$ is a convex function with values in $(-\infty,\infty]$ .

Log-concave vectors are a natural generalisation of vectors distributed uniformly over convex bodies. Moreover, distribution of any log-concave vector can be obtained as a weak limit of projections of uniform measures over (higher dimensional) convex bodies (see for example [1]). Other results and conjectures about log-concave vectors are discussed in monograph [4].

We say that a vector $X$ in $\mathbb{R}^{n}$ is isotropic if $\operatorname{Cov}X=\operatorname{Id}$ . If $X$ is a log-concave random vector in $\mathbb{R}^{n}$ with full dimensional support, then there exists a linear transformation $T$ such that $\operatorname{Cov}(TX)=\operatorname{Id}$ , so the isotropicity is only a matter of normalisation.

To make the notation more clear, if $A=(A_{ij})_{i\leq m,j\leq n}$ is an $m\times n$ matrix, we denote by $A_{i}\in\mathbb{R}^{n}$ its $i$ -th row and by $A^{(j)}\in\mathbb{R}^{m}$ we denote its $j$ -th column. We are ready now to present the main theorem.

Theorem 1.1.

Let $m\geq 2$ , let $Y_{1},\ldots,Y_{m}$ be i.i.d. isotropic log-concave vectors in $\mathbb{R}^{n}$ , and let $A=(A_{ij})$ be an $m\times n$ (deterministic) matrix. Consider a random matrix $X$ with entries $X_{ij}=A_{ij}Y_{ij}$ for $i\leq m,j\leq n$ , where $Y_{ij}$ is the $j$ -th coordinate of $Y_{i}$ . Then for every $p,q\geq 2$ we have

[TABLE]

where $C(p,q)$ depends only on $p$ and $q$ .

*Remark 1.2**.*

Note that the bound from Theorem 1.1 is optimal up to a constant depending on $p,q$ and logarithmically on the dimension. Indeed, since $Y_{ij}$ is log-concave we have by the regularity of $Y_{ij}$ (see (2.1) below) that $\mathbb{E}|Y_{ij}|\geq(2\Cr{seminorms})^{-1}\big{(}\mathbb{E}Y_{ij}^{2}\bigr{)}^{1/2}=(2\Cr{seminorms})^{-1}$ . Hence for every $j\leq n$ , (we take $u=e_{j}$ , use the unconditionality of $\|\cdot\|_{q}$ and the Jensen inequality)

[TABLE]

Since $\|X\|_{p^{\prime},q}=\|X^{T}\|_{q^{\prime},p}$ , we also have $\mathbb{E}\|X\|_{p^{\prime},q}\geq(2C_{1})^{-1}\|A_{i}\|_{p}$ for all $i\leq m$ . Moreover, for all $i\leq m$ and $j\leq n$ , (we take $v=e_{i}$ and $u=e_{j}\operatorname*{sgn}X_{ij}$ )

[TABLE]

Therefore

[TABLE]

what yields the claim.

The next corollary is a version of Theorem 1.1 in the spirit of the aforementioned results from [17, 14, 16]. It follows directly from (1.3), and the Jensen inequality.

Corollary 1.3.

Under the assumptions of Theorem 1.1 we have

[TABLE]

*Remark 1.4**.*

If the rows and columns of $Y$ are isotropic and log-concave (we do not require independence), and $p,q\geq 1$ , then

[TABLE]

what means that the bound we used in the proof of Corollary 1.3 (the one which uses the Jensen inequality) may be reversed (in the log-concave setting) up to a logarithmic factor and constants depending only on $p$ and $q$ . Therefore the estimates from Theorem 1.1 and Corollary 1.3 are equivalent up to a logarithmic factor. Inequality (1.2) follows directly from the following proposition.

Proposition 1.5.

Let $Y$ be an $m\times n$ random matrix, with isotropic and log-concave rows, let $B$ be a deterministic $m\times n$ matrix, and let $p\geq 1$ . Then

[TABLE]

It turns out that instead of assuming the log-concavity, we may assume the unconditionality, i.e. that an $m\times n$ random matrix we consider, treated as an $(mn)$ -dimensional vector, is unconditional (we no longer assume the independence of rows). Recall that we say that a random vector $Z$ in $\mathbb{R}^{d}$ is unconditional, if for every choice of signs $\eta\in\{-1,1\}^{d}$ the vectors $Z$ and $(\eta_{i}Z_{i})_{i\leq d}$ are equally distributed (or, equivalently, that $Z$ and $(\varepsilon_{i}Z_{i})_{i\leq d}$ are equally distributed, where $\varepsilon_{1},\ldots,\varepsilon_{d}$ are i.i.d. symmetric Bernoulli variables, independent of $Z$ ). The assertion of the next corollary is expressed in the spirit of Corollary 1.3, which is more natural in the non log-concave setting (without the assumption of log-concavity the assertions of Theorem 1.1 and Corollary 1.3 are no longer equivalent).

Corollary 1.6.

Assume that $X$ is a random matrix such that the $(mn)$ -dimensional vector $(X_{1,1},\ldots X_{1,n},X_{2,1},\ldots,X_{2,n},X_{m,1},\ldots,X_{mn})$ is unconditional. Then for every $p,q\geq 2$ we have

[TABLE]

where $C(p,q)$ depends only on $p$ and $q$ .

The rest of this note is organised as follows. Section 2 contains results from other articles, which will be used in a sequel. Section 3 contains generalisations of Lemmas 3.1 and 3.2 from [6] to the log-concave setting and the proof of Theorem 1.1. In Section 4 we will show how to deduce an analogue of Theorem 1.1 for Gaussian mixtures (see Corollary 4.2) and we will provide a proof of Proposition 1.5. Section 5 is devoted to the proof of Corollary 1.6.

Notation. By $C$ we denote universal constants. If a constant $C$ depends on a parameter $\alpha$ , we express it as $C(\alpha)$ . The value of $C,C(\alpha)$ may differ at each occurrence. Whenever we want to fix the value of an absolute constant we use letters $C_{1},C_{2},\ldots$ . We may always assume that $C,C_{i}\geq 1$ . For two quantities $a,b$ we write $a\lesssim b$ if there exists a constant $C$ , such that $a\leq Cb$ , and $a\sim b$ , if $a\lesssim b$ and $b\lesssim a$ . For two numbers $a$ and $b$ we write $a\vee b$ instead of $\max\{a,b\}$ .

For a random variable $X$ by $\|X\|_{p}$ we denote the $p$ -th integral norm of $X$ , i.e. the quantity $(\mathbb{E}|X|^{p})^{1/p}$ (in the case $X=\|Y\|$ we also call this quantity the $p$ -th strong moment of $Y$ associated with the norm $\|\cdot\|$ ). For a vector $x\in\mathbb{R}^{n}$ (in particular for a random vector $X$ ) and $r\geq 1$ , by $\|x\|_{r}$ we denote the $\ell_{r}$ -norm of $x$ , i.e. $\|x\|:=(\sum_{i=1}^{n}|x_{i}|^{r})^{1/r}$ . For $r=2$ we shall also write $|\cdot|$ instead of $\|\cdot\|_{2}$ . It will be always clear from the context, what $\|X\|_{q}$ means for a random object $X$ , so the double meaning of $\|\cdot\|_{q}$ will not lead to any misunderstanding. Recall that for an $m\times n$ matrix $A$ by $\|A\|_{p,q}$ we denote its norm from $\ell_{p}^{n}$ to $\ell_{q}^{m}$ . For $p\in[1,\infty]$ by $p^{\prime}$ we denote the Hölder conjugate of $p$ , i.e. $1=\frac{1}{p}+\frac{1}{p^{\prime}}$ .

2. Preliminaries

We will frequently use the regularity of $f(Z)$ for log-concave vectors $X$ and seminorms $f$ , i.e.

[TABLE]

(see [4, Theorem 2.4.6]).

We will also need the comparison of weak and strong moments for $\ell_{p}$ -norms of log-concave vectors:

Theorem 2.1 ([11, Theorem 5]).

Let $Z$ be a log-concave vector in $\mathbb{R}^{n}$ , and let $p\in[1,\infty)$ . Then

[TABLE]

where

[TABLE]

is the $q$ -th weak moment of $X$ associated with the $\ell_{p}$ -norm.

We will use the previous theorem also in the tail-bound version:

Corollary 2.2.

Assume $Z$ is a log-concave vector in $\mathbb{R}^{n}$ , and $p\in[1,\infty)$ . Then

[TABLE]

For the Reader’s convenience we give a proof of this corollary, which goes along the lines of the proof of Corollary 1.3 in [12].

Proof.

Define a random variable $S:=\|Z\|_{p}$ . By the Paley–Zygmund inequality and (2.1) we have for $t\in\mathbb{R}^{n}$ , and $q\geq 1$ ,

[TABLE]

In order to show (2.2) we consider 3 cases.

Case 1. $2u<\sup_{t\in B_{p^{\prime}}^{n}}\|\sum_{i=1}^{n}t_{i}Z_{i}\|_{2}$ . Then by (2.3)

[TABLE]

and (2.2) obviously holds if $\Cr{d2}\geq\exp(2\Cr{c4})$ .

Case 2. $\sup_{t\in B_{p^{\prime}}^{n}}\|\sum_{i=1}^{n}t_{i}Z_{i}\|_{2}\leq 2u<\sup_{t\in B_{p^{\prime}}^{n}}\|\sum_{i=1}^{n}t_{i}Z_{i}\|_{\infty}$ . Let us then define

[TABLE]

By (2.3) we have

[TABLE]

By (2.1), Theorem 2.1, and Chebyshev’s inequality we have

[TABLE]

for $\Cr{c5}$ large enough. Thus (2.2) holds in this case.

Case 3. $u>\sup_{t\in B_{p^{\prime}}^{n}}\|\sum_{i=1}^{n}t_{i}Z_{i}\|_{\infty}=\|S\|_{\infty}$ . Then $\mathbb{P}(S\geq u)=0$ and (2.2) holds for any $\Cr{d1}\geq 1$ . ∎

In the proof of Theorem 1.1 we will use Theorem 2.1 from [6], which is another version of results provided before by Guédon–Rudelson in [8], and by Guédon–Mendelson–Pajor–Tomczak-Jaegerman in [7]:

Theorem 2.3 ([6, Theorem 2.1]).

Let $E$ be a Banach space with modulus of convexity of power type $2$ with constant $\lambda$ . Let $X_{1},\ldots X_{m}\in E^{*}$ be independent random vectors, and let $q\geq 2$ . Define

[TABLE]

and

[TABLE]

where $T_{2}(E^{*})$ is the Rademacher type $2$ constant of $E^{*}$ . Then

[TABLE]

We will use Theorem 2.3 with $E=\ell_{p^{\prime}}^{n}$ . In this case $\lambda$ and $T_{2}(E^{*})$ are known.

3. Proof of Theorem 1.1

The next two lemmas provide estimates of the quantities $u$ and $v$ appearing in Theorem 2.3 in the case $E=B_{p^{\prime}}^{n}$ .

Lemma 3.1.

Assume $p,q,X$ , and $Y$ are as in Theorem 1.1. Then

[TABLE]

where $C(p,q)$ depends only on $p$ and $q$ .

Lemma 3.2.

Assume $p,q,X$ , and $Y$ are as in Theorem 1.1. Then

[TABLE]

In the proof of Lemma 3.1 we will also need the following estimate:

Lemma 3.3.

Assume that $Z$ is an isotropic log-concave vector in $\mathbb{R}^{m}$ . Then for all $1\leq k\leq m$ and all $a\in\mathbb{R}^{m}$ we have

[TABLE]

where $(a_{i}^{*})_{i=1}^{m}$ denotes the non-increasing rearrangement of $(|a_{i}|)_{i=1}^{m}$ .

In order to prove Theorem 1.1, we repeat the proof scheme from [6].

Proof of Theorem 1.1.

We use Theorem 2.3 for $E=\ell_{p^{\prime}}^{n}$ . Then $\lambda\sim p$ (see [15, Theorem 5.3]) and $T_{2}(E^{*})\sim\sqrt{p}$ . Let $u$ and $v$ be given by formulas (2.4) and (2.5). The triangle inequality, Theorem 2.3, Lemma 3.1, and Lemma 3.2 yield

[TABLE]

∎

The main contribution of this article lies in the proofs of Lemmas 3.1, 3.2, and 3.3.

Proof of Lemma 3.3.

We may and do assume that $a_{1}\geq a_{2}\geq\ldots\geq a_{m}\geq 0$ , i.e. $a_{i}^{*}=a_{i}$ for $i\leq m$ . By [10, Proposition 3.3] we have for all $k\leq m$ ,

[TABLE]

Thus

[TABLE]

Proof of Lemma 3.1.

We may and do assume that $m\geq 2$ .

Since we may approximate $A_{ij}$ by nonzero numbers, we may and do assume that $a_{ij}\neq 0$ for all $i,j$ . Let $\Cr{d1},\Cr{d2}$ be the constants from (2.2), let $\Cr{d3}$ be the constant from Lemma 3.3, and recall that $\Cr{seminorms}$ is the constant from (2.1). We may assume that all these constants are greater than $1$ .

Note that for any $a,b\in\mathbb{R}$ we have $a=(a-b)_{+}+a\wedge b$ . Thus, by the triangle inequality,

[TABLE]

Moreover, for every $1\leq i\leq m$ we have by (2.1) and the isotropicity of $Y_{i}$ , that

[TABLE]

Now we pass to the estimation of the fist term of (3.2). Let

[TABLE]

By (2.2) we have

[TABLE]

For $u\geq\sup_{\|t\|_{p^{\prime}}\leq 1}\|\sum_{j=1}^{n}t_{j}X_{ij}\|_{\infty}$ the function we integrate vanishes, so from now on we will consider only $i$ ’s for which $u<\sup_{\|t\|_{p^{\prime}}\leq 1}\|\sum_{j=1}^{n}t_{j}X_{ij}\|_{\infty}$ .

Note that if $1\leq i\leq m$ and $\sup_{\|t\|_{p^{\prime}}\leq 1}\|\sum_{j=1}^{n}t_{j}X_{ij}\|_{\infty}>u\geq e\sigma\geq e\sigma_{p,X_{i}}(2)$ , then

[TABLE]

and $\sigma_{{}_{p},X_{i}}(r)=u/e$ . Therefore

[TABLE]

Now we will estimate $r$ from below. For $t\geq 2$ let

[TABLE]

Since $Y_{i}^{\prime}s$ are identically distributed, $\varphi$ does not depend on $i$ . By (2.1), and the isotropicity of $Y$ we have

[TABLE]

Since we can permute the rows of $A$ , we may and do assume that

[TABLE]

Let $j(i)\leq n$ be such an index that $|A_{ij(i)}|=\max_{1\leq j\leq n}|A_{ij}|$ . Lemma 3.3 applied to $Z_{i}=Y_{ij(i)}$ and the non-increasing sequence $a_{i}=|A_{ij(i)}|$ implies

[TABLE]

so for all $i\leq m$ we have

[TABLE]

Note that by (2.1) for all $r\geq\lambda\geq 2$ we have $\sigma_{p,X_{i}}(r/\lambda)\geq\sigma_{p,X_{i}}(r)/(\Cr{seminorms}\lambda)$ . Take $\lambda=\sigma_{p,X_{i}}(r)/B=u/(Be)\geq 2$ . Then by a calculation similar to the one above we get

[TABLE]

so indeed $r\geq\lambda\geq 2$ .

Therefore for all $i\leq m$ we have

[TABLE]

Since the function $\varphi$ is strictly increasing, the previous inequality yields $r\geq\lambda\log(i+1)$ . This together with (3.5) implies that (recall that $\lambda=\frac{u}{Be}\geq 2$ )

[TABLE]

Inequalities (3), (3.8), and the Stirling formula yield that

[TABLE]

Moreover, by (2.1)

[TABLE]

where the second inequality holds since the weak first moment is bounded above by the strong first moment. This together with (3.2), (3), and (3.9) gives the assertion. ∎

Proof of Lemma 3.2.

Note that if $0\leq r\leq s$ , then for every $x\in\mathbb{R}^{n}$ we have $\|x\|_{s}\leq\|x\|_{r}$ , so we may and do assume $p=2$ . By (2.1), the isotropicity of $Y$ , and the Jensen inequality we have

[TABLE]

*Remark 3.4**.*

By the same reasoning as in the log-concave case, we may prove (using [12, Corollary 1.3], [13, Theorem 2.1], and the claim below instead of (2.2), Lemma 3.3 and the previous estimates on $\sigma_{p,X_{i}}(s)$ , respectively) the following.

Let $X$ be an $m\times n$ random matrix with entries $X_{ij}=A_{ij}Y_{ij}$ , where $Y_{ij}$ are independent symmetric random variables such that $\mathbb{E}Y_{ij}^{2}=1$ . Assume that for any $r\geq 2$ and any $1\leq i\leq m$ , $1\leq j\leq n$ we have $\frac{r^{\beta}}{L}\leq\|Y_{ij}\|_{r}\leq Lr^{\beta}$ with $\beta\in[\frac{1}{2},1]$ . Then for every $p,q\geq 2$ we have

[TABLE]

where $C(p,q,L)$ depends only on $p$ , $q$ , and $L$ . At the end of Section 4 we provide another result concerning this type of random matrices (see Corollary 4.5).

As we mentioned, it suffices to prove the claim:

[TABLE]

where $C$ is an absolute constant, and repeat the proof of Theorem 1.1.

Proof of the claim.

It suffices to consider $r=2k$ , where $k$ is an integer. Let us denote

[TABLE]

Let $G=(G_{j})_{j=1}^{n}$ be the standard $n$ -dimensional Gaussian vector. Recall that for any $t\in\mathbb{R}^{n}$ and $r\geq 1$ we have $\|\sum_{j=1}^{n}t_{j}G_{j}\|_{r}=\|t\|_{2}\|G_{1}\|_{r}\sim\|t\|_{2}\sqrt{r}=\sqrt{r}\|\sum_{j=1}^{n}t_{j}Y_{ij}\|_{2}$ .

By the assumptions on $Y_{i}$ and by the fact that $\beta\geq\frac{1}{2}$ we get

[TABLE]

what finishes the proof of (3.10). ∎

By the claim we get

[TABLE]

what allows us to obtain a version of (3) for $\varphi(t):=\min_{\begin{subarray}{c}1\leq i\leq m,\\ 1\leq j\leq n\end{subarray}}\|Y_{ij}\|_{t}.$

4. Estimates of norms of matrices in the case of Gaussian mixtures

Let us recall the definition from [5], where the significance of Gaussian mixtures is also described.

*Definition 4.1**.*

A random variable $X$ is called a (centred) Gaussian mixture if there exists a positive random variable $r$ and a standard Gaussian random variable $g$ , independent of $r$ , such that $X$ has the same distribution as $rg$ .

We will work with matrices of the form $(R_{ij}B_{ij}G_{ij})_{i\leq m,j\leq n}$ which entries are Gaussian mixtures. We additionally assume that $R_{ij}=|Z_{ij}|^{\gamma}$ , where $\gamma\geq 0$ , and that the matrix $Z$ is log-concave and isotropic (considered as a random vector in $\mathbb{R}^{mn}$ ). It will be clear from the proof, that the corollary below is true also for another type of matrices: $(R_{i}B_{ij}G_{ij})_{i\leq m,j\leq n}$ , where $R_{i}=|Z_{i}|^{\gamma}$ , and $(Z_{1},\ldots,Z_{m})$ is an arbitrary isotropic log-concave random vector.

Corollary 4.2.

Let $m,n\geq 2$ , let $\gamma\geq 0$ , let $B=(B_{ij})$ be a deterministic $m\times n$ matrix, and let $G=(G_{ij})_{i\leq m,j\leq n}$ be a random matrix which entries are i.i.d. standard Gaussian variables. Let $X_{ij}=|Z_{ij}|^{\gamma}B_{ij}G_{ij}$ , where $Z=(Z_{ij})_{i\leq m,j\leq n}$ is a log-concave and isotropic random matrix independent of $G$ . Then for every $p,q\geq 2\vee\frac{1}{\gamma}$ we have

[TABLE]

Proof.

Theorem 1.1 applied to $Y=G$ and $A_{ij}=|Z_{ij}|^{\gamma}B_{ij}$ yields

[TABLE]

so it suffices to prove that

[TABLE]

and

[TABLE]

for $p\geq 1\vee\frac{1}{\gamma}$ . By the symmetry of assumptions we need only to show (4.1).

If $\gamma<1$ , then

[TABLE]

and

[TABLE]

so it suffices to consider only $\gamma\geq 1$ (we used here the assumption that $p\geq\frac{1}{\gamma}$ ).

Note that for any $u\geq 1$ we have

[TABLE]

Fix $i\leq m$ . By Theorem 2.1 applied to $p=p\gamma$ , $q=u\gamma$ (recall that $\gamma\geq 1$ , so $u\gamma,p\gamma\geq 1$ ), and $Z_{j}=|B_{ij}|^{1/\gamma}Z_{ij}$ we have

[TABLE]

Let us use (2.1) and the assumption $\mathbb{E}Z_{ij}^{2}=1$ to estimate the first term in (4):

[TABLE]

Recall that $B_{p^{\prime}}^{n}\subset B_{2}^{n}$ . We use again (2.1) and the isotropicity of $Z_{i}$ to estimate the second term in (4):

[TABLE]

Take $u=\log m$ and put together (4), (4), and (4.4) to get the assertion. ∎

*Remark 4.3**.*

Using [6, Theorem 1.1] instead of Theorem 1.1 in the proof above yields a slightly better estimate:

[TABLE]

*Remark 4.4**.*

It is clear from the proof of Corollary 4.2 that in the case $Z_{ij}=G_{ij}^{\prime}$ , where $G_{ij}^{\prime}$ are i.i.d. standard Gaussian variables, inequality (4.1) may be slightly improved:

[TABLE]

In order to obtain this improvement one should use $\|\langle t,G_{i}\rangle\|_{u\gamma}\lesssim\sqrt{u\gamma}\|\langle t,G_{i}\rangle\|_{2}$ instead of $\|\langle t,Z_{i}\rangle\|_{u\gamma}\lesssim u\gamma\|\langle t,Z_{i}\rangle\|_{2}$ . Therefore, if we additionally use Remark 4.3, the assertion of Corollary 4.2 in the case $Z_{ij}=G_{ij}^{\prime}$ (where $G^{\prime}$ is independent of $G$ ) will state that

[TABLE]

Proof of Proposition 1.5.

We begin similarly as in the proof of (4.1) (in the case $\gamma=1$ ), but we estimate the second term on the right-hand side of (4) in a slightly different way, using (2.1):

[TABLE]

We take $u=\log(m\vee n)$ to get the assertion. ∎

We may use the result concerning Gaussian mixtures to obtain the estimate similar to the one from Remark 3.4, valid for all $\beta\geq\frac{1}{2}$ (not only for $\beta\in[\frac{1}{2},1]$ ), but with a slightly worse constants than in Remark 3.4. The proof is based on the fact, that variables $Y_{ij}$ satisfying the moment assumption from Remark 3.4 are comparable with a certain Gaussian mixtures.

Corollary 4.5.

Let $m,n\geq 2$ , $\gamma\geq\frac{1}{2}$ , and let $X$ be an $m\times n$ random matrix with entries $X_{ij}=A_{ij}Y_{ij}$ , where $Y_{ij}$ are independent symmetric random variables such that $\mathbb{E}Y_{ij}^{2}=1$ . Assume that for any $r\geq 2$ and any $1\leq i\leq m$ , $1\leq j\leq n$ we have $\frac{r^{\beta}}{L}\leq\|Y_{ij}\|_{r}\leq Lr^{\beta}$ . Then for all $p,q\geq 2$ ,

[TABLE]

Proof.

Let $G_{ij},G_{ij}^{\prime}$ , $i\leq m$ , $j\leq n$ , be i.i.d. standard Gaussian variables. Let $(\varepsilon_{ij})$ be i.i.d. symmetric Bernoulli random variable, independent of $G$ and $G^{\prime}$ . Note that $Y_{ij}^{\prime}:=|G_{ij}|^{2\beta}\varepsilon_{ij}$ satisfies $\frac{r^{\beta}}{L^{\prime}}\leq\|Y_{ij}^{\prime}\|_{r}\leq L^{\prime}r^{\beta}$ for all $r\geq 2$ , with a universal constant $L^{\prime}$ , since $\|G_{ij}\|_{s}\sim\sqrt{s}$ for $s\geq 1$ . Let $X^{\prime}=(X_{ij})$ be the $m\times n$ random matrix with entries $X_{ij}^{\prime}=A_{ij}Y_{ij}^{\prime}$ . By [14, Lemma 4.7] we know that

[TABLE]

for any norm $\vvvert\cdot\vvvert$ on $m\times n$ real matrices. In particular

[TABLE]

Moreover, by the Jensen inequality and by (4.7) applied with $\gamma=2\beta$ we have

[TABLE]

what yields the assertion, since $\mathbb{E}\max_{\begin{subarray}{c}1\leq i\leq m\\ 1\leq j\leq n\end{subarray}}|G_{ij}|\sim\sqrt{\log(mn)}$ . ∎

5. The case of unconditional entries

Proof of Corollary 1.6.

Since $X$ is unconditional, it has the same distribution as the matrix $(\varepsilon_{ij}X_{ij})_{i\leq m,j\leq n}$ , where $\varepsilon_{ij}$ are i.i.d. symmetric Bernoulli variables independent of $X$ . Let $G_{ij}$ be i.i.d. standard Gaussian variables independent of $X$ and $(\varepsilon_{ij})_{i\leq m,j\leq n}$ . Then

[TABLE]

where in the last step we used Corollary 1.3 to estimate the mean with respect to $G$ . We use (4.6) with $\gamma=1$ (to $\mathbb{E}_{G}$ in each term above separately) to get the assertion. ∎

*Remark 5.1**.*

Using [6, Theorem 1.1] instead of Theorem 1.1 in the proof above yields a slightly better estimate in Theorem 1.6:

[TABLE]

6. Acknowledgements

I would like to thank Rafał Latała for suggestions which helped me to make the presentation clearer and more reader-friendly.

Bibliography17

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Artstein-Avidan, B. Klartag, and V. Milman, The Santaló point of a function, and a functional form of the Santaló inequality , Mathematika 51 (2004), no. 1-2, 33–48 (2005). MR 2220210
2[2] G. Bennett, V. Goodman, and C. M. Newman, Norms of random matrices , Pacific J. Math. 59 (1975), no. 2, 359–365. MR 0393085
3[3] C. Borell, Convex measures on locally convex spaces , Ark. Mat. 12 (1974), 239–252. MR 0388475
4[4] S. Brazitikos, A. Giannopoulos, P. Valettas, and B.H. Vritsiou, Geometry of isotropic convex bodies , Mathematical Surveys and Monographs, vol. 196, American Mathematical Society, Providence, RI, 2014. MR 3185453
5[5] A. Eskenazis, P. Nayar, and T. Tkocz, Gaussian mixtures: entropy and geometric inequalities , Ann. Probab. 46 (2018), no. 5, 2908–2945. MR 3846841
6[6] O. Guédon, A. Hinrichs, A.E. Litvak, and J. Prochno, On the expectation of operator norms of random matrices , Geometric aspects of functional analysis, Lecture Notes in Math., vol. 2169, Springer, Cham, 2017, pp. 151–162. MR 3645120
7[7] O. Guédon, S. Mendelson, A. Pajor, and N. Tomczak-Jaegermann, Majorizing measures and proportional subsets of bounded orthonormal systems , Rev. Mat. Iberoam. 24 (2008), no. 3, 1075–1095. MR 2490210
8[8] O. Guédon and M. Rudelson, L p subscript 𝐿 𝑝 L_{p} -moments of random vectors via majorizing measures , Adv. Math. 208 (2007), no. 2, 798–823. MR 2304336

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Estimates of norms of log-concave random matrices with dependent entries

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction and main result

Theorem 1.1**.**

Remark 1.2*.*

Corollary 1.3**.**

Remark 1.4*.*

Proposition 1.5**.**

Corollary 1.6**.**

2. Preliminaries

Theorem 2.1** ([11, Theorem 5]).**

Corollary 2.2**.**

Proof.

Theorem 2.3** ([6, Theorem 2.1]).**

3. Proof of Theorem 1.1

Lemma 3.1**.**

Lemma 3.2**.**

Lemma 3.3**.**

Proof of Theorem 1.1.

Proof of Lemma 3.3.

Proof of Lemma 3.1.

Proof of Lemma 3.2.

Remark 3.4*.*

Proof of the claim.

4. Estimates of norms of matrices in the case of Gaussian mixtures

Definition 4.1*.*

Corollary 4.2**.**

Proof.

Remark 4.3*.*

Remark 4.4*.*

Proof of Proposition 1.5.

Corollary 4.5**.**

Proof.

5. The case of unconditional entries

Proof of Corollary 1.6.

Remark 5.1*.*

6. Acknowledgements

Theorem 1.1.

*Remark 1.2**.*

Corollary 1.3.

*Remark 1.4**.*

Proposition 1.5.

Corollary 1.6.

Theorem 2.1 ([11, Theorem 5]).

Corollary 2.2.

Theorem 2.3 ([6, Theorem 2.1]).

Lemma 3.1.

Lemma 3.2.

Lemma 3.3.

*Remark 3.4**.*

*Definition 4.1**.*

Corollary 4.2.

*Remark 4.3**.*

*Remark 4.4**.*

Corollary 4.5.

*Remark 5.1**.*