Superconcentration, and randomized Dvoretzky's theorem for spaces with 1-unconditional bases
Konstantin Tikhomirov

TL;DR
This paper demonstrates that for convex bodies with a 1-unconditional basis, random sections are nearly Euclidean with high probability, revealing a new superconcentration phenomenon linked to the randomized Dvoretzky theorem.
Contribution
It introduces a novel superconcentration property for convex bodies with 1-unconditional bases, improving understanding of random sections in high-dimensional geometry.
Findings
Random sections of such bodies are approximately Euclidean with high probability.
Superconcentration of the norm's p-th power is established under certain smoothness and gradient conditions.
The results show better dependence on epsilon than in classical positions, with implications for high-dimensional convex geometry.
Abstract
Let be a sufficiently large natural number and let be an origin-symmetric convex body in in the -position, and such that the normed space admits a -unconditional basis. Then for any , and for random -dimensional subspace distributed according to the rotation-invariant (Haar) measure, the section is -Euclidean with probability close to one. This shows that the "worst-case" dependence on in the randomized Dvoretzky theorem in the -position is significantly better than in John's position. It is a previously unexplored feature, which has strong connections with the concept of superconcentration introduced by S. Chatterjee. In fact, our main result follows from the next theorem: Let be as before and assume additionally that …
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Superconcentration, and randomized Dvoretzky’s theorem for spaces with -unconditional bases
Konstantin Tikhomirov111Department of Mathematics, Princeton University; email: [email protected]
Abstract
Let be a sufficiently large natural number and let be an origin-symmetric convex body in in the -position, and such that the space admits a -unconditional basis. Then for any , and for random –dimensional subspace distributed according to the rotation-invariant (Haar) measure, the section is –Euclidean with probability close to one. This shows that the “worst-case” dependence on in the randomized Dvoretzky theorem in the -position is significantly better than in John’s position. It is a previously unexplored feature, which has strong connections with the concept of superconcentration introduced by S. Chatterjee. In fact, our main result follows from the next theorem: Let be as before and assume additionally that has a smooth boundary and {\mathbb{E}}_{\gamma_{n}}\|\cdot\|_{B}\leq n^{c}\,{\mathbb{E}}_{\gamma_{n}}\big{\|}{\rm grad}_{B}(\cdot)\big{\|}_{2} for a small universal constant , where is the gradient of and is the standard Gaussian measure in . Then for any the -th power of the norm is –superconcentrated in the Gauss space.
Keywords: Dvoretzky’s theorem, almost Euclidean sections, superconcentration, -position
**MSC 2010: 46B06, 46B09, 52A21 **
1 Introduction
The term superconcentration was introduced by S. Chatterjee to describe a situation when the size of typical fluctuations of a function on a probability space is much smaller than the bound provided by “classical” concentration inequalities [4]. In this note, we are concerned with applications of the superconcentration phenomenon in asymptotic geometric analysis; specifically, in the problem of finding large -Euclidean sections of convex bodies. On the probabilistic level, we derive a concentration inequality for convex positively homogeneous functions in the Gauss space satisfying some additional assumptions. On the geometric level, we show that John’s position may be a “bad” choice as far as dependence of dimension on is concerned in the randomized Dvoretzky’s theorem, and, at least for unit balls of normed spaces with a -unconditional basis, the -position allows a substantially better bound on the dimension.
The theorem of A. Dvoretzky [8] asserts that for arbitrary fixed and , every symmetric convex body of a large enough dimension contains a -Euclidean -dimensional section. A proof of the theorem based on the concentration of measure was proposed by V. Milman [18]. In view of results of Y. Gordon [11] and G. Schechtman [24], who improved dependence of the dimension on , the theorem of Milman reads: If is an origin-symmetric convex body in with the Minkowski functional and
[TABLE]
(where is the standard Gaussian vector in and is the Lipschitz constant of ), then for any and any natural the random -dimensional subspace uniformly distributed according to the rotation-invariant measure, cuts a -Euclidean section with probability close to one. The quantity is often called the critical, or Dvoretzky’s, dimension. The last statement asserts that most sections of (with respect to the rotation-invariant probability measure) of the given dimension are -Euclidean; in our note this version of Dvoretzky’s theorem is called “randomized” (as opposed to “existential”). Let us note that Dvoretzky’s theorem as well as numerous questions around it are covered in several monographs and surveys; see, in particular, [19, 23, 25, 2].
The Dvoretzky–Rogers lemma implies that for any convex body in John’s position (i.e. such that the ellipsoid of maximal volume contained inside is the unit Euclidean ball) one has , whence for a universal constant . This yields
Theorem 1** **(Randomized Dvoretzky’s theorem in John’s position,
Let be an origin-symmetric convex body in in John’s position, and let and . Then, for random -dimensional subspace uniformly distributed according to the rotation-invariant (Haar) measure, one has
[TABLE]
Here, is a universal constant.
In fact, in the above theorem “-Euclidean” can be replaced with a stronger notion of -spherical which we define as a section such that , where is some positive real number and is the standard Euclidean ball in .
It is not difficult to show that dependence on in the above theorem cannot be improved. The following statement can be verified by elementary geometric arguments combined with basic probability (for completeness, we give a proof in Section 6):
Proposition 2** (Optimality of randomized Dvoretzky’s theorem in John’s position).**
There are universal constants and with the following property: For any there is an origin-symmetric convex body in John’s position (and, moreover, the standard basis in is -unconditional with respect to the norm ) such that for all and , the random -dimensional subspace uniformly distributed according to the rotation-invariant (Haar) measure, satisfies
[TABLE]
Although Theorem 1 is sharp, it raises the question whether John’s position is a good choice for generating random almost Euclidean sections, or there is another canonical position which yields a better dependence on . As an example, let us note that in a recent paper [21] dealing with Dvoretzky’s theorem for subspaces of , G. Paouris and P. Valettas used a position of the unit ball other than John’s.
For an origin-symmetric convex body in and a linear operator define
[TABLE]
where is the Minkowski functional of and is the standard Gaussian vector in . We say that is in the -position if and
[TABLE]
It is not difficult to check that for any body and a linear operator with and such that
[TABLE]
the image is in the -position. The importance of the -position in asymptotic geometric analysis was revealed by T. Figiel and N. Tomczak-Jaegermann in [9] (see also [30, § 12], [23, Chapter 3], [2, Chapter 6]).
The main result of our note is the following theorem:
Theorem 3** **(Randomized Dvoretzky’s theorem in the -position).
Let be an origin-symmetric convex body in in the -position, and such that the space has a -unconditional basis. Further, let and . Then for random -dimensional subspace uniformly distributed according to the rotation-invariant (Haar) measure, one has
[TABLE]
with the notion “–spherical” defined above. Here, is a universal constant.
A version of the above statement was known in the particular case [26]; see also [22] where the randomized Dvoretzky’s theorem for –balls is studied. The dependence on in Theorem 3 is sharp in a sense that the majority of –dimensional sections of the standard cube are not –spherical [29]. Let us emphasize once more that the above statement does not hold in general if the -position is replaced with John’s. We conjecture that the assertion of Theorem 3 is true without the assumption that the space admits a -unconditional basis.
In [27] G. Schechtman proved, by combining random and deterministic arguments with a result of N. Alon and V. Milman [1], that any origin-symmetric convex body contains a –dimensional -Euclidean section. However, in contrast with Theorem 3, the result of [27] is existential in a sense that it does not provide a canonical position for a convex body in which most of its –dimensional sections are -Euclidean.
Our proof of Theorem 3 is based on the following dichotomy: Given a convex body satisfying the assumptions of the theorem, either the expectation of the length of the gradient of the norm is very small compared to (in which case a simple analysis shows that the assertion of the theorem is true) or the gradient is relatively “large” in which case we involve the superconcentration. Of course, only the second case is of interest.
Let be a natural number and let be the standard Gaussian measure in . For any sufficiently smooth real-valued function in one has
[TABLE]
where is the gradient of (the Poincaré inequality). A function in is called -superconcentrated (for some ) if
[TABLE]
The setting of actual interest involves a sequence of functions (indexed by the dimension ) such that tends to zero with . We refer to [4] for definition of superconcentration in a more general context, its relation to other properties (called “chaos” and “multiple valleys”), as well as for results dealing with specific probabilistic models. Theorem 3 of this note follows from the next result.
Theorem 4**.**
There are universal constants and with the following property. Let and let be an origin-symmetric convex body in in the -position, with a smooth boundary, and such that the space admits a -unconditional basis. Further, assume that , where denotes the standard Gaussian vector in and is the gradient of the norm . Then for any the function is –superconcentrated in the Gauss space.
The main tool in the proof of Theorem 4 is Talagrand’s bound (see Theorem 6), which we combine with some special properties of the -position (“balancing conditions”). The proof is not difficult and admits various generalizations in a sense that the -position can be replaced with other transformations of the convex body that provide appropriate “balancing” in regard to the Gaussian measure (we’ll return to this issue at the end of the paper).
2 Notation and preliminaries
Let be a natural number. The canonical basis in will be denoted by and the standard inner product — by . Given a set of vectors in , we denote their linear span by . For a subspace , is its orthogonal complement in and is the orthogonal projection operator onto . Given a boolean variable , denote by the indicator function of , so that if and only if is true. Similarly, for an event denote by the indicator function of the event.
A convex body in is any compact convex set with non-empty interior. Everywhere in this note, we say that the boundary of a convex body is smooth if every point of admits a unique tangent hyperplane. Given an origin-symmetric convex body , denote by its Minkowski functional. By some abuse of notation, for any subspace we denote by the Minkowski functional of considered as a convex body inside . Further, given a -dimensional subspace , we say that the section is –Euclidean (for some ) if the Banach–Mazur distance from to a -dimensional Euclidean ball is bounded from above by .
A basis of an -dimensional normed space with a norm is -unconditional if for any scalars and any signs . The canonical basis of is -unconditional with respect to a norm if and only if the unit ball of is symmetric with respect to coordinate hyperplanes.
Given a real-valued function in , by we denote its Lipschitz constant.
The set of all -dimensional subspaces of admits a unique normalized rotation-invariant Borel measure (the Haar measure). Whenever we speak about a random subspace in this note, we assume it is distributed according to that measure. The standard Gaussian measure in is denoted by , the standard Gaussian vector — by , and the standard real-valued Gaussian variable — by (or ’s when there are several of them).
Universal constants will be denoted by , etc. and their value may be different on different ocasions.
2.1 Gaussian concentration inequalities
The next theorem (with a worse constant in the exponent) is due to G. Pisier.
Theorem 5** (see, in particular, [23, Chapter 4], [15, p. 12] or [14, Chapter 2]).**
Let be a standard Gaussian vector in and be a -Lipschitz function. Then
[TABLE]
The next statement plays a crucial role in our analysis (a version of this inequality for the uniform measure on discrete cube was proved by M. Talagrand in [28]):
Theorem 6** (Talagrand’s bound; see, in particular, [6], [4, Chapter 5]).**
Let be an absolutely continuous function in and let () denote -th component of the gradient of . Then
[TABLE]
where is a universal constant.
2.2 Canonical positions of convex bodies
Given an origin-symmetric convex body , its position is any convex body for some invertible linear transformation . The two canonical positions we consider in this note are John’s and the -position, which were defined in the introduction. Recall that a position is John’s if the ellipsoid of maximal volume contained in is the unit Euclidean ball. The concept was used by F. John [12] to estimate the Banach–Mazur distance of arbitrary convex body to the Euclidean ball (see also [3]).
A viewpoint to canonical positions involving arbitrary norms on the space of linear operators was developed by D.R. Lewis [16] (see also [30, Chapter 3], [23, Chapter 3], as well as an alternative approach of A. Giannopoulos and V. Milman [10] based on isotropic measures on the Euclidean sphere). In particular, the -position can be defined as a linear transformation satisfying
[TABLE]
where is the norm on the space of linear operators in defined in the introduction, and is the norm in trace duality with [16, 30, 23, 2]. It is easy to see that the -position is rotation-invariant in a sense that, together with , any linear image of the form () is in the -position. At the same time, the -position is unique up to an orthogonal transformation [30, Proposition 14.3]. In fact, the following stability result is true:
Lemma 7** (Stability of the -position).**
Let and let be an origin-symmetric convex body in in the -position. Then for any there is depending only on with the following property: whenever is an invertible linear operator in with and , there exists an orthogonal transformation of such that .
Proof.
The proof to a large extent follows the argument in [30, Proposition 14.3]. Fix a small and define . Now, let be an operator satisfying the assumptions of the lemma. Choose an orthogonal operator so that , with being positive definite (the polar decomposition of ). Then and . It remains to show that . We shall prove this by contradiction. Assume that either or . Define an operator via its inverse: W^{-1}:=\frac{1}{2}\big{(}P^{-1}+{\rm Id}_{n}\big{)}. Clearly, if are eigenvalues of then
[TABLE]
Obviously, for all and, additionally, as at least one of the eigenvalues satisfies , we have
[TABLE]
Thus,
[TABLE]
whence . Next, observe that for any vector we have
[TABLE]
Thus, while . This contradicts the assumption that is in the -position. ∎
As a simple corollary, we obtain
Corollary 8**.**
Let and let be an origin-symmetric convex body in the -position. Then for any there is an origin-symmetric convex body with a smooth boundary, in the -position, and such that . Moreover, if the norm admits a -unconditional basis then can be defined so that is -unconditional as well.
Proof.
Fix the convex body and a small positive . Define a positive number , where the function is taken from Lemma 7. First, one can construct a smooth approximation of satisfying the relations
[TABLE]
and such that is -unconditional whenever is (see, for example, [13]). The inclusion relations imply that , whence, applying an appropriate dilation, we get a smooth convex body satisfying
[TABLE]
and such that . Now, let be an invertible linear transformation so that is in the -position. Obviously, , and, as is in the -position, we have . Thus, , i.e. . By Lemma 7 (applied to and operator ), there is an orthogonal transformation such that
[TABLE]
whence
[TABLE]
Together with (1), this implies that is the smooth convex body in the -position satisfying the required conditions. ∎
Remark 1*.*
Corollary 8 will allow us to reduce the proof of Theorem 3 to the case when the underlying convex body is smooth.
The next statement is intuitively obvious; we give its proof for completeness.
Lemma 9**.**
Let be an origin-symmetric convex body in in the -position, and assume that the normed space admits a -unconditional basis. Then the basis is orthogonal with respect to the canonical inner product in .
Proof.
Let be a -unconditional basis in , and suppose that it is not orthogonal. Without loss of generality, we can assume that and are not orthogonal. Let be the linear transformation of given by its action on the basis vectors: for all , and , where is the orthogonal projection onto . It is easy to see that the transformation is volume-preserving. Further, define a convex body via its Minkowski functional:
[TABLE]
Thus, is a (non-orthogonal) linear transformation of and . We will show that which, in view of the uniqueness of the -position mentioned above (see [30, Proposition 14.3] or the last corollary) leads to contradiction.
Let be the standard –dimensional Gaussian vector in and let be the standard Gaussian variable independent from . We consider three random variables on the probability space given by
[TABLE]
Obviously, \ell(B,{\rm Id}_{n})=\big{(}{\mathbb{E}}\,\eta_{1}^{2}\big{)}^{1/2}=\big{(}{\mathbb{E}}\,\eta_{2}^{2}\big{)}^{1/2}, and \ell(B^{\prime},{\rm Id}_{n})=\big{(}{\mathbb{E}}\,\xi^{2}\big{)}^{1/2}. At the same time, using -unconditionality of the basis with respect to , we obtain
[TABLE]
Thus, by the triangle inequality we get
[TABLE]
whence
[TABLE]
This implies that must also be in the -position contradicting the fact that the position is unique up to an orthogonal transformation. ∎
2.3 The gradient
Let be an origin-symmetric convex body in with a smooth boundary. For any point , the gradient of the function at point is well defined. It is not difficult to check that
[TABLE]
and that for all and . Further, the gradient of is continuous everywhere on its domain.
The next statement follows from the fact that any -unconditional norm in is a monotone function in the positive cone, as well as from hyperplane symmetries. We omit the proof.
Lemma 10**.**
Let be a convex body in with a smooth boundary such that the standard basis is -unconditional with respect to . Then for every point and every collection of signs we have
[TABLE]
As an almost immediate consequence, we obtain
Lemma 11**.**
Let be a convex body in with a smooth boundary such that the standard basis is -unconditional with respect to . Then for any we have
[TABLE]
Proof.
Let be an independent copy of . Then, appying Lemma 10 and formula (2), as well as standard estimates for the moments of Gaussians, we get
[TABLE]
where . It remains to note that is the standard Gaussian vector in . ∎
Let us state one more simple geometric property of the gradient:
Lemma 12**.**
Let be a smooth convex body in such that the standard basis is -unconditional with respect to . Then for any and any fixed numbers (), the function of one variable is non-increasing on and non-decreasing on .
The next elementary observation follows directly from property (2).
Lemma 13**.**
Let be an origin-symmetric convex body in with a smooth boundary. There is a universal constant such that for all we have
[TABLE]
3 Basic properties of the -position
Let us note that all auxiliary results proved in this section work for arbitrary origin-symmetric convex bodies with smooth boundaries in the -position.
Lemma 14**.**
Let be a smooth origin-symmetric convex body in the -position. Then
[TABLE]
Proof.
We will show that
[TABLE]
the statement will then follow by rotation-invariance of the -position. Fix for a moment any , take and define a diagonal operator via its diagonal entries:
[TABLE]
We clearly have , and, in view of (2),
[TABLE]
Further, as is continuous at every point of , we get
[TABLE]
On the other hand, in view of the definition of the -position, we have for any . Combining this with the above inequalities, we obtain
[TABLE]
Taking the limit when , we get
[TABLE]
At the same time, obviously
[TABLE]
Thus, the above relations must be equalities for all . ∎
Lemma 15**.**
For any and there are numbers depending only on and depending on and with the following property. Let , and let be an origin-symmetric convex body in such that
[TABLE]
Then
[TABLE]
Proof.
Without loss of generality, is large and . Fix a vector with . Standard deviation estimates for Gaussian variables imply
[TABLE]
whence, in view of (3), we have
[TABLE]
It remains to note that {\mathbb{E}}\|G\|_{B}\geq c_{p}\big{(}{\mathbb{E}}\|G\|_{B}^{p}\big{)}^{1/p} for some depending only on (see, for example, [15, Corollary 3.2]). ∎
Together Lemmas 14 and 15 imply
Proposition 16**.**
There are universal constants and with the following property. Let , and let be an origin-symmetric convex body in with a smooth boundary in the -position. Then
[TABLE]
Proof.
We can assume that is large. We will construct an orthogonal basis in as follows. First, there is a vector with such that B\subset\big{\{}x\in{\mathbb{R}}^{n}:\,|\langle x,y_{1}\rangle|\leq 1\big{\}}. We set . Now, assuming that are constructed, choose a vector with such that B\cap H_{k}\subset\big{\{}x\in H_{k}:\,|\langle x,y_{k+1}\rangle|\leq 1\big{\}}.
Note that for any and that for all . Now, set , , and consider two cases.
- •
Suppose that . In view of the triangle inequality and the definition of ’s, we have
[TABLE]
(here and further in the proof ’s are independent standard Gaussians). Then standard estimates for the maximum of independent Gaussian variables [7, p. 302], together with the assumption on the Euclidean norm of ’s, imply
[TABLE]
Hence, we get
[TABLE]
- •
Assume that . Thus, the Lipschitz constant of is less than , and Theorem 5, together with the relation , imply
[TABLE]
On the other hand, in view of the choice of and standard deviation estimates for a Gaussian variable, we have
[TABLE]
Clearly, in view of (2) we have
[TABLE]
and
[TABLE]
Thus, (4) yields
[TABLE]
The last inequality, together with the above deviation estimates for , implies
[TABLE]
whence
[TABLE]
On the other hand, by Lemma 14 and in view of the equivalence of moments of (see [15, Corollary 3.2]),
[TABLE]
for a universal constant . Thus,
[TABLE]
Solving for , we get
[TABLE]
for some constant .
∎
As a consequence of the above proposition, we get
Lemma 17**.**
There are universal constants and with the following property. Let , let be an origin-symmetric convex body in with a smooth boundary in the -position. Assume that
[TABLE]
Then for all we have
[TABLE]
where is the gradient of the -th power of the norm .
Proof.
We can assume that is large. Proposition 16 and Theorem 5 imply that with probability at least for a universal constant . Now, set and assume that . In view of Lemma 13 we have for all non-zero vectors and a universal constant . Hence, denoting by the event that , we obtain
[TABLE]
implying . Thus, with probability at least we have
[TABLE]
whence for all we get
[TABLE]
On the other hand, the concentration of (again, provided by Proposition 16 and Theorem 5) implies that
[TABLE]
for all for a sufficiently small universal constant . Thus, for all such we have
[TABLE]
and the result follows. ∎
4 The superconcentration of
Let be an origin-symmetric convex body in with a smooth boundary and let . For any point the -th partial derivative of at is equal to . Hence, applying Theorem 6, we get
[TABLE]
where is a universal constant. For each , write
[TABLE]
where
[TABLE]
Here, “F” stands for “flat” and “S” — for “spiky”. Then the upper bound for the variance can be written as
[TABLE]
We will treat the flat and the spiky parts separately.
Lemma 18** (The flat part).**
There are universal constants and with the following property. Let , let be a smooth origin-symmetric convex body in in the -position, and assume that the standard basis in is -unconditional with respect to . Then for all in the interval we have
[TABLE]
Proof.
We will assume that is large. Note that
[TABLE]
whence for all we get
[TABLE]
where the last relation follows from Hölder’s inequality. Next, in view of Lemma 11,
[TABLE]
whence
[TABLE]
It remains to note that we can choose the constant small enough and large enough to guarantee that
[TABLE]
∎
Lemma 19** (The spiky part).**
There are universal constants and with the following property. Let , let be as in the last lemma, and let , and be such that
[TABLE]
Then
[TABLE]
Proof.
We will assume that is large. In view of Lemma 14 as well as Lemma 10 and the definition of , we have
[TABLE]
On the other hand, conditioned on any realization of (), is a monotone function of , and, in view of Lemma 12 and the definition of , is a monotone function of . Hence, setting (), we obtain
[TABLE]
Further,
[TABLE]
where the last inequality follows from Proposition 16 and Theorem 5. Combining all the above inequalities, we obtain
[TABLE]
Taking into consideration the equivalence of moments of , we get that
[TABLE]
for a universal constant . Thus,
[TABLE]
Finally, we have, by the assumption of the lemma and the above relation,
[TABLE]
whence
[TABLE]
∎
Proof of Theorem 4.
We suppose that is large. Moreover, in view of Lemma 9 and rotation-invariance of the Gaussian distribution, we can assume without loss of generality that the standard basis in is -unconditional with respect to . Let be minimum of the constants from Lemmas 17 and 18, and let . We assume that
[TABLE]
Let us start by applying Lemma 19 with . Note that for those with
[TABLE]
we have
[TABLE]
for a universal constant . Hence,
[TABLE]
Together with relation (5) this gives
[TABLE]
Next, applying Lemma 18 we obtain
[TABLE]
Finally, in view of Lemma 17 and (6), this gives
[TABLE]
It remains to note that equals the Euclidean norm of the gradient of at , and apply the definition of superconcentration. ∎
5 The randomized Dvoretzky theorem
We will show how the variance bound from the previous section is translated into a small deviations inequality for . At a high level, the procedure is rather standard; for example, let us refer to [14, Chapter 3] for a very general scheme that allows to deduce exponential concentration from the Poincaré inequality. On the other hand, as we have better bounds on the variance of than those provided by the Poincaré inequality, our deviation estimates are stronger. Let us remark that the use of superconcentration in our analysis was inspired by a recent paper of G. Paouris, P. Valettas and J. Zinn [22] dealing with almost Euclidean sections of -balls, which complemented earlier results of A. Naor [20].
Let us start with a simple lemma that follows immediately from Proposition 16 and Theorem 5:
Lemma 20**.**
There are universal constants and with the following property. Let , let be an origin-symmetric convex body in in the -position, with a smooth boundary, and let . Then {\mathbb{E}}\|G\|_{B}^{2q}\leq C^{q}\,\big{(}{\mathbb{E}}\|G\|_{B}^{q}\big{)}^{2}.
Lemma 21**.**
There are constants , and with the following property. Let be an origin-symmetric convex body in in the -position, with a smooth boundary, and such that admits a -unconditional basis. Then for all we have
[TABLE]
Proof.
We can suppose that is large. Let be the constant from Theorem 4 (we can safely assume that ), and set , where is taken from Lemma 20. Further, let and consider two cases:
- •
Suppose that . Denote by the event . Clearly, by Markov’s inequality. Now, the Poincaré inequality for implies
[TABLE]
where at the last step we applied Lemma 13. Further, by the Cauchy–Schwarz inequality and Lemma 20 we get
[TABLE]
Finally,
[TABLE]
where the second inequality follows from the choice of .
- •
If then, by Theorem 4, we have
[TABLE]
for a universal constant . Note that, in view of Proposition 16,
[TABLE]
for a universal constant , whence
[TABLE]
Thus in any case we obtain the required bound. ∎
Now, we have
Theorem 22**.**
There are universal constants and with the following property. Let , let be an origin-symmetric convex body in in the -position, and assume that admits a -unconditional basis. Then for any we have
[TABLE]
Proof.
As before, we assume that is large. Note that, in view of Corollary 8, our convex body can be approximated with arbitrary precision by a smooth convex body in the -position. Thus, without loss of generality we can assume that itself is smooth. Let and be the constants from Lemma 21, and set and . Note that Lemma 21 then implies
[TABLE]
whence {\mathbb{E}}\|G\|_{B}^{2p}\leq\frac{4}{3}\big{(}{\mathbb{E}}\|G\|_{B}^{p}\big{)}^{2}. On the other hand,
[TABLE]
which, together with the above inequality, gives
[TABLE]
Now, for any we get
[TABLE]
Assume that . Then, by Chebyshev’s inequality and the above, we get
[TABLE]
To get lower deviation estimates, we apply a theorem of D. Cordero-Erausquin, M. Fradelizi and B. Maurey [5]. According to the theorem, the function
[TABLE]
is log-concave on the real line. Hence, assuming is the number satisfying {\mathbb{P}}\big{\{}\|G\|_{B}\leq(1+\varepsilon_{0}){\rm Med}\|G\|_{B}\big{\}}=3/4, we get
[TABLE]
whence
[TABLE]
for all . Note that the upper deviation estimates obtained in the first part of the proof imply that for a universal constant . Finally,
[TABLE]
for all . The result follows. ∎
The main result of this note — Theorem 3 — is obtained from Theorem 22 via a simple covering argument. We prefer to omit this (completely standard by now) part of the proof; we refer, in particular, to [19, 25, 2] for information on this matter.
6 Proof of Proposition 2
In this section, we prove Proposition 2 from the introduction, by providing an example of a convex set in in John’s position which shows that in general one cannot expect a better than quadratic dependence on in Theorem 1. In this connection it is natural to recall an example by T. Figiel which highlights the limitations of the existential Dvoretzky theorem (we refer to [25, Lecture 3] for more details). However, our example operates in a different regime as we bound the dimension of almost Euclidean sections by whereas Figiel’s convex set admits –dimensional sections.
Let us start by stating two facts. The first of the two lemmas below can be verified by combining a concentration inequality for the -norm of the Gaussian vector with a standard covering argument (see [26]), whereas the second one is a simple (and rather crude) corollary of Theorem 5 (again, combined with a covering procedure).
Lemma 23**.**
There are universal constants and with the following property: Let , and . Let be i.i.d. standard Gaussian vectors in , and set . Then
[TABLE]
Lemma 24**.**
There are universal constants and such that for any and the following holds: Let be i.i.d. standard Gaussian vectors in . Then
[TABLE]
The following lemma is a trivial planimetric observation; we provide the proof for reader’s convenience.
Lemma 25**.**
Let , and let a figure in the plane be given by
[TABLE]
where is the unit Euclidean ball in the plane. Then the Banach–Mazur distance from to can be estimated as
[TABLE]
Proof.
Clearly, the convex figure is isometric to the intersection of the disk and ellipse . Further, the four points of intersection of the boundaries and have coordinates
[TABLE]
Now, assume that is a linear transformation of such that
[TABLE]
In view of the symmetries in , we can assume that is diagonal, with and for some . Since , we have the inequality
[TABLE]
On the other hand, the distance is bounded from below by , whence, for any , we have
[TABLE]
Choose , where . It is easy to check that in this case , so that
[TABLE]
Thus,
[TABLE]
∎
In the following statement, we estimate the extreme singular values of a standard rectangular Gaussian matrix. The lemma is by no means new; however, we prefer to give an elementary proof based only on the standard concentration inequalities and not involving any spectral theory.
Lemma 26**.**
There are universal constants with the following property: Let , and let be the standard Gaussian matrix. Then {\mathbb{P}}\big{\{}s_{\max}(A)\leq\sqrt{m}+c_{\ref{l: gaussian extreme}}\sqrt{k}\big{\}}\leq\frac{1}{16} and {\mathbb{P}}\big{\{}s_{\min}(A)\geq\sqrt{m}-c_{\ref{l: gaussian extreme}}\sqrt{k}\big{\}}\leq\frac{1}{16}.
Proof.
Let us prove only the first assertion of the lemma (the argument for is very similar). We assume that , and that is sufficiently large. Let be i.i.d. standard Gaussian vectors in . We set and define random signs inductively as follows:
[TABLE]
We shall estimate the norm of the linear combination . Clearly, for any we have
[TABLE]
where g_{i}:=\big{\|}\sum_{j=1}^{i-1}\rho_{j}Y_{j}\big{\|}_{2}^{-1}\big{\langle}Y_{i},\sum_{j=1}^{i-1}\rho_{j}Y_{j}\big{\rangle} () are standard Gaussian variables. A rough estimate gives (provided that is sufficiently large):
[TABLE]
Next, the principal observation is that, conditioned on any realization of , the variable is distributed according to the normal law, whence ’s are jointly independent. It follows that, provided that is sufficiently large,
[TABLE]
with probability at least . Together with the above estimates, this gives
[TABLE]
with probability at least . Combined with Theorem 5 applied to \big{(}\sum_{i=1}^{k}\|Y_{i}\|_{2}^{2}\big{)}^{1/2} viewed as -Lipschitz function of i.i.d. standard Gaussian variables, this yields
[TABLE]
with probability at least . It remain to note that, for the Gaussian matrix with columns , we have \sqrt{k}\,s_{\max}(A)\geq\big{\|}\sum_{i=1}^{k}\rho_{i}Y_{i}\big{\|}_{2} deterministically. ∎
Remark 2*.*
Note that in the last lemma we estimate from below and from above. A lower bound on and upper bound on of a random matrix with i.i.d. centered entries can be derived as a corollary of the Marchenko–Pastur theorem for the limiting spectral distribution [17]. However, the Marchenko–Pastur theorem requires that the ratio converges to a fixed number, and its applicability in the case is unclear. On the other hand, the above proof of the lemma is based on an elementary argument which gives rather crude bounds but remains valid under very mild assumptions on , .
Proof of Proposition 2.
To simplify working with constants, we adopt the following convention in this proof: by writing “” we mean that for some universal constant which can be made arbitrarily small at expense of changing the constants in the final statement.
Let us fix a (large) , and define , where are i.i.d. standard Gaussians. Note that (see, for example, [26] or [7, p. 302]). Further, define two cylinders
[TABLE]
and set . It is easy to see that , and that () are contact points of and , whence, by John’s theorem [12, 3], is the maximal volume ellipsoid inside . Let \varepsilon\in\big{(}(\log n)^{-1/2},c\big{]}, (where is a sufficiently small universal constant whose value can be recovered from the proof), and set where are i.i.d. standard Gaussian vectors in . We will show that with probability at least , the random section is not –Euclidean.
Set . Note that, by the definition of , we have . Then, applying Lemma 23 with in place of , we obtain
[TABLE]
where . Next, observe that, in view of the choice of and asymptotic estimates of the median of the –norm of Gaussian vectors (see, in particular, [7, p. 302]), we have \big{|}\sqrt{m}/M_{r}-1\big{|}\ll(\log n)^{-1/2}. Together with Lemma 24, the above relation and the assumption that is small, gives
[TABLE]
or, geometrically,
[TABLE]
Further, the intersection is clearly an ellipsoid. Let be the orthogonal projections of ’s onto the linear span of , and let be the random Gaussian matrix with columns . Note that
[TABLE]
is the length of the largest semi-axis of (let us denote the corresponding random vector by , i.e. is the largest vector in with the end-point on the boundary of ). Similarly, we let be the smallest semi-axis of , i.e. the shortest vector in with the end-point on the boundary of . In view of Lemma 24, we have
[TABLE]
for some universal constant . Hence, by Lemma 26 we get
[TABLE]
where is a universal constant. Denote by the (random) -dimensional span of and . In view of (7), with probability at least the random figure is at the distance at most from . On the other hand, applying Lemma 25, we get
[TABLE]
Note that, conditioned on the event and , we have
[TABLE]
as long as is chosen to be sufficiently small. Hence, in view of (8),
[TABLE]
Finally,
[TABLE]
and the result follows. ∎
7 Remarks and open questions
- •
A question of importance is whether the assertion of Theorem 3 holds without assuming existence of a -unconditional basis in . This seems quite plausible, although absence of a preferred orthogonal basis in this case suggests that Talagrand’s bound will likely be inapplicable. In our paper the phenomenon of superconcentration is presented as a black box: we do not attempt to investigate the matters that lie beneath the bound. Proving the assertion of the theorem in full generality should require new tools.
- •
The assumption of Theorem 3 that the convex body is in the -position is not something absolutely necessary. Rather, what we need is a sort of a “balancing” condition for the norm . In particular, it is natural to expect that the assertion of the main theorem remains valid if the -position is replaced with the one given by
[TABLE]
for some fixed . One may further ask what are other natural positions in which the superconcentration phenomenon guarantees better than quadratic dependence on in the randomized Dvoretzky theorem.
Acknowledgments. The research is partially supported by the Simons Foundation. I would like to thank Ramon van Handel for an interesting discussion. I am also grateful to Nicole Tomczak-Jaegermann, Alexander Litvak and Assaf Naor for valuable remarks.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Alon and V. D. Milman, Embedding of l ∞ k subscript superscript 𝑙 𝑘 l^{k}_{\infty} in finite-dimensional Banach spaces, Israel J. Math. 45 (1983), no. 4, 265–280. MR 0720303
- 2[2] S. Artstein-Avidan, A. Giannopoulos and V. D. Milman, Asymptotic geometric analysis. Part I, Mathematical Surveys and Monographs, 202, American Mathematical Society, Providence, RI(2015). MR 3331351
- 3[3] K. Ball, Ellipsoids of maximal volume in convex bodies, Geom. Dedicata 41 (1992), no. 2, 241–250. MR 1153987
- 4[4] S. Chatterjee, Superconcentration and related topics , Springer Monographs in Mathematics, Springer, Cham, 2014. MR 3157205
- 5[5] D. Cordero-Erausquin, M. Fradelizi and B. Maurey, The (B) conjecture for the Gaussian measure of dilates of symmetric convex sets and related problems, J. Funct. Anal. 214 (2004) no. 2. MR 2083308
- 6[6] D. Cordero-Erausquin and M. Ledoux, Hypercontractive measures, Talagrand’s inequality, and influences, in Geometric aspects of functional analysis , 169–189, Lecture Notes in Math., 2050, Springer, Heidelberg. MR 2985132
- 7[7] H. A. David and H. N. Nagaraja, Order statistics , third edition, Wiley Series in Probability and Statistics, Wiley-Interscience, Hoboken, NJ, 2003. MR 1994955
- 8[8] A. Dvoretzky, Some results on convex bodies and Banach spaces, in Proc. Internat. Sympos. Linear Spaces (Jerusalem, 1960) , 123–160, Jerusalem Academic Press, Jerusalem. MR 0139079
