This paper extends the Moderate Deviations Principle to a new class of random fields using a hierarchical, inductive approach, improving understanding of deviations in complex stochastic systems.
Contribution
It introduces a hierarchical, inductive method to establish upper bounds for moderate deviations in random fields, generalizing previous results beyond sums of independent variables.
Findings
01
Upper bound for moderate deviations in a new class of random fields.
02
Inductive approach applicable in multiple dimensions.
The Moderate Deviations Principle (MDP) is well-understood for sums of independent random variables, worse understood for stationary random sequences, and scantily understood for random fields. An upper bound for a new class of random fields is obtained here by induction in dimension. Version 3. Sect 1. Stationarity, being not essential in the proofs, is removed from the definitions and the main result formulation. Sect. 2. [C,2C] instead of [C1,2C1] before Prop. 2.6; a≥1 instead of a≥C/C1 in the last proof; Remark 2.5 added; supremum over shifts in (2.2) (formerly (2.3)); "centered" instead of "CMS". Cosmetic changes: indexing of leaks; Remark 2.11. Sect. 3. Cosmetic change: semicolon after the second display of the proof of Lemma 3.9. References: "response", not "responce".
\mathbb{E}\,\bigg{|}\int_{B}X_{t}\,\mathrm{d}t\bigg{|}<\infty\quad\text{and}\quad\mathbb{E}\,\int_{B}X_{t}\,\mathrm{d}t=0\quad\text{for all boxes }B\subset\mathbb{R}^{d}\,.
\mathbb{E}\,\bigg{|}\int_{B}X_{t}\,\mathrm{d}t\bigg{|}<\infty\quad\text{and}\quad\mathbb{E}\,\int_{B}X_{t}\,\mathrm{d}t=0\quad\text{for all boxes }B\subset\mathbb{R}^{d}\,.
the distribution of∫[0,r1]×⋯×[0,rd]Xtdt is a Borel measurable function of (r1,…,rd).
the distribution of∫[0,r1]×⋯×[0,rd]Xtdt is a Borel measurable function of (r1,…,rd).
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQuantum chaos and dynamical systems · Stochastic processes and statistical mechanics · Mathematical Dynamics and Fractals
Full text
Linear response and moderate deviations:
hierarchical
approach. II
Boris Tsirelson
Abstract
The Moderate Deviations Principle (MDP) is well-understood for sums of
independent random variables, worse understood for stationary random
sequences, and scantily understood for random fields.
An upper bound for a new class of random fields is obtained here by
induction in dimension.
We examine a class of random fields X=(Xt)t∈Rd, but we are interested only in integrals ∫BXtdt over
boxes B=[α1,β1]×⋯×[αd,βd]⊂Rd (where α1<β1,…,αd<βd) rather than “individual”
random variables Xt. Similarly to [2, Sect. 1] we merely
deal with a box-indexed family of random variables, denoted (if only
for convenience) by ∫BXtdt and satisfying additivity:
[TABLE]
for all boxes B⊂Rd−1; this being additivity w.r.t. the first coordinate, the same is required for each coordinate.
We say that X is centered, if
[TABLE]
Two more conditions, stationarity and mesurability, will be used
in [3] when proving the moderate deviations principle.
Stationarity means measure preserving time shifts that send ∫BXtdt to ∫B+sXtdt. Measurability (under
stationarity) is similar to [2, (1.2)]:
[TABLE]
But for now, our goal being upper bounds, we need only (1.1)
and (1.2), and call such X a centered random field (on
Rd).
The notions “independent” and “identically distributed” are
interpreted for such processes similarly to [2, Sect. 1].
Sometimes, when convenient, we write X(t) rather than Xt.
Splittability, defined in [2, Def. 1.4] for d=1, will be
defined here for all d.
First, given a centered random field X=(Xt)t∈Rd, we define
a split of X as a triple of random fields X0,X−,X+
(on some probability space) such that the two fields X−,X+ are
(mutually) independent and the four fields X,X0,X−,X+ are
identically distributed. (Informally, a split is useful when its leak
defined below is small.) Clearly, X0,X−,X+ are centered (since X
is).
Second, given r,a,b∈R, a<r<b, and a split (X0,X−,X+) of X, we define the leak (of this split) along the
hyperplane {r}×Rd−1 on the strip [a,b]×Rd−1 as the random field (Yt)t∈Rd−1 where
[TABLE]
in the sense that
[TABLE]
for all boxes B⊂Rd−1. Clearly, Y is also a centered
random field. Similarly, for each k∈{1,…,d} we define
the leak along the hyperplane Rk−1×{r}×Rd−k on
the coordinate strip Rk−1×[a,b]×Rd−k. (Different k and r need different splits, to be useful.)
Given X, we need splits (Xk,r0,Xk,r−,Xk,r+) of X
for all k=1,…,d and r∈R; and for all a∈(−∞,r), b∈(r,∞) we consider the corresponding leak
Yk,r,a,b of this split.
For d=1 the leak is just a single random variable Y=∫arX−(t)dt+∫rbX+(t)dt−∫abX0(t)dt.
Accordingly, a family (Xi)i∈I of random fields on Rd leads to a family (Yk,r,a,b,i)k=1,…,d;a<r<b;i∈I of
centered random fields on Rd−1.
Third, we consider a family of centered random fields (X_{i})_{i\in I}=\bigl{(}(X_{t,i})_{t\in\mathbb{R}^{d}}\bigr{)}\vphantom{)}_{i\in I} (the index set I being arbitrary)
and define uniform splittability of such family. Then splittability of
a centered random field appears as the particular case of a single-element
set I. We define uniform splittability by recursion in the dimension
d=0,1,2,…, treating a centered random field on R0 as just a
single random variable of zero mean.
1.4 Definition**.**
A family (Xi)i∈I of centered random fields Xi=(Xt,i)t∈Rd is uniformly splittable, if either d=0 and
[TABLE]
or d≥1 and the following two conditions hold:
(a) there exist a box B⊂Rd and ε>0 such that
[TABLE]
(b) there exist splits (Xk,r,i0,Xk,r,i−,Xk,r,i+) of
each Xi, whose leaks are a uniformly
splittable family (Yk,r,a,b,i)k=1,…,d;a<r<b;i∈I.
We use volume and width of a box B=[α1,β1]×⋯×[αd,βd]⊂Rd:
[TABLE]
The theorem below applies first of all to a single random field (that
is, a single-element set I); the general formulation enables the
proof by induction in the dimension.
1.5 Theorem**.**
For every uniformly splittable family (Xi)i∈I of centered random
fields Xi=(Xt,i)t∈Rd there exists C∈(1,∞) such that for every i∈I, every box B⊂Rd and
every λ∈R,
[TABLE]
(Of course, logdvolB is \bigl{(}\log(\operatorname{vol}B)\bigr{)}\vphantom{)}^{d}.)
This theorem will be proved by induction in the dimension d=1,2,…
Throughout we assume that a uniformly splittable family (Xi)i∈I of centered random fields Xi=(Xt,i)t∈Rd is given.
We assume that the theorem holds in dimension d−1, unless d=1; in
the latter case the proof needs trivial modifications.
According to Def. 1.4 we have splits (of the fields Xi) whose leaks Yk,r,a,b,i are a uniformly splittable family
(in dimension d−1). Theorem 1.5, applied to this family, gives
C1 such that for all k,r,a,b,i
[TABLE]
whenever B⊂Rd−1 is a box of width ≥C1, and C1∣λ∣≤logd−1volB1.
For d=1 the leaks Y1,r,a,b,i are just random
variables; their uniform splittability means
∃ε>0∀r,a,b∀i∈IEexpε∣Y1,r,a,b,i∣≤2. By Lemma 1.8 below this
implies logEexpελY1,r,a,b,i≤λ2 for λ∈[−1,1]. Taking C_{1}=\max\bigl{(}\frac{1}{\varepsilon^{2}},1\bigr{)}\vphantom{)} we get for all
r,a,b,i
If a random variable Z satisfies Eexp∣Z∣≤2 and EZ=0, then logEexpλZ≤λ2 for all λ∈[−1,1].
Theorem 1.5 is proved in Sect. 2 for \lambda=\mathcal{O}\bigl{(}(\operatorname{vol}B)^{-\frac{1}{2d}}\log^{-(d-1)}\operatorname{vol}B\bigr{)}\vphantom{)}; larger λ are
treated in Sect. 3. I still do not know, what happens when
λ tends to [math] slower than log−dvolB. This logarithmic
gap between moderate and large deviations, is it a phenomenon or a
drawback of my approach?
2 Far from large deviations
2.1 Proposition**.**
There exists C∈(1,∞) such that for every i∈I, every box B⊂Rd of
volume v and width ≥C, and every λ∈R,
[TABLE]
Similarly to [2, Sect. 2a] we consider random variables
[TABLE]
their cumulant generating functions λ↦logEexpλSB,i, and ensure shift invariance by taking the supremum over
all shifts of a box:
[TABLE]
Still, fB,i(λ)≥0, since EexpλSB,i≥E(1+λSB,i)=1.
Further, we take the supremum over all i and all boxes of a given
volume and width ≥C:
We generalize [2, Prop. 2a9(a)]. Given a box B⊂Rd−1
and a number r>0, we consider two boxes B1=[0,r]×B
and B2=[−r,r]×B in Rd. Let v=volB1 and 0ptB≥C1. For d=1 we mean B1=[0,r] and B2=[−r,r]; B disappears, as well as the condition on 0ptB; by
convention, log−(d−1)volB=1, and (in the proof) volB=1.
2.4 Lemma**.**
For all p∈(1,∞) and λ such that
C1∣λ∣≤pp−12vlog−(d−1)volB,
[TABLE]
Proof.
Given i∈I, we use the split (X0,X−,X+) of Xi whose
leak Y=Y1,0,−r,r,i on the strip [−r,r]×Rd−1
satisfies (1.6). Similarly to [2, 2a7 and 2a9], the random
variables U=v1∫[−r,0]×BXt−dt,
V=v1∫[0,r]×BXt+dt,
W=2v1∫[−r,r]×BXt0dt and Z=−∫BYtdt satisfy Z=2vW−vU−vV and, by (1.6), logEexpλZ≤C1(volB)λ2 for C1∣λ∣≤log−(d−1)volB. Similarly to [2, Prop. 2a9], by
Hölder’s inequality, f_{B_{2},i}(\lambda)=\log\mathbb{E}\,\exp\frac{\lambda}{\sqrt{2v}}\int_{B_{2}}X_{t,i}\,\mathrm{d}t=\log\mathbb{E}\,\exp\lambda W=\log\mathbb{E}\,\exp\lambda\bigl{(}\tfrac{U+V}{\sqrt{2}}+\frac{1}{\sqrt{2v}}Z\bigr{)}\vphantom{)}\leq\frac{1}{p}\cdot 2f_{B_{1}}\bigl{(}\frac{p\lambda}{\sqrt{2}}\bigr{)}\vphantom{)}+\frac{p-1}{p}\log\mathbb{E}\,\exp\frac{p}{p-1}\frac{\lambda Z}{\sqrt{2v}}. The second
term does not exceed \frac{p-1}{p}C_{1}(\operatorname{vol}B)\bigl{(}\frac{p}{p-1}\frac{\lambda}{\sqrt{2v}}\bigr{)}\vphantom{)}^{2}=C_{1}\frac{\operatorname{vol}B}{2v}\frac{p}{p-1}\lambda^{2} for p−1p2vC1∣λ∣≤log−(d−1)volB; and 2vvolB=2r1. It remains to take
supremum in i∈I.
For d=1 the leak Y=Y1,0,−r,r,i, being a random variable,
satisfies (1.7); Z=−Y; logEexpλZ≤C1λ2 for C1∣λ∣≤1 (and v=r, of course).
∎
2.5 Remark**.**
Above, a box is halved (divided in two boxes) by the hyperplane x1=0. More generally, the same
holds when halving the box by another coordinate hyperplane xk=c
for k∈{1,…,d} and the appropriate c.
Now we consider two boxes in Rd, B0=[0,r1]×⋯×[0,rd] and B=[0,2n1r1]×⋯×[0,2ndrd] for arbitrary r1,…,rd∈[C,2C) for some C≥C1, and arbitrary n1,…,nd∈{0,1,2,…}.
2.6 Proposition**.**
If C is large enough, then for all r1,…,rd∈[C,2C), δ>0 and a≥R(volB0)C satisfying
[TABLE]
where B0=[0,r1]×⋯×[0,rd], there exists
natural N such that the following holds for all n1,…,nd∈{0,1,2,…} satisfying n1+⋯+nd≥N:
[TABLE]
where B=[0,2n1r1]×⋯×[0,2ndrd] and
Δ=C11a1S(volB)log−(d−1)S(volB).
(For d=1, by convention, log−(d−1)S(…)=1,
notwithstanding that S(…)=1.)
Proof.
We take n=n1+⋯+nd, Bn=B, halve the longest side of
Bn, denote the half of Bn by Bn−1, and repeat this
operation getting Bn−2,…,B0. For each k=0,…,n−1 we have volBk+1=2k+1volB0 and, assuming C≥C1,
[TABLE]
by Lemma 2.4 (and Remark 2.5), since (recall r and B of 2.4) the
longest side 2r of Bk+1 cannot be less than R(volBk+1), and volB=2rvolBk+1≤S(volBk+1). (For d=1 this is just 1=2r2r≤1.)
Given Ak and Δk such that fBk(λ)≤Ak2λ2
for ∣λ∣≤Δk, we denote q=p−1p, x=R(volBk+1)C1 and get
[TABLE]
for |\lambda|\leq\min\bigl{(}\frac{\sqrt{2}}{p}\Delta_{k},\frac{1}{C_{1}q}\sqrt{\operatorname{vol}B_{k+1}}\log^{-(d-1)}S(\operatorname{vol}B_{k+1})\bigr{)}\vphantom{)}. Generally, the
minimum of Ak2p+x2q over p,q such that p1+q1=1 is equal to (Ak+x)2, and is reached at p=1+Akx, q=1+xAk. Thus, fBk+1(λ)≤Ak+12λ2 for ∣λ∣≤Δk+1, provided that Ak+1≥Ak+x=Ak+R(volBk+1)C1 and
\Delta_{k+1}\leq\min\Big{(}\frac{\sqrt{2}}{p_{k}}\Delta_{k},\frac{1}{C_{1}q_{k}}\sqrt{\operatorname{vol}B_{k+1}}\log^{-(d-1)}S(\operatorname{vol}B_{k+1})\Big{)}; here pk=1+Akx=1+Ak1R(volBk+1)C1 and qk=1+xAk=1+C1AkR(volBk+1).
We take Ak=a+R(volB0)C1∑i=1k2−2di (thus Ak+1=Ak+R(volBk+1)C1) and note that Ak↑A∞≤a+CC1a∑i=1∞2−2di≤2a if C is large enough (since a≥R(volB0)C). Also, qk≤1+C12aR(volBk+1)≤C13aR(volBk+1) for all k, if
C is large enough (since a≥R(volB0)C
again). Assuming also C1≥3 (which is harmless) we introduce Mk=C11a1S(volBk)log−(d−1)S(volBk), note that Mk+1≤C1qk1volBk+1log−(d−1)S(volBk+1) (since qk≤aR(volBk+1)), and replace the condition on Δk+1 given
above with the stronger condition \Delta_{k+1}\leq\min\bigl{(}\frac{\sqrt{2}}{p_{k}}\Delta_{k},M_{k+1}\bigr{)}\vphantom{)}. Now we note that pk−1=Ak1R(volBk+1)C1≤a1R(volBk+1)C1≤CR(volBk+1)C1R(volB0)≤2−2dk+1 if C≥C1, take integer N such that
2−2dN+1≤22d1−1, and get Mk+1≤pk2Mk for all k≥N (since MkMk+1≤22dd−1 and pk≤1+2−2dN+1≤22d1).
We choose Δk as follows:
[TABLE]
Clearly, Δk+1≤pk2Δk for all k.
In order to obtain fBn(λ)≤2aλ2 for ∣λ∣≤Mn
when n≥N it is sufficient to ensure that Δ0≤δ and
Δk≤Mk for k=0,1,…,N−1. We note that p0…pN−1≤∏k=0N−1(1+2−2dk+1)≤exp∑k=0∞2−2dk+1 and C1MN≤a1S(volBN)≤C1R(volB0)S(volBN)=22dd−1NC1volB0, thus \Delta_{0}=p_{0}\dots p_{N-1}\cdot 2^{-\frac{N}{2}}M_{N}\leq\bigl{(}\exp\sum_{k}2^{-\frac{k+1}{2d}}\bigr{)}\vphantom{)}\cdot 2^{-\frac{N}{2d}}\frac{1}{C_{1}}\sqrt{\frac{1}{C}(2C)^{d}}; by increasing N as needed we get Δ0≤δ.
It remains to ensure that Δk≤Mk for k=0,1,…,N−1.
We’ll get a bit more: Δk≤C11a1S(volBk)log−(d−1)S(volBN), that is,
[TABLE]
We may check it only for k=0 and k=N due to the fact that pk…pN−1 is a logarithmically convex function of k (since
pk decrease). For k=N it is just 1≤1. For k=0 we need p0…pN−1≤22dN, which holds for N such that exp∑k2−2dk+1≤22dN.
∎
2.7 Remark**.**
In the proof of 2.6, the restriction on C depends only on d
and C1. For large d, roughly, C≥O(d2)C1. Also, the
restriction on N depends only on d, C and δ; roughly, N≥O(d2)logC+O(d)logδ1.
2.8 Remark**.**
In 2.6, a and δ may depend on r1,…,rd. Assume
for a while that they do not; that is, the given a and δ serve
all r1,…,rd (for the given C). Then the conclusion (that
fB(λ)≤2aλ2 whenever ∣λ∣≤Δ) holds for all
B such that 0ptB≥C and volB is large enough
(namely, volB≥2N(2C)d). We get, for all v large enough,
[TABLE]
It appears (Lemma 2.12 below) that the assumption of 2.8
is satisfied always (that is, for every uniformly splittable
family). Alternatively, the reader may just include that assumption
into Def. 1.4 (replacing Item (a) there) and skip to the
proof of Prop. 2.3 near the end of this section.
2.9 Lemma**.**
For every uniformly splittable family (Xi)i∈I of centered random
fields on Rd and every box B⊂Rd there exist a,δ>0 such that
[TABLE]
This lemma will be proved by induction in the dimension d=1,2,…
As was noted near (1.6), the given family X=(Xi)i on Rd leads to another family Y=(Yk,r,a,b,i) on
Rd−1. Both families of random fields lead to box-indexed
families of functions R→[0,∞]; X leads to (fB)B⊂Rd as before; likewise, Y leads to (gB)B⊂Rd−1. (For d=1, just a single function g.)
Similarly to [2, 2a9(b) and 2a10] we modify Lemma 2.4 as
follows. Given a box B0⊂Rd−1 and numbers r,s>0,
we consider three boxes B1=[−r,0]×B0, B2=[0,s]×B0 and B=[−r,s]×B0 in Rd.
2.10 Lemma**.**
For all p∈(1,∞) and λ∈R,
[TABLE]
Proof.
Similar to the proof of 2.4. Denoting v=volB0,
the random variables
U=rv1∫B1Xt−dt,
V=sv1∫B2Xt+dt,
W=(r+s)v1∫BXt0dt and Z=−∫B0Ytdt satisfy r+sW=rU+sV+v1Z. By Hölder’s inequality, f_{B,i}(\lambda)=\log\mathbb{E}\,\exp\lambda W=\log\mathbb{E}\,\exp\lambda\bigl{(}\sqrt{\frac{r}{r+s}}U+\sqrt{\frac{s}{r+s}}V+\frac{1}{\sqrt{(r+s)v}}Z\bigr{)}\vphantom{)}\leq\frac{1}{p}\log\mathbb{E}\,\exp p\lambda\bigl{(}\sqrt{\frac{r}{r+s}}U+\sqrt{\frac{s}{r+s}}V\bigr{)}\vphantom{)}+\frac{p-1}{p}\log\mathbb{E}\,\exp\frac{p}{p-1}\lambda\frac{1}{\sqrt{r+s}}\frac{1}{\sqrt{v}}Z\leq\linebreak\frac{1}{p}f_{B_{1}}\bigl{(}p\lambda\sqrt{\frac{r}{r+s}}\,\bigr{)}\vphantom{)}+\frac{1}{p}f_{B_{2}}\bigl{(}p\lambda\sqrt{\frac{s}{r+s}}\,\bigr{)}\vphantom{)}+\frac{p-1}{p}g_{B_{0}}\bigl{(}-\frac{p}{p-1}\frac{\lambda}{\sqrt{r+s}}\bigr{)}\vphantom{)}; supremum in i gives the
first inequality (the upper bound). The second inequality (the lower
bound), being rewritten as f_{B_{1}}\bigl{(}\lambda\sqrt{\frac{r}{r+s}}\,\bigr{)}\vphantom{)}+f_{B_{2}}\bigl{(}\lambda\sqrt{\frac{s}{r+s}}\,\bigr{)}\vphantom{)}\leq\frac{1}{p}f_{B}(p\lambda)+\linebreak\frac{p-1}{p}g_{B_{0}}\bigl{(}\frac{p}{p-1}\frac{\lambda}{\sqrt{r+s}}\bigr{)}\vphantom{)},
follows by Hölder’s inequality from the relation rU+sV=r+sW−v1Z.
∎
2.11 Remark**.**
Above, a box B is divided in two boxes B1,B2 by the hyperplane x1=0. More generally, the same holds when B is divided by another
coordinate hyperplane xk=c for k∈{1,…,d} and any
appropriate c.
Let us call a box B⊂Rdgood when there exist a,δ>0 such that fB(λ)≤aλ2 whenever ∣λ∣≤δ. Similarly, a box B⊂Rd−1 is good when \exists a,\delta>0\;\forall\lambda\;\bigl{(}|\lambda|\leq\delta\;\Longrightarrow\;g_{B}(\lambda)\leq a\lambda^{2}\bigr{)}\vphantom{)}.
Existence of (at least one) good box follows from Item (a) of
Def. 1.4 and Lemma 1.8. (For d=0 the only
“box” is good.)
In order to prove Lemma 2.9 we assume (the induction hypothesis)
that all boxes in Rd−1 are good, and prove that all boxes in Rd are good.
By 2.10, B is good if and only if B1,B2 are good. It
follows that every box contained in some good box is good (turn from
B to B1 or B2, and iterate).
Thus, all boxes that are small enough are good. It follows that every
box is good (divide it into small boxes).
Given a set of boxes, we say that these boxes are uniformly
good when there exist a,δ>0 such that for every box B of
the given set, every λ∈[−δ,δ] satisfies the inequality fB(λ)≤aλ2 (or gB(λ)≤aλ2, for B⊂Rd−1).
2.12 Lemma**.**
Let 0<c<C<∞. Then the boxes [0,r1]×⋯×[0,rd] for all r1,…,rd∈[c,C] are uniformly
good.
Proof.
Induction in the dimension d. The induction hypothesis gives a0,δ0>0 such that g[0,r1]×⋯×[0,rd−1](λ)≤a0λ2 whenever ∣λ∣≤δ0 and r1,…,rd−1∈[c,C].
(When d=1, this holds for the single function g.)
The box [0,C]d=[0,C]×⋯×[0,C] being good by 2.9, we take a,δ>0 such
that f[0,C]d(λ)≤aλ2 whenever ∣λ∣≤δ. We
use the second inequality of 2.10 for p=2 (taking 2.11
into account):
[TABLE]
for |\lambda|\leq\min\bigl{(}\frac{\delta}{2}\sqrt{\frac{c}{C}},\frac{\delta_{0}}{2}\sqrt{c}\bigr{)}\vphantom{)}. Thus, the boxes [0,r1]×[0,C]d−1 for r1∈[c,C] are uniformly good. Now we divide the box [0,r1]×[0,C]d−1 by the hyperplane x2=r2, apply
again the argument used above, and see that the boxes [0,r1]×[0,r2]×[0,C]d−2 for r1,r2∈[c,C] are
uniformly good. And so on.
∎
We take C large enough according to 2.6. By 2.12 the
boxes B=[0,r1]×⋯×[0,rd] for all r1,…,rd∈[C,2C] are uniformly good. We take a≥1 and δ>0
such that fB(λ)≤aλ2 for all these B and all λ∈[−δ,δ]. Now 2.8 gives V such that all v∈[V,∞) satisfy
[TABLE]
We take M>1 such that M≥C, Md≥V, M≥2a, M≥C1a, and get
[TABLE]
since v>1, logd−1S(v)≤logd−1v (just 1≤1
for d=1), v≥Md≥V, C1∣λ∣≤MC1S(v)log−(d−1)v≤a1S(v)log−(d−1)S(v), and fv,M(λ)≤fv,C(λ)≤2aλ2≤Mλ2.
∎
In this section we denote by C2 the constant given by
Prop. 2.3, use the functions fv,C mostly for C=C2,
and denote fv=fv,C2. By 2.3,
[TABLE]
We still use (1.6), (1.7) and C1 therefrom; C1≤C2. By convention, log0x=1 always (also if x does not
belong to (1,∞) or even is ill-defined).
3.2 Lemma**.**
For all p∈(1,∞),
[TABLE]
whenever C1∣λ∣≤pp−12vlog−(d−1)S(2v)
and 2v≥(2C2)d.
Proof.
Given a box B such that volB=2v and 0ptB≥C2,
we halve the longest side 2r of B and apply Lemma 2.4 (as we
did in the proof of 2.6). Once again, 2r≥R(volB)=R(2v)≥2C2, thus a half of B is still of width ≥C2,
and 2r2v≤S(2v), thus 2.4 applies and gives f_{B}(\lambda)\leq\frac{2}{p}f_{v}\bigl{(}\frac{p\lambda}{\sqrt{2}}\bigr{)}\vphantom{)}+C_{1}\frac{p}{p-1}\cdot\frac{\lambda^{2}}{2r}.
∎
It follows that
[TABLE]
since p−1pC1∣λ∣≤2vlog−(d−1)S(2v).
For convenience we denote
[TABLE]
3.3 Corollary**.**
For all p∈(1,∞),
[TABLE]
whenever 0<C1∣λ∣≤pp−12vlog−(d−1)S(2v) and 2v≥(2C2)d.
We take p=λ12λ0 and note that λ0=2pλ1 and p∣λ1∣p−1=∣λ1∣1−2∣λ0∣1≥2vC1logd−1S(2v), that is, C1∣λ1∣≤pp−12vlog−(d−1)S(2v),
thus 3.3 applies.
∎
Taking C≥max(C2,e1/d) we see that the case C∣λ∣≤logd−1vS(v) is covered
by (3.1).
From now on we assume that v≥Cd and
[TABLE]
ultimately we’ll prove that fv,C(λ)≤Cλ2 provided
that the constant C, dependent only on d,C1,C2, is large
enough.
We take integer n such that
[TABLE]
(the constant Md, dependent on d only, will be chosen later).
3.9 Lemma**.**
If Md≥(2d)d−1 and C≥e, then n≥1.
Proof.
Assume the contrary: Md2dvd−1(C∣λ∣)2dlog2d(d−1)C∣λ∣v≤1, that is,
[TABLE]
For d=1 it means M1C∣λ∣≤1 in contradiction to C∣λ∣>1 and M1≥1. Assume d≥2. Using (3.7),
[TABLE]
thus, 2d1logv>logC∣λ∣v, that is, C∣λ∣>S(v). Now 3.10 gives Mdlogd−1C∣λ∣v<1, which implies Mdlogd−1logdv<1 (since C∣λ∣≤logdvv by (3.7)), and 2dloglogdv<1 (since Md≥(2d)d−1). On the other hand, v≥Cd≥ed implies
logv≥d and loglogdv=dloglogv≥dlogd,
thus 2dloglogdv≥2d2logd≥8log2>1; a
contradiction.
∎
From now on we assume Md≥(2d)d−1 and C≥e (thus, n≥1). We define v0,…,vn by
[TABLE]
Below, “y=O(x)” means that y≤const⋅x for some
constant dependent on d only.
3.11 Lemma**.**
n≤2C1∣λ∣vlog−(d−1)v provided that C
is large enough.
Proof.
It is sufficient to prove that n=O(logv); then, increasing
C as needed, we get n≤2C1Clogv≤2C1∣λ∣vlog−(d−1)v since C∣λ∣≤logdvv by (3.7).
We have C∣λ∣1=O(1) (since by (3.7),
C∣λ∣1≤S(v)logd−1v, the
latter being bounded in v∈(1,∞)).
Thus, logC∣λ∣v=O(logv) (since logv≥logCd≥d≥1). Also, C∣λ∣≤logdvv≤v. Using (3.8),
[TABLE]
which implies n=O(logv).
∎
Having nlogd−1v≤2C1∣λ∣v (ensured
by Lemma 3.11) we define λ0,…,λn (either all
positive or all negative) by
[TABLE]
That is,
[TABLE]
the right-hand side is positive, since
[TABLE]
moreover, for k=n we get
[TABLE]
3.13 Lemma**.**
If C is large enough, then 2v0≥(2C2)d.
Proof.
For C large enough we have logd−1xx≥Md2C2 for all x∈[(dlogC)d,∞). We take x=C∣λ∣v; using (3.7), x≥logdv≥logdCd=(dlogC)d, thus logd−1xx≥Md2C2. Using (3.8), 2v_{0}=2^{-(n-1)}v>M_{d}^{-2d}\frac{v^{d}}{(C|\lambda|)^{2d}}\log^{-2d(d-1)}\frac{\sqrt{v}}{C|\lambda|}=\bigl{(}\frac{1}{M_{d}}x\log^{-(d-1)}x\bigr{)}\vphantom{)}^{2d}\geq\bigl{(}\sqrt{2C_{2}}\bigr{)}\vphantom{)}^{2d}.
∎
From now on we assume that C is large enough, so that 2v0≥(2C2)d. Now Corollary 3.5 applies:
[TABLE]
3.14 Lemma**.**
If Md≥2(2d)d−1, then C∣λ0∣≤logd−1v0S(v0).
Proof.
Assume the contrary. Using (3.12), 2C⋅2−n/2∣λ∣≥C∣λ0∣>logd−1(2−nv)S(2−nv)=2−n/2⋅22dnv2dd−1log−(d−1)(2−nv); using (3.8), Md2dvd−1(C∣λ∣)2dlog2d(d−1)C∣λ∣v≤2n<vd−1(2C∣λ∣)2dlog2d(d−1)(2−nv) and \bigl{(}2d\log\frac{\sqrt{v}}{C|\lambda|}\bigr{)}\vphantom{)}^{d-1}\leq\frac{1}{2}M_{d}\log^{d-1}\frac{\sqrt{v}}{C|\lambda|}<\log^{d-1}(2^{-n}v).
For d=1 it means 1<1. Assume d≥2. We have
2dlogC∣λ∣v<log(2−nv) and
\bigl{(}\frac{\sqrt{v}}{C|\lambda|}\bigr{)}\vphantom{)}^{2d}<2^{-n}v, that is, 2^{n}<v\bigl{(}\frac{C|\lambda|}{\sqrt{v}}\bigr{)}\vphantom{)}^{2d}=\frac{(C|\lambda|)^{2d}}{v^{d-1}}. Using (3.8), Md2dlog2d(d−1)C∣λ∣v<1, that is, Mdlogd−1C∣λ∣v<1, which cannot be true, as was shown in the proof of
Lemma 3.9.
∎
From now on we assume that Md≥2(2d)d−1, so that
C∣λ0∣≤logd−1v0S(v0). Also, v0≥C2d by 3.13, and C2≤C.
Now (3.1) applies: fv0(λ0)≤C2λ02; and
therefore,
We rewrite the sum as \sum_{k=1}^{n}(2^{-(n-k)}v)^{-1/d}\log^{-(d-1)}S(v_{k})\leq\linebreak\bigl{(}\log^{-(d-1)}S(2v_{0})\bigr{)}\vphantom{)}(2^{-n}v)^{-1/d}\sum_{k=1}^{\infty}2^{-k/d}, note that log−(d−1)S(2v0)=O(1) (since C2>1 and 2v0≥(2C2)d≥2)
and see that the given sum is \mathcal{O}\bigl{(}\bigl{(}\frac{2^{n}}{v}\bigr{)}\vphantom{)}^{1/d}\bigr{)}\vphantom{)}. By (3.8), \bigl{(}\frac{2^{n}}{v}\bigr{)}\vphantom{)}^{1/d}<2^{1/d}M_{d}^{2}\frac{(C|\lambda|)^{2}}{v}\log^{2d-2}\frac{\sqrt{v}}{C|\lambda|}=\linebreak\mathcal{O}\bigl{(}\bigl{(}\frac{C|\lambda|}{\sqrt{v}}\bigr{)}\vphantom{)}^{2}\log^{2d-2}\frac{\sqrt{v}}{C|\lambda|}\bigr{)}\vphantom{)}. It remains to note that vC∣λ∣log2d−2C∣λ∣v=O(1), since the function
x↦xlog2d−2x is bounded on [1,∞)
and, using (3.7), vC∣λ∣≤logdv1≤(dlogC)d1≤dd1≤1.
∎
By (3.15) and Lemma 3.16, ∣λ∣vfv(λ)≤v0C2∣λ0∣+NdCv∣λ∣ for some constant Nd dependent on d
only. By (3.12), v0∣λ0∣≤2n/2v02∣λ∣=v2∣λ∣. Thus, f_{v}(\lambda)\leq|\lambda|\sqrt{v}\bigl{(}C_{2}\frac{2|\lambda|}{\sqrt{v}}+N_{d}C\frac{|\lambda|}{\sqrt{v}}\bigr{)}\vphantom{)}\leq(2C_{2}+N_{d}C)\lambda^{2}.
And finally, fv,2C2+NdC(λ)≤fv,C2(λ)=fv(λ).
∎