Large Deviations and the Lukic Conjecture

Jonathan Breuer; Barry Simon; Ofer Zeitouni

arXiv:1703.00653·math.SP·November 14, 2018

Large Deviations and the Lukic Conjecture

Jonathan Breuer, Barry Simon, Ofer Zeitouni

PDF

TL;DR

This paper employs large deviation techniques to establish higher order sum rules for orthogonal polynomials on the unit circle, providing partial proof of Lukic's conjecture in a specific singular case, which contrasts with the failure of Simon’s conjecture.

Contribution

It proves a significant part of Lukic's conjecture for the case of two singular points, advancing understanding of sum rules and orthogonal polynomials.

Findings

01

Proves one half of Lukic's conjecture for two singular points

02

Supports the validity of Lukic's conjecture where Simon's fails

03

Uses large deviation approach to sum rules in orthogonal polynomial theory

Abstract

We use the large deviation approach to sum rules pioneered by Gamboa, Nagel and Rouault to prove higher order sum rules for orthogonal polynomials on the unit circle. In particular, we prove one half of a conjectured sum rule of Lukic in the case of two singular points, one simple and one double. This is important because it is known that the conjecture of Simon fails in exactly this case, so this paper provides support for the idea that Lukic's replacement for Simon's conjecture might be true.

Equations466

Φ_{n + 1} (z) = z Φ_{n} (z) - \overline{α}_{n} Φ_{n}^{*} (z); Φ_{0} \equiv 1; Φ_{n}^{*} (z) = z^{n} \overline{Φ_{n} (\frac{1}{z ˉ})}

Φ_{n + 1} (z) = z Φ_{n} (z) - \overline{α}_{n} Φ_{n}^{*} (z); Φ_{0} \equiv 1; Φ_{n}^{*} (z) = z^{n} \overline{Φ_{n} (\frac{1}{z ˉ})}

H (\frac{d θ}{2 π} μ) = - n = 0 \sum M lo g (1 - ∣ α_{n} ∣^{2})

H (\frac{d θ}{2 π} μ) = - n = 0 \sum M lo g (1 - ∣ α_{n} ∣^{2})

H (ν ∣ μ) = {\int lo g (\frac{d ν}{d μ}) d ν, \infty, \mbox i f ν \mbox i s μ \mbox - - a . c . \mbox o t h er w i se .

H (ν ∣ μ) = {\int lo g (\frac{d ν}{d μ}) d ν, \infty, \mbox i f ν \mbox i s μ \mbox - - a . c . \mbox o t h er w i se .

j = 0 \sum \infty ∣ α_{j} ∣^{2} < \infty ⟺ \int lo g (w (θ)) \frac{d θ}{2 π} > - \infty

j = 0 \sum \infty ∣ α_{j} ∣^{2} < \infty ⟺ \int lo g (w (θ)) \frac{d θ}{2 π} > - \infty

d μ (θ) = w (θ) \frac{d θ}{2 π} + d μ_{s}

d μ (θ) = w (θ) \frac{d θ}{2 π} + d μ_{s}

- \int (1 - cos θ) lo g (w (θ)) \frac{d θ}{2 π}

- \int (1 - cos θ) lo g (w (θ)) \frac{d θ}{2 π}

+ n = 0 \sum \infty [- lo g (1 - ∣ α_{n} ∣^{2}) - ∣ α_{n} ∣^{2}]

\int (1 - cos θ) lo g (w (θ)) \frac{d θ}{2 π} > - \infty ⟺ n = 0 \sum \infty ∣ α_{n + 1} - α_{n} ∣^{2} + ∣ α_{n} ∣^{4} < \infty

\int (1 - cos θ) lo g (w (θ)) \frac{d θ}{2 π} > - \infty ⟺ n = 0 \sum \infty ∣ α_{n + 1} - α_{n} ∣^{2} + ∣ α_{n} ∣^{4} < \infty

\int j = 1 \prod k (1 - cos (θ - θ_{j}))^{m_{j}} lo g (w (θ)) d θ > - \infty

\int j = 1 \prod k (1 - cos (θ - θ_{j}))^{m_{j}} lo g (w (θ)) d θ > - \infty

j = 1 \prod k (S - e^{i θ_{j}})^{m_{j}} α \in ℓ^{2}

j = 1 \prod k (S - e^{i θ_{j}})^{m_{j}} α \in ℓ^{2}

α \in ℓ^{2 m + 2} m = j = 1, \dots, k max m_{j}

α \in ℓ^{2 m + 2} m = j = 1, \dots, k max m_{j}

(S α)_{n} = α_{n + 1}

(S α)_{n} = α_{n + 1}

\int (1 - cos θ)^{2} (1 + cos θ) lo g (w (θ)) \frac{d θ}{2 π} = - \infty

\int (1 - cos θ)^{2} (1 + cos θ) lo g (w (θ)) \frac{d θ}{2 π} = - \infty

(L_{1} 1)

(L_{1} 1)

(L_{1} 2)

(L_{1} 2)

(L_{1} 3)

(L_{1} 3)

(L_{3} 1)

(L_{3} 1)

(L_{3} 2)

(L_{3} 2)

(L_{3} 3)

(L_{3} 3)

(L_{2} 1)

(L_{2} 1)

(L_{2} 2)

(L_{2} 2)

f_{k}^{#} = \int_{0}^{2 π} e^{- ik θ} f (e^{i θ}) \frac{d θ}{2 π}

f_{k}^{#} = \int_{0}^{2 π} e^{- ik θ} f (e^{i θ}) \frac{d θ}{2 π}

a^{♭} (e^{i θ}) = k = - \infty \sum \infty a_{k} e^{ik θ}

a^{♭} (e^{i θ}) = k = - \infty \sum \infty a_{k} e^{ik θ}

F (e^{i θ}) = k = - M \sum M F_{k}^{#} e^{ik θ}

F (e^{i θ}) = k = - M \sum M F_{k}^{#} e^{ik θ}

(F f)_{k}^{#}

(F f)_{k}^{#}

= j = - M \sum M F_{j}^{#} f_{k - j}^{#}

Q (S) F (S) a \in ℓ^{p} (Z)

Q (S) F (S) a \in ℓ^{p} (Z)

Q_{+} (S) F_{+} (S) a \in ℓ^{p} (N)

F (S) a = G (S) Q (S) F (S) a \in ℓ^{p}

F (S) a = G (S) Q (S) F (S) a \in ℓ^{p}

j = 1, j \neq = q \prod k (S - e^{i θ_{j}})^{m_{j}} (S - e^{i θ_{q}})^{m_{q}} β^{(q)} \in ℓ^{2}

j = 1, j \neq = q \prod k (S - e^{i θ_{j}})^{m_{j}} (S - e^{i θ_{q}})^{m_{q}} β^{(q)} \in ℓ^{2}

(L_{3} 1)

(L_{3} 1)

(L_{3} 2)

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Large Deviations and the Lukic Conjecture

Jonathan Breuer1,4, Barry Simon2,5,

and Ofer Zeitouni3,6

Abstract.

We use the large deviation approach to sum rules pioneered by Gamboa, Nagel and Rouault to prove higher order sum rules for orthogonal polynomials on the unit circle. In particular, we prove one half of a conjectured sum rule of Lukic in the case of two singular points, one simple and one double. This is important because it is known that the conjecture of Simon fails in exactly this case, so this paper provides support for the idea that Lukic’s replacement for Simon’s conjecture might be true.

Key words and phrases:

sum rules, large deviations, orthogonal polynomials

2010 Mathematics Subject Classification:

60F10,35P05,42C05

1 Institute of Mathematics, The Hebrew University, 91904 Jerusalem, Israel. E-mail: [email protected]

2 Departments of Mathematics and Physics, Mathematics 253-37, California Institute of Technology, Pasadena, CA 91125. E-mail: [email protected]

3 Faculty of Mathematics, Weizmann Institute of Science, POB 26, Rehovot 76100, Israel and Courant Institute, NYU. E-mail: [email protected].

4 Research supported in part by the Israel Science Foundation (Grant no. 399/16) and in part by the United States-Israel Binational Science Foundation (Grant No. 2014337).

5 Research supported in part by NSF grant DMS-1265592 and in part by the United States-Israel Binational Science Foundation (Grant No. 2014337).

6 Research supported in part by a grant from the Israel Science Foundation.

1. Introduction

This paper is a contribution to the theory of sum rules in the spectral theory of orthogonal polynomials. The earliest such result is Szegő’s Theorem for orthogonal polynomials on the unit circle (OPUC) in Verblunsky’s form [29] of which we’ll say more soon. The modern theory was initiated by Killip–Simon [15] for orthogonal polynomials on the real line (OPRL) with considerable work by others [6, 12, 16, 17, 18, 19, 20, 27].

Here we’ll consider OPUC. Given a probability measure $\mu$ on $\partial{\mathbb{D}}$ , one can form the non–zero (in $L^{2}(\partial{\mathbb{D}},d\mu)$ ), monic orthogonal polynomials $\{\Phi_{n}\}_{n=0}^{M}$ where $M=N-1$ if $\mu$ has exactly $N$ points in its support and $M=\infty$ if $\mu$ has infinitely many points in its support. In the case there are exactly $N$ points, one defines $\Phi_{N}$ to be the unique degree $N$ monic polynomial vanishing at all $N$ points (so $\Phi_{N}=0$ in $L^{2}(\partial{\mathbb{D}},d\mu)$ ). The recursion (aka Verblunsky) coefficients, $\{\alpha_{j}\}_{j=0}^{M}$ , are given by the recursion relations, $0\leq j<M+1$ :

[TABLE]

For $N=\infty$ , $\{\alpha_{j}\}_{j=0}^{\infty}\in{\mathbb{D}}^{\infty}$ (see [22]) and for $N<\infty$ , only $\alpha_{0},\dots,\alpha_{N-1}$ are defined (since $\Phi_{k}$ is only defined for $k\leq N$ ) and $\alpha_{k}\in{\mathbb{D}},k=0,\dots,N-2,\,\alpha_{N-1}\in\partial{\mathbb{D}}$ .

Verblunsky’s Theorem states that there is a one–one correspondence, ${{\mathcal{V}}}$ , from probability measures to Verblunsky coefficients with the above restrictions, i.e. $\text{\rm{ran}}({{\mathcal{V}}}\restriction$ measures with infinite support) $=\prod_{j=0}^{\infty}{\mathbb{D}}$ and $\text{\rm{ran}}({{\mathcal{V}}}\restriction n$ –point measures) $=\prod_{j=0}^{n-2}{\mathbb{D}}\times\partial{\mathbb{D}}$ . Moreover in the natural topologies, ${{\mathcal{V}}}$ is a homeomorphism.

Szegő’s Theorem in Verblunsky form says that

[TABLE]

where $H(\nu|\mu)$ is the Kullback-Leibler (KL) divergence (aka $\pm$ the relative entropy, depending on the sign convention for the relative entropy):

[TABLE]

(1.2) always holds although both sides may be $+\infty$ . (The latter is the case if, e.g., $M<\infty$ since in that case the $n=M$ term in the sum is $-\log(0)=\infty$ ). In particular, the condition that both sides are finite at the same time implies

[TABLE]

where

[TABLE]

where $d\mu_{s}$ is singular w.r.t $d\theta$ . Simon [24] calls a result like (1.4) that gives equivalence of spectral data and coefficient data a “spectral theory gem”. (1.4) in particular implies the existence of measures with arbitrarily bad singular part mixed in with a.c. spectrum and with $\ell^{2}$ decaying Verblunsky coefficients.

The current paper is devoted to higher order sum rules of which the first is that of Simon [22, Section 2.8]:

[TABLE]

where $\alpha_{-1}\equiv-1$ . This implies the gem

[TABLE]

In the same section, Simon conjectured (wrongly as we’ll see!) that for $\theta_{1},\dots,\theta_{k}$ distinct in $[0,2\pi)$ and $m_{1},\dots,m_{k}$ strictly positive integers we have that

[TABLE]

if and only if

[TABLE]

and

[TABLE]

In (1.9), $S$ is the operator

[TABLE]

Moreover, Simon–Zlatoš [27] proved this conjecture in case $\sum_{j=1}^{k}m_{j}=2$ , i.e. $k\leq 2$ and $(m_{1},m_{2})=(2,0)$ or $(1,1)$ . For simplicity the remainder of this section will mainly discuss the case $\theta_{1}=0,\,\theta_{2}=\pi$ although the next two sections will revert to the general case. We’ll use the symbol $(m_{1},m_{2})$ to describe this case.

In [18], Lukic found a counterexample to this conjecture for the $(2,1)$ case. He found an explicit example where (S1), (S2) hold but

[TABLE]

To have any hope of an equivalence one needs more that (S1), (S2). Lukic made an improved conjecture that replaced (S1), (S2) by

[TABLE]

Lukic also proved a flawed gem, i.e. an equivalence under an a priori condition on the Verblunsky coefficients, that provides evidence for his conjecture. In Section 9 we’ll obtain some additional evidence for the correctness of the Lukic conjecture. In Section 2, we’ll consider equivalent versions of Lukic’s conditions that are directly expressible in terms of $\alpha$ without reference to a decomposition as a sum. In a sense, $\beta^{(j)}$ is the part of $\alpha$ localized near $\theta_{j}$ in Fourier space, so that for the $(2,1)$ case Lukic’s conditions are equivalent to (the $\textrm{(L}_{2}\textrm{)}$ conditions will appear in the next section).

[TABLE]

In this case $\textrm{(L}_{3}\textrm{1)}\equiv\textrm{(S1)}$ , $\textrm{(L}_{3}\textrm{3)}\equiv\textrm{(S2)}$ and $\textrm{(L}_{3}\textrm{2)}$ is an extra condition. The precise result we’ll prove in Section 9 is that the $\textrm{(L}_{3})$ conditions imply the finiteness of the integral in (1.12) (at least when the $\alpha$ s are real).

Recently, Gamboa, Nagel and Rouault [9] (henceforth GNR; see also [10, 11]) discovered a new approach to Szegő’s Theorem (and the Killip–Simon Theorem) using the theory of large deviations (LD). We wrote a pedagogical presentation of some of these ideas [4]. Our main goal in this paper is to use large deviation methods to study higher order sum rules. We note that GNR [11] discussed (1.6) using LD methods although for technical reasons, they were unable to prove the actual sum rules. Below we will assume the reader familiar with some of the basics of LD theory either from books [7, 5] or from our paper [4].

In Section 3, we will prove a sum rule and gem where one side of the gem is the integral in (1.8). In general, the other side of the gem will be a very complicated polynomial in the $\alpha$ ’s (with some non–polynomial terms of the form $\log(1-|\alpha|^{2})$ ). This leads to a new insight. The Lukic conjecture (if true) provides much more humane conditions on the $\alpha$ ’s than what one gets from the naive sum rule. We note that we suspect that our sum rules are identical to the ones found by Denissov–Kupin [6] who did not carry through the examples of Sections 4-9. Section 4 will use these ideas to get the sum rule (1.6) in a new way.

In the last four sections, we make two simplifying assumptions:

(A1)

when $k=2,\,\theta_{1}=0,\,\theta_{2}=\pi$ (essentially $\theta_{2}-\theta_{1}=\pi$ is what is important) and we’ll also consider the symmetric situation where one has general $k$ points symmetrically arranged as the roots of unity with all $m_{j}=1$ .

(A2)

$\bar{\alpha}_{j}=\alpha_{j}$ for all $j\geq 0$ .

These are mainly to make the sometimes involved calculations simpler. We have no doubt that one can do the calculations without (A2) and suspect one can drop (A1) although with some effort.

In Sections 5 and 6, we recover the Simon–Zlatoš gems (i.e. $(1,1)$ and $(2,0)$ ), at least under the assumptions (A1)–(A2). One thing we’ll see in these sections is that it is simpler to show that the conditions on Verblunsky coefficients imply the measure condition than the converse so in the last two sections, we’ll settle for the simpler half. In Section 7, we’ll prove this one direction for $k$ equally spaced points, all with order 1, that is we’ll prove that $\sum_{n=1}^{\infty}|\alpha_{n+k}-\alpha_{n}|^{2}+|\alpha_{n}|^{4}<\infty\Rightarrow\int(1-\cos k\theta)\log(w(\theta))\,d\theta>-\infty$ and in Section 8, we discuss an arbitrarily high order single singular point under the hypothesis that $\alpha\in\ell^{4}$ . The results in Sections 7 and 8 are not new but recover, using new methods, special cases of results of Golinskii–Zlatoš [12]. In Section 9, we will prove that for the $(2,1)$ case under (A1)/(A2), the Lukic conditions imply finiteness of the integral. Recall that this is a case where the Simon conditions do not imply finiteness of the integral so we regard this as strong evidence for the Lukic conjecture.

We believe our main results in this paper are the general sum rule and gem and the realization that the Lukic conjecture is just about finding a simpler version of the naive Verblunsky coefficient side. In addition we show how to use LD methods to recover the gems of Simon and Simon–Zlatoš and some results of Golinskii–Zlatoš. Finally, we provide evidence for the general Lukic conjecture by finding a situation where his conditions imply the finiteness of the relevant integral and where Simon’s do not.111Building on the general sum rules of this paper, Jun Yan developed an algebraic machinery that allowed him to obtain new examples where (one half of) the Lukic conjecture can be verified. We refer to [30] for details.

We thank Peter Yuditskii for telling two of us about [9] and Fabrice Gamboa, Jan Nagel and Alain Rouault for useful discussions.

2. The Lukic Condition

In this section, we want to discuss some equivalent forms of the Lukic conditions $\textrm{(L}_{1}\textrm{1-3)}$ . This and some of the analysis in later sections will require some discrete hard analysis that we set up here. First, we’ll consider

[TABLE]

In some sense, $\textrm{(L}_{2}\textrm{2)}$ says that in “ $\theta$ –space” $\alpha$ is locally $\ell^{2m_{q}+2}$ near $\theta=\theta_{q}$ . Our first result is

Theorem 2.1.

$\textrm{(L}_{1}\textrm{1-3)}\iff\textrm{(L}_{2}\textrm{1-2)}$ **

Remark.

The same argument shows that (S1-2) are equivalent to (2.1) and (2.2) but with $2m_{q}+2$ replaced by $2\max m_{j}+2$ . This illustrates the difference between the Simon and Lukic conditions.

The proof will depend on momentum space localization. We can view $\ell^{q}({\mathbb{N}})$ as a subspace of $\ell^{q}({\mathbb{Z}})$ and define $P:\ell^{q}({\mathbb{Z}})\to\ell^{q}({\mathbb{N}})$ by restricting $\{a_{n}\}_{n=-\infty}^{\infty}$ to $\{a_{n}\}_{n=0}^{\infty}$ . We can think of $P$ either as a map between spaces which clearly has norm $1$ or as a map of $\ell^{q}({\mathbb{Z}})$ to itself whose range is those $a$ with $a_{n}=0$ for all $n<0$ . In the latter view, $P$ is a projection of norm $1$ . We can extend $S$ to $\ell^{q}({\mathbb{Z}})$ by setting $(Sa)_{n}=a_{n+1}$ . This $S$ is an invertible isometry (on $\ell^{q}({\mathbb{N}})$ it doesn’t have a left inverse).

$S$ is unitary on $\ell^{2}({\mathbb{Z}})$ with spectrum all of $\partial{\mathbb{D}}$ , so, by the spectral theorem, we can define $F(S)$ on $\ell^{2}({\mathbb{Z}})$ for any $F\in L^{\infty}({\mathbb{D}})$ and then $F_{+}(S)$ on $\ell^{2}({\mathbb{N}})$ by $a\mapsto PF(S)a$ . These are sometimes called Laurent and Toeplitz operators respectively. $F(S)$ is made most transparent by using Fourier transform, $f\mapsto f^{\#}$ , mapping $L^{2}(\partial{\mathbb{D}},\tfrac{d\theta}{2\pi})$ to $\ell^{2}({\mathbb{Z}})$ by

[TABLE]

These are, of course, Fourier coefficients of $f$ in the orthonormal basis of $L^{2}(\partial{\mathbb{D}},\tfrac{d\theta}{2\pi})$ , $\{e^{ik\theta}\}_{k=-\infty}^{\infty}$ , so we can define $a\mapsto a^{\flat}$ from sequences to functions by defining (with convergence in $L^{2}$ –sense):

[TABLE]

Then $(f^{\#})^{\flat}=f$ .

If $F$ is a trigonometric polynomial so

[TABLE]

then, for $f\in L^{2}$ ,

[TABLE]

i.e. $F(S)$ is convolution with $F^{\#}$ . If $F\in C^{\infty}(\partial{\mathbb{D}})$ , by a simple argument (see [25, Section 6.3]), $F_{k}^{\#}$ decays faster than any inverse polynomial, so, in particular, $F^{\#}\in\ell^{1}$ . Taking limits in (2.5), we see that formula still holds but with $M$ replaced by $\infty$ . Thus, since $F^{\#}\in\ell^{1}$ , we see that as maps on $\ell^{p}({\mathbb{Z}})$ or $\ell^{p}({\mathbb{N}})$ , $a\mapsto F(S)a$ maps $\ell^{p}$ to itself, and since $P$ maps $\ell^{p}{({\mathbb{N}})}$ to itself, we see that $F_{+}(S)$ map $\ell^{p}({\mathbb{N}})$ to itself i.e.

Proposition 2.2.

$a\mapsto F(S)a$ * maps any $\ell^{p}({\mathbb{Z}})$ to itself and $a\mapsto F_{+}(S)a$ maps any $\ell^{p}({\mathbb{N}})$ for $1\leq p<\infty$ for any $C^{\infty}$ function, $F$ , on $\partial{\mathbb{D}}$ .*

In particular, we can localize in $\theta$ –space by picking a convenient partition of unity on $\partial{\mathbb{D}}$ and writing $a=\sum_{j=1}^{k}J_{j}(S)a$ .

Corollary 2.3.

Let $Q(z)$ be a Laurent polynomial on ${\mathbb{C}}\setminus\{0\}$ . Let $F$ be a $C^{\infty}$ function on $\partial{\mathbb{D}}$ so that $Q(z)$ has no zeros in the support of $F$ . Suppose that $a$ lies in some $\ell^{q}$ . Let $1\leq p<\infty$ . Then

[TABLE]

Proof.

Suppose first we are dealing with the maps on $\ell^{q}({\mathbb{Z}})$ . By the zero condition, it is easy to find a $C^{\infty}$ function, $G$ , on $\partial{\mathbb{D}}$ so that $G(z)Q(z)F(z)=F(z)$ for all $z\in\partial{\mathbb{D}}$ . Thus, if $Q(S)F(S)a\in\ell^{p}$ , then

[TABLE]

since $G(S)$ maps $\ell^{p}$ to $\ell^{p}$ .

Now suppose $(a_{n})\in\ell^{q}({\mathbb{N}})$ and extend it to ${\mathbb{Z}}$ by $a_{n}=0$ for $n<0$ . Since $F(S)$ is convolution with a function of very rapid decay, $F(S)a$ and $Q(S)F(S)a$ both have rapid decay to the left so since $PQ(S)PF(S)a$ lies in $\ell^{{p}}({{\mathbb{N}}})$ , we see that $Q(S)PF(S)a$ lies in $\ell^{p}({{\mathbb{Z}}})$ . Since $F(S)a$ has rapid decay on the left, $Q(S)(1-P)F(S)a$ lies in $\ell^{p}({{\mathbb{Z}}})$ and so $Q(S)F(S)a$ lies in $\ell^{p}({{\mathbb{Z}}})$ as well. By the argument in the first paragraph, $F(S)a$ lies in $\ell^{p}({{\mathbb{Z}}})$ so $PF(S)a$ lies in $\ell^{p}({{\mathbb{N}}})$ . ∎

Proof of Theorem 2.1.

( $\textrm{L}_{2}\Rightarrow\textrm{L}_{1}$ ) Let ${\alpha}$ obey $\textrm{L}_{2}$ . Pick $\{J_{j}\}_{j=1}^{k}$ , $C^{\infty}$ functions on $\partial{\mathbb{D}}$ so that $J_{j}\geq 0,\sum_{j=1}^{k}J_{j}=1$ and $J_{j}$ vanishes in the neighborhood of $\{\theta_{\ell}\}_{\ell\neq j}$ . Let $\beta^{(j)}=PJ_{j}(S)\alpha$ . $\textrm{(L}_{1}\textrm{1)}$ follows from $\sum_{j=1}^{k}J_{j}=1$ . Since $J_{j}(S)$ commutes with any polynomial in $S$ , by (2.1),

[TABLE]

(with a small argument to deal with the P operator) so, by Corollary 2.3, (1.14) holds. A similar argument shows that (2.2) implies (1.15).

( $\textrm{L}_{1}\Rightarrow\textrm{L}_{2}$ ) Suppose ${\alpha}$ obeys $\textrm{L}_{1}$ . Since polynomials in $S$ map $\ell^{p}$ to itself, (1.14) $\Rightarrow\prod_{j=1}^{k}(S-e^{i\theta_{j}})^{m_{j}}\beta^{(q)}\in\ell^{2}$ , so by (1.13), we have (2.1). By (1.14), if $r\neq q$ , then $\prod_{j\neq q}(S-e^{i\theta_{j}})^{m_{j}}\beta^{(r)}\in\ell^{2}\subset\ell^{2m_{q}+2}$ . Also (1.15) implies $\prod_{j\neq q}(S-e^{i\theta_{j}})^{m_{j}}\beta^{(q)}\in\ell^{2m_{q}+2}$ . Therefore, by (1.13), we get (2.2). ∎

For comparison with Simon’s conjecture, the following version (which appeared already in the last section) is useful. Let $m=\sup_{j}m_{j}$ ,

[TABLE]

Theorem 2.4.

$\textrm{(L}_{1}\textrm{1-3)}\iff\textrm{(L}_{3}\textrm{1-3)}$ **

Proof.

Clearly, (2.11) implies (2.2) when $m=m_{j}$ , so $\textrm{(L}_{3}\textrm{1-3)}\Rightarrow\textrm{(L}_{2}\textrm{1-2)}\Rightarrow\textrm{(L}_{1}\textrm{1-3)}$ .

On the other hand, by Theorem 2.1, $\textrm{(L}_{1}\textrm{1-3)}\Rightarrow\textrm{(L}_{3}\textrm{1-{2})}$ and trivially, (1.13) and (1.15) $\Rightarrow$ (2.11) ∎

To find some equivalent forms of the Lukic conditions, it will be useful to have the following:

Theorem 2.5.

For any sequence ${\alpha}\in\ell^{2}({\mathbb{Z}})$ of finite support, we have that:

[TABLE]

Remarks.

This is a discrete case of an inequality on derivatives due to Gagliardo [8] and Nirenberg [21]; see Simon [26, Section 6.3] and Taylor [28]. Here $S-1$ replaces $\tfrac{d}{dx}$ . The general version (with essentially the same proof) is

[TABLE]

for $k\geq 1,1\leq p\leq k$ . (2.12) is $p=2,k=3$ .

Once one has Theorem 2.5 then it is easy to show, by dominated convergence, that ${\alpha}\in\ell^{6},\,(S-1)^{2}{\alpha}\in\ell^{2}\Rightarrow(S-1){\alpha}\in\ell^{3}$ and that (2.12) holds even without the condition on finite support of ${\alpha}$ .
This result is in [27] and probably other places but the proof is so simple that we give it for the reader’s convenience.
(2) below can be thought of as resulting from a summation by parts.

Proof.

Given ${\alpha}$ , define $|{\alpha}|$ by $|\alpha|_{n}\equiv|\alpha_{n}|$ . We begin by noting that for $a,b\in{\mathbb{C}}$ , we have by the triangle inequality that

[TABLE]

so that if ${\alpha}\leq{\beta}\iff\alpha_{n}\leq\beta_{n}$ for all $n$ , then

[TABLE]

Note next that Leibniz rule takes the form (where $({\alpha}{\beta)}_{n}=\alpha_{n}\beta_{n}$ )

[TABLE]

so

[TABLE]

Choose ${\beta}={\alpha},\,{\gamma}=(S-1){\bar{\alpha}},\,{\kappa}=|(S-1){\alpha}|$ and use the fact that a sum of $(S-1){\tau}$ is zero when ${\tau}$ has finite support (because of telescoping) to see that if $\alpha$ has finite support, then

[TABLE]

where we used (2.14) to bound $|(S-1)|(S-1){\alpha}||$ by $|(S-1)^{2}{\alpha}|$ .

Hölder’s inequality and $\tfrac{1}{6}+\tfrac{1}{3}+\tfrac{1}{2}=1$ says that the first sum on the right is bounded by $\lVert S\alpha\rVert_{6}\lVert(S-1)\alpha\rVert_{3}\lVert(S-1)^{2}\bar{\alpha}\rVert_{2}=\lVert\alpha\rVert_{6}\lVert(S-1)\alpha\rVert_{3}\lVert(S-1)^{{2}}\alpha\rVert_{2}$ . The second sum has the same bound which shows that

[TABLE]

which implies (2.12) ∎

Let us focus on the case $(\theta_{1},\theta_{2},m_{1},m_{2})=(0,\pi,2,1)$ , so we have

[TABLE]

We want to note that

Theorem 2.6.

$\textrm{(L}_{3}\textrm{1-3)}$ * for $(\theta_{1},\theta_{2},m_{1},m_{2})=(0,\pi,2,1)$ is equivalent to*

[TABLE]

Moreover, one also has that if these conditions hold, then

[TABLE]

Remarks.

The proof shows that when $\textrm{(L}_{3}\textrm{1)}$ and $\textrm{(L}_{3}\textrm{3)}$ hold, then $(S-1)\alpha\in\ell^{4}$ is equivalent to $(S-1)^{k+1}\alpha\in\ell^{4}$ for any k fixed $k=1,2,\dots$ .
The example $\alpha_{n}=(n+1)^{-1/5}$ obeys $\textrm{(L}_{4}\textrm{1-3)}$ but doesn’t have $\alpha\in\ell^{4}$ .

Proof.

Clearly $\textrm{(L}_{4}\textrm{1-3)}\Rightarrow\textrm{(L}_{3}\textrm{1-3)}$ since $S-1$ maps $\ell^{4}$ to itself. So suppose we have $\textrm{(L}_{3}\textrm{1-3)}$ . Applying (2.12) to $(S+1)\alpha$ and noting that $\alpha\in\ell^{6}\Rightarrow(S+1)\alpha\in\ell^{6}$ , we conclude that $(S-1)(S+1)\alpha=(S^{2}-1)\alpha\in\ell^{3}$ proving (2.24).

Since $p>q\Rightarrow\ell^{q}\subset\ell^{p}$ , we see that $(S^{2}-1)\alpha\in\ell^{4}$ . Thus

[TABLE]

∎

3. Sum Rules

In this section, we’ll explain how to use LD methods to obtain sum rules for any choice of $\{m_{j}\}_{j=1}^{k}$ and $\{\theta_{j}\}_{j=1}^{k}$ where one side is (1.8). The sum rules imply gems. In fact, it will be easier to obtain the gems and we’ll prove them first as part of the proof of sum rules. While we haven’t tried to prove it in general, we believe our sum rules are the same as those of Denisov–Kupin [6] obtained using the method of Nazarov et. al. [20].

We begin by finding matrix models whose LDP on the spectral side involves (1.8) up to constants. Our basic random matrix measures will have the form

[TABLE]

where $Z_{N}$ is a normalization factor, $Q$ is a function of $U$ of the form

[TABLE]

where $V$ is a Laurent polynomial

[TABLE]

(if $c_{k}\neq 0$ and/or $c_{-k}\neq 0$ , we say that $k$ is the degree of $Q$ or $V$ ) and where ${\mathbb{H}}_{N}$ is Haar measure (aka circular unitary ensemble, $\mbox{\rm CUE}(n)$ ). GNR [9, 11] also discussed these models, especially the case $V(e^{i\theta})=\cos\theta$ (discussed first, in a different context, by Gross–Witten [13] whose name GNR assign to the model) but they do not prove sum rules or gems for these models.

There is a huge literature on these matrix models, discussed for example in [2, Section 2.7]. Much of the literature discusses perturbations of GUE rather than CUE but the results that we need extend to CUE, which is technically simpler because random unitary matrices, unlike random self–adjoint matrices, are automatically uniformly bounded. A major result (see, for example, [2, Section 2.6]) is that the associated limit of empirical measures (aka density of states), $d\eta$ , obeys

[TABLE]

for some constant $C$ (which when we start with $\eta$ we will take to be zero).

Any fixed vector, $\varphi\in{\mathbb{C}}^{n}$ , is a cyclic vector for a.e. $U\in{\mathbb{U}}_{n}$ . Associated to each such $U$ is a probability measure $\mu$ on $\partial{\mathbb{D}}$ which is an $n$ –point measure with masses at the eigenvalues of $U$ and weights the absolute square of the components of $\varphi$ in the corresponding eigenvectors. Thus picking $\varphi$ (conventionally to be $\delta_{1}=(1,0,\dots,0)$ ), we get a many-to-one correspondence between a set of unitaries of full measure and all $n$ -point spectral measures. Thus the measure in (3.1) induces a probability measure on $n$ -point probability measures and so on sets of Verblunsky coefficients. The unitaries $U$ and $U^{\prime}$ correspond to the same spectral measure if and only if there is a unitary ${W}$ which has $\varphi$ as an eigenvector with $U^{\prime}={W}U{W}^{-1}$ . It is important to notice that the spectral measure determines the eigenvalues of $U$ and so $\text{\rm{Tr}}(U^{k})$ for any k, so these traces are only functions of the Verblunsky coefficients and we can compute the traces in any convenient representation of one of the unitaries associated to a given spectral measure.

The measure in (3.1) induces a measure ${\mathbb{P}}_{N}$ on $N$ –point measures (the spectral measures, viewed as elements of ${\mathcal{M}}_{+,1}(\partial{\mathbb{D}})$ ), and the Verblunsky map drags that to a measure $\widetilde{{\mathbb{P}}}_{N}$ on the set of $N$ -point Verblunsky coefficients, i.e. ${\mathbb{D}}^{N-1}\times\partial{\mathbb{D}}$ .

The measure in (3.1) induces another measure on the sequence of empirical measures $L_{N}=\frac{1}{N}\sum_{i=1}^{N}\delta_{\lambda_{i}}\in{\mathcal{M}}_{+,1}(\partial{\mathbb{D}})$ , where $\lambda_{i}$ are the eigenvalues of $U$ . Recall that if $V$ obeys (3.4), then $L_{N}$ converges a.s. as $N\to\infty$ to $\eta$ and by the method of Ben Arous–Guionnet [3], the sequence $L_{N}$ obeys a LDP (in the usual topology of weak convergence of probability measures) with speed $N^{2}$ and rate function at measure $\mu$ , $E(\mu)-E(\eta)$ where $E$ is the 2D Coulomb energy in external field which is minimized at $\mu=\eta$ (by (3.4)).

By the arguments in [4, Section 3], if the support of $\eta$ is all of $\partial{\mathbb{D}}$ and $\eta$ possesses a density with respect to Lebesgue’s measure which is strictly positive $d\theta$ -almost everywhere, one finds that the spectral measure obeys an LDP in ${\mathcal{M}}_{+,1}(\partial{\mathbb{D}})$ with speed $N$ and rate function

[TABLE]

where $H$ is given by (1.3). On the other hand, as discussed in [9] and [4], by the continuity of the map ${\mathcal{V}}$ , the latter LDP induces a LDP on the infinite sequence of Verblunsky coefficients, viewed as elements of ${\mathbb{D}}^{{\mathbb{Z}}_{+}}$ equipped with the product topology, with rate function given in terms of $I$ . By the uniqueness of the rate functions in large deviations theory, if one has an expression for the rate function in terms of the Verblunsky coefficients then one gets a sum rule with the integral in (1.8) on one side (up to constants due to the normalization of $\eta$ and a $\int\log\left(\frac{d\eta}{d\theta}\right)\,d\eta(\theta)$ ) term.

We remark that the regularity assumptions stated above for $\eta$ (namely full support and a.e. positive density) make it possible to mimic the proof in [4, Section 3] and approximate the spectral measure throughout its support; to see what goes wrong when there are gaps in the support of $\eta$ , it is enough to consider the analogous problem for Hermitian matrices where $\partial{\mathbb{D}}$ is replaced by ${\mathbb{R}}$ . In that case, there may be “stray eigenvalues” which are not controlled by the LDP for the empirical measure. We refer to [9] for a discussion of this issue, and [11] for a detailed proof of the LDP for the spectral measure in the cases treated in this paper.

In what follows, we will be interested in $\eta$ of the form

[TABLE]

which automatically satisfies the regularity assumption stated above.

In computing (3.4) with that $\eta$ , the following is useful

Proposition 3.1.

For any $n\in{\mathbb{Z}}$ , $n\neq 0$ , we have that

[TABLE]

If $n=0$ , the integral is zero.

Proof.

While this integral is in the tables, the proof is so simple we give it. Replacing $\psi$ by $\psi-\theta$ , we can suppose that $\theta=0$ . By taking complex conjugates, we can suppose that $n\leq 0$ . Write $e^{i\psi}=z$ and

[TABLE]

Then note that for $n<0$

[TABLE]

by the Cauchy integral theorem. By the Cauchy formula for Taylor coefficients and the well known series $\log(1-z)=-\sum_{n=1}^{\infty}\tfrac{z^{n}}{n}$ , for $n\leq 0$ (since the series only converges inside the disk, one needs to note that the integral over the unit circle is a limit of integrals over slightly smaller circles)

[TABLE]

∎

Thus, for $\eta$ of the form (3.6), $V$ defined by (3.4) is a Laurent polynomial with no constant term.

As a preliminary to the calculation of the Verblunsky coefficient side, we want to make two comments about the sum rules and their relation to the rate function. The first one regards the fact that rather than the integral in (1.8), the form of the rate function on the measure side is $H(\eta|\mu)$ , which involves an additional term of the form

[TABLE]

Computing this constant term is important in writing the sum rule. As an example, rather than the left side of (1.6), the LD calculation will give $H(\eta|\mu)$ where

[TABLE]

Noting that $\int(1-\cos\theta)\log(1-\cos\theta)\,\tfrac{d\theta}{2\pi}=1-\log(2)$ (which follows as in the proof of Proposition 3.1; see [22, Section 2.8]) we can write (1.6) as

[TABLE]

The right hand side has to vanish when the $\alpha_{n}$ are the Verblunsky coefficients of the measure $\eta$ (since $H(\eta|\eta)=0$ ). Let us confirm this not only as a check but because it will let us compute the constant in Section 4 when we only know the sum rule up to a constant.

The Verblunsky coefficients for the $\eta$ of (3.9) are not hard to compute [22, Example 1.6.4 and equation (1.6.14)]

[TABLE]

Since $\sum_{n=0}^{\infty}|\alpha_{n}^{(0)}|^{2}<\infty$ , we can cancel the $\tfrac{1}{2}|\alpha_{n}|^{2}$ terms in the sums on the right side in (3.10) and see that when $\alpha=\alpha^{(0)}$ the right side is

[TABLE]

The sum telescopes since $[(n+2)(n+3)]^{-1}=(n+2)^{-1}-(n+3)^{-1}$ so the sum is $1/2$ and $1-\tfrac{1}{2}-\tfrac{1}{2}=0$ . To evaluate the infinite product, note Euler’s formula that

[TABLE]

so

[TABLE]

and thus the $\log$ term in (3.12) is $-\log(1/2)$ which cancels the $-\log(2)$ . Thus, we confirm that the expression in (3.12) is [math].

The other issue concerns a huge difference in getting sum rules once a $V(\theta)$ is added to the mix. Recall that under the $\mbox{\rm CUE}(N)$ measure, i.e. in case $V=0$ , the measure $\widetilde{{\mathbb{P}}}_{N}\in{\mathcal{M}}_{+,1}({\mathbb{D}}^{N-1}\times\partial{\mathbb{D}})$ on the Verblunsky coefficients has the property that if $j<N$ , then the Verblunsky coefficients $(\alpha_{0},\dots,\alpha_{j})$ are independent of $(\alpha_{j+1},\dots,\alpha_{N-1})$ so, with $\pi_{j}$ denoting the continuous projection from $\{\alpha_{k}\}_{k=0}^{\infty}$ to $\{\alpha_{k}\}_{k=0}^{j}$ , the rate function $I_{j}$ of $\pi_{j}^{*}(\widetilde{{\mathbb{P}}}_{N})\in{\mathcal{M}}_{+,1}({\mathbb{D}}^{j+1})$ is easy to compute (see [4, Section 2] for a discussion of $\pi_{j}^{*}$ ). Since $V(U)$ has cross terms between $\alpha_{k}$ and $\alpha_{\ell}$ for suitable $k\leq j$ and $\ell>j$ (in (1.6) the $\alpha_{j+1}\alpha_{j}$ terms), one no longer has independence and the exact calculation of $I_{j}$ involves the limiting distribution of $\{\alpha_{\ell}\}_{\ell>j}$ . In the case of (1.6), we want to show that $I(\alpha)=F(\alpha_{0})+\sum_{k=0}^{\infty}G(\alpha_{k},\alpha_{k+1})$ where $G$ has a $\tfrac{1}{2}|\alpha_{k+1}-\alpha_{k}|^{2}$ piece and a piece from the $\log(1-|\alpha_{k}|^{2})+|\alpha_{k}|^{2}$ term. Instead of computing $I_{j}$ exactly, we’ll show that (up to constants) $|I_{j}(\alpha_{0},\dots,\alpha_{j-1})-\sum_{k=0}^{j-2}G(\alpha_{k},\alpha_{k+1})-F(\alpha_{0})|\leq C|\alpha_{j}|$ . This fact and Rakhmanov’s Theorem (see [23, Chapter 9]) allow one to prove that $I$ has the required form.

We begin the analysis of the general case with

Theorem 3.2.

Let $V$ be a Laurent polynomial of degree $d$ and let $U_{N}$ be an $N\times N$ unitary CMV matrix. Then there exist $N$ –independent polynomials $F_{\pm}$ and $G$ , $G$ depending on $d+1$ successive $\alpha_{j}$ ’s and $\bar{\alpha}_{j}$ ’s and $F_{\pm}$ on $d$ such variables so that

[TABLE]

Moreover, $G(0,\dots,0)=0$ .

Remarks.

The unitary, $U$ , associated to any spectral measure $\mu$ is multiplication by $\lambda$ on $L^{2}(\partial{\mathbb{D}},d\mu)$ . To get a matrix related with that spectral measure associated to $(1,0,\dots,0)$ , one needs to pick an orthonormal basis $\{e_{j}\}$ for this $L^{2}$ space with $e_{1}$ the function $1$ . [22, Chapter 4] discusses two natural bases for which the matrix elements are explicit functions of the $\alpha$ ’s and $\rho$ . One choice is the set of orthonormal polynomials for $\mu$ . This yields the GGT matrix. The other is to orthonormalize $\{1,z,z^{-1},z^{2},z^{-2},\dots\}$ which yields the CMV matrix. One issue is that for general $\mu$ , the orthonormal polynomials may not be a basis so the naive GGT matrix may not be unitary but for $n$ –point measures, it is unitary. The CMV matrix is 5 diagonal while the GGT matrix is a Hessenberg matrix, i.e. only one non-vanishing diagonal below the principal diagonal but, in general, all non–vanishing matrix elements above the diagonal. The proof of this theorem will discuss the explicit form of the CMV matrix and (9.9) below the explicit form of the GGT matrix.
These polynomials have degree at most $2d$ . (The CMV matrix has matrix elements that are products of exactly two, $\alpha$ , $\bar{\alpha}$ and $\rho$ so $G$ written in terms of the three variables is of homogeneous degree $2d$ if ${\text{\rm{Tr}}(V}(U))=\text{\rm{Tr}}(U^{d})$ but removing the $\rho$ ’s produces lower degree terms even in this special case.)
$F_{\pm},G$ are not unique. If H is any function of $d$ successive $\alpha,\bar{\alpha}$ pairs and

[TABLE]

then (3.13) holds for $(G,F_{\pm})$ if and only if it holds for $(\tilde{G},\tilde{F}_{\pm})$ .

Proof.

Recall (see [22, Section 4.2]) the ${\mathcal{L}}{\mathcal{M}}$ representation of the CMV matrix, ${\mathcal{C}}$ , which we write when $N$ is even. Define the $2\times 2$ matrices

[TABLE]

Let $\Theta_{j}\equiv\Theta(\alpha_{j})$ . Then

[TABLE]

( ${\mathcal{L}}$ is a direct sum of $N/2$ $2\times 2$ matrices while ${\mathcal{M}}$ has $1\times 1$ matrices at the top and bottom and $(N/2-1)$ $2\times 2$ in between). And one has that ${\mathcal{C}}$ (i.e. our parametrization of $U$ ) is given by

[TABLE]

We will also write $\widetilde{{\mathcal{L}}}_{j},\,j=0,2,\dots,N$ for ${\mathcal{L}}$ with $\Theta_{0},\dots,\Theta_{j-2},\Theta_{j+2},\dots,\Theta_{N-2}$ replaced by zero (only $\Theta_{j}$ remains in the direct sum) and similarly for $\widetilde{{\mathcal{M}}}_{j},\,j=-1,1,\dots,{N-1}$ (where $\Theta_{-1},\Theta_{N-1}$ are $1\times 1$ matrices. Thus we have that

[TABLE]

We note that

[TABLE]

For $N$ odd, there is a similar representation but now ${\mathcal{L}}$ has a $1\times 1$ matrix at the bottom and ${\mathcal{M}}$ only a $1\times 1$ matrix at the top.

We’ll prove the theorem when $V(z)=z^{d}$ . For $V(z)=z^{-d}$ , the argument is similar since replacing $U$ by $U^{*}$ just interchanges ${\mathcal{L}}$ and ${\mathcal{M}}$ and replaces $\alpha_{j}$ by $\bar{\alpha}_{j}$ (since $\Theta(\alpha)^{*}=\Theta(\bar{\alpha})$ ). And for $0<k<d$ , $z^{\pm k}$ yields polynomials of the same form (since functions of fewer variables can be viewed as having more variables; there will be some lost $G$ ’s near the bottom but they can be made part of $F_{+}$ ).

We’ll show first that we have the required function of exact degree $2d$ where it is a polynomial in $\alpha,\bar{\alpha}$ and $\rho$ and then that each $\rho_{j}$ occurs as an even power so using $\rho_{j}^{2}=1-\alpha_{j}\bar{\alpha}_{j}$ we get the result without any $\rho$ ’s.

We write

[TABLE]

where a symbol like $\widetilde{{\mathcal{L}}}_{n_{1};k_{1}{,}k_{2}}$ means the $k_{1}{,}k_{2}$ matrix element of the matrix $\widetilde{{\mathcal{L}}}_{n_{1}}$ . In (3.21), we sum $n_{1},\dots,n_{d},m_{1},\dots,m_{d}$ from $-1$ to $N-1$ running through even and odd integers respectively and $k_{q}\,(q=2,\dots,2d)$ running from [math] to $N-1$ . The only non–zero terms have $|k_{2p+1}-n_{p+1}|\leq 1,\,|k_{q+1}-k_{q}|\leq 1,\,|k_{2p}-m_{p}|\leq 1,\,|n_{r}-m_{r}|\leq 1,\,|m_{r}-n_{r+1}|\leq 1$ , with further restrictions since, for example, $|k_{2p+1}-n_{p+1}|\leq 1$ is actually $k_{2p+1}-n_{p+1}=0\textrm{ or }1$ and not $-1$ .

This clearly writes $\text{\rm{Tr}}(U^{d})$ as a polynomial in $\alpha,\bar{\alpha},\rho$ of homogeneous degree $2d$ . For each $j=0,\dots,N-d-1$ , group together all where the smallest index of $\alpha,\bar{\alpha},\rho$ is $j$ . It is easy to see that the resulting sum, call it $G_{j}(\alpha_{j},\bar{\alpha}_{j},\rho_{j},\dots,\rho_{j+2d-1})$ , has $G_{j}$ independent of $j$ and gives the $G$ terms. The terms with $\alpha_{-1}$ (coming from $\Theta_{-1}$ , and hence $\alpha_{-1}=1$ ) we put into $F_{-}$ and those whose smallest $j$ so that $j\geq N-d$ we put into $F_{+}$ . It is easy to see that $F_{-}$ is $N$ –independent and that the $N$ –dependence of $F_{+}$ comes only from translating the indices. Thus we have proven (3.13) except we have some $\rho$ dependence.

For each product in (3.21), the $\rho_{p}$ terms come from increasing some $k_{q}=p$ to $k_{q+1}=p+1$ or a decrease in the opposite direction and it is only through such $\rho_{p}$ terms that such an increase or decrease can happen. Since $k_{1}=k_{2d+1}=j$ and each step only increases or decreases by a single step, for every $\rho_{p}$ going in one direction, there must be one going in the other, so an even number in all.

To confirm the assertion that $G(0,\dots,0)=0$ , we prove that no term in (3.21) can only have $\rho$ ’s, that is there must be at least one $j$ with $k_{j}=k_{j+1}$ . For if $k_{j+1}=k_{j}\pm 1$ it is easy to see that either $k_{j+2}=k_{j+1}$ or with the same sign $k_{j+2}=k_{j+1}\pm 1$ , that is one can’t change direction without an $\alpha$ term. But to return where one started, one must change direction. ∎

Remark.

It is an interesting exercise to use the GGT representation [22, Section 4.1] to prove that the $\rho$ ’s only occurs in even powers and that every term in $G$ has at least one power of $\alpha$ or $\bar{\alpha}$ .

In the next theorem, we use ${\mathcal{M}}_{+1,\infty}(\partial{\mathbb{D}})$ to denote the subset of ${\mathcal{M}}_{+,1}(\partial{\mathbb{D}})$ consisting of the probability measures of infinite support on $\partial{\mathbb{D}}$ , i.e., not supported on finitely many points.

Theorem 3.3.

Let V be a potential of the form (3.4) with measure $\eta$ whose support is $\partial{\mathbb{D}}$ and let $G$ be given by (3.13). Let $I$ be the rate function from (3.5) on the measure side. Let $\pi_{L}{\circ{\mathcal{V}}}:{{\mathcal{M}}_{+,1}(\partial{\mathbb{D}})}\to{\mathbb{D}}^{L}$ mapping $\mu$ to its first $L$ Verblunsky coefficients. Let $I_{L}$ be the rate function corresponding to the LDP for $\pi_{L}^{*}(\widetilde{{\mathbb{P}}}_{N})$ , and write $I_{L}(\mu)=I_{L}(\pi_{L}\circ{\mathcal{V}}\mu)$ . There is a constant $C$ independent of $L$ and $\mu$ so that if $L>d$ and $\alpha$ is the sequence of Verblunsky coefficients of $\mu$ , then for all such $L$ and $\mu\in{\mathcal{M}}_{+1,\infty}(\partial{\mathbb{D}})$ ,

[TABLE]

Remarks.

Recall that ${\mathcal{V}}$ is the Verblunsky map taking measures to Verblunsky coefficient sequences, defined in the Introduction. The mapping $\pi_{L}$ is the projection onto the first $L$ elements.
Recall (see [4, Theorem 2.6 and Theorem 2.7]) that $\pi_{L}^{*}({\widetilde{{\mathbb{P}}}_{N}})$ obeys a LDP with speed $N$ and rate $I_{L}$ related to $I$ by

[TABLE]

Proof.

By writing the induced measures on Verblunsky coefficients according to Killip–Nenciu [14] (see Theorem 4.2 of [4]) and $e^{-N\text{\rm{Tr}}(V(U))}$ according to (3.13), we see that for $W\subset{\mathbb{D}}^{L}$ and $N>L+d+1$

[TABLE]

where $\alpha_{N-1}=e^{i\theta_{N-1}}$ and

[TABLE]

For fixed $L$ , the function $\widetilde{H}_{N,L}$ , obtained by dropping the $F_{-}$ term and all the $G(\alpha_{j},\dots,\alpha_{j}+d)$ terms where $j=L-d,\dots,L-1$ is a product of a function of $(\alpha_{0},\dots,\alpha_{L-1})$ and a function of $(\alpha_{L},\dots,\alpha_{N-1})$ . Since $\pi_{L}^{-1}[W]=W\times{\mathbb{D}}^{N-L-1}\times\partial{\mathbb{D}}$ (up to a set of zero $\widetilde{{\mathbb{P}}}_{N}$ measure), the integrals over $(\alpha_{L},\dots,\alpha_{N-1})$ in the numerator and denominator of the modified (3.24) cancel.

The modified formula defines a probability measure

[TABLE]

where

[TABLE]

Since $|\alpha_{j}|\leq 1$ and $G$ , $F_{-}$ and $F_{+}$ are polynomials, the dropped terms are bounded, so that for some constant, $C_{1}$ ,

[TABLE]

By an elementary argument (see [4, Theorems 2.1 and 2.2]), $\widetilde{{\mathbb{P}}}_{N,L}$ obeys a LDP with speed $N$ and rate function

[TABLE]

where $c_{L}$ is such that $\min_{\alpha_{0},\dots,\alpha_{L-1}}\tilde{I}_{L}(\alpha_{0},\dots,\alpha_{L-1})=0$ (forced by the condition on the function $G$ (different from our $G$ here) in [4, Theorem 2.2]).

With $I_{L}$ given by (3.23), we conclude by (3.28) that

[TABLE]

Taking $\mu_{0}=\tfrac{d\theta}{2\pi}$ for which $I(\mu_{0})=\lim I_{L}(\mu_{0})$ is finite and using $G(0,\dots,0)=0$ and $\log(1-|\alpha|^{2})|_{\alpha=0}=0$ , we conclude that $c_{L}$ is bounded as $L\to\infty$ so $\sup c_{L}\equiv C_{2}$ is finite. (3.22) follows with $C=C_{1}+C_{2}$ . ∎

While not essential, the following lovely lemma of Nazarov et. al [20, Lemma 3.1] will simplify some arguments.

Proposition 3.4.

Let $G$ be a continuous function on $\Omega^{k}$ where $\Omega\subset{\mathbb{R}}^{m}$ is compact. Suppose $0\in\Omega$ and that $G(0,\dots,0)=0$ . Let $\Omega^{\infty}_{0}$ be the sequences $\textbf{x}=(x_{1},x_{2},\dots)\in\Omega^{\infty}$ so that eventually $x_{j}=0$ $($ i.e. only finitely many $x_{j}$ are non-zero $)$ . For $\textbf{x}\in\Omega^{\infty}_{0}$ define

[TABLE]

Suppose there is a $C$ so that for all $\textbf{x}\in\Omega^{\infty}_{0}$ , $H(\textbf{x})\geq-C$ . Then, there exist continuous functions $\widetilde{G}$ on $\Omega^{k}$ and $\Gamma$ on $\Omega^{k-1}$ so that

[TABLE]

and

[TABLE]

Remark.

The point, of course, is that if we add a constant to $\Gamma$ so that $\Gamma(0,\dots,0)=0$ (which doesn’t change (3.33)), then

[TABLE]

which assures that we can extend $H$ to infinite sequences with a convergent sum or else a sum that diverges to $+\infty$ .

Theorem 3.5 (Abstract Gem).

Let V be a potential of the form (3.4) and $G$ given by (3.13). Let ${(\alpha)}\in{\mathbb{D}}^{\infty}$ and let $\mu={{\mathcal{V}}^{-1}}({\alpha})$ be the measure with those Verblunsky coefficients and $\eta$ the measure obeying (3.6). Then

[TABLE]

exists and the limit is finite if and only if $H(\eta|\mu)$ is finite.

We refer to the sum in (3.34) as the Verblunsky side of the gem.

Remark.

[12, Theorem 3.3] have a general abstract gem derived by very different means.

Proof.

By the theory of projective limits (see [4, Theorem 2.7]), $I(\mu)=\lim_{L\to\infty}I_{L}(\mu)$ . Thus by (3.22), if $I(\mu)=\infty$ , the limit in (3.34) exists and is $\infty$ .

Assume now that $I(\mu)<\infty$ . We would like to use Proposition 3.4, but first we need to restrict attention to a compact subset of the unit disc. Since $I(\mu)<\infty$ , the $d\theta$ weight of $d\mu$ is a.e. non–zero, so, by Rakhmanov’s Theorem (see [23, Chapter 9]), $\alpha_{j}(\mu)\to 0$ as $j\to\infty$ . Thus $R=\sup_{j}|\alpha_{j}(\mu)|<1$ . Let $\overline{{\mathbb{D}}}_{R}=\{z\,|\,|z|\leq R\}$ . This is compact so we can apply Proposition 3.4, (3.22) and $I(\nu)\geq 0$ for all $\nu$ to conclude that there is $G_{R}\geq 0$ and $\Gamma_{R}$ so that

[TABLE]

$G(0,\dots,0)=0\Rightarrow G_{R}(0,\dots,0)=0$ and by adding a constant to $\Gamma$ we can suppose that $\Gamma(0,\dots,0)=0$ .

The sum in (3.34) is thus

[TABLE]

Since $\alpha_{j}\to 0,\Gamma(0,\dots,0)=0$ and $\Gamma$ is continuous, the last term goes to [math] as $N\to\infty$ . Since $G_{R}\geq 0$ , the sum has a limit (which may be $+\infty$ . By (3.22) and $I_{L}(\mu)\to I(\mu)<\infty$ , we see that the sum is bounded, hence convergent. ∎

Finally, we turn to the abstract sum rule. For any ${\alpha}\in{\mathbb{D}}^{\infty}$ define

[TABLE]

$S$ may be infinite if the limit is.

Theorem 3.6 (Abstract Sum Rule).

Under the hypothesis of Theorem 3.5, for any $\mu$ with infinite support

[TABLE]

Remark.

Basically, on the basis of (3.24), one expects that the rate function is $S({\alpha}(\mu))+c$ where $c$ is a constant coming from the $N$ th root of the denominator in (3.24). Given that $I(\eta)=0$ , the constant has to be $c=-S({\alpha}(\eta))$ .

Proof.

We begin with a formula like (3.26) but with two changes. First, rather than look at $\pi^{*}_{L}({\widetilde{{\mathbb{P}}_{N}}})[W]$ for a single $W$ , we look at a ratio

[TABLE]

for two open sets $W,W_{1}$ in ${\mathbb{D}}^{L}$ so we needn’t concern ourselves with the normalization integral over all of ${\mathbb{D}}^{L}$ but can focus on small sets where we have control over the $\alpha$ ’s.

Secondly, we don’t drop all of the monomials in those $G$ terms for which $\{j_{1},\dots,j_{d}\}$ intersects both $\{0,\dots,L-1\}$ and $\{L,L+1,\dots\}$ . We keep those monomials which only have $\alpha_{L},\alpha_{L+1},\dots$ . Thus the dropped terms all have a factor of some $\alpha_{j}$ with $j\in\{L-d,L-d+1,\dots,L-1\}$ . What results is that one obtains (still using $\widetilde{{\mathbb{P}}}_{N,L}$ for the probability with the, now slightly different, dropped terms):

[TABLE]

where

[TABLE]

for some constant $K$ because the dropped terms, being polynomials that are not of degree zero in all the $\alpha_{j}$ ’s, are at least linear in some $\alpha_{j}$ .

Note that because of lower semicontinuity of $I_{L}$ , for any $\mu_{0}$ ,

[TABLE]

where $W$ runs over all open neighborhoods of $\mu_{0}$ ordered by inverse inclusion. Moreover, because $I_{L}$ is continuous, one has that

[TABLE]

Thus taking $N\to\infty$ in (3.40) and shrinking the open sets to two measures, $\mu$ and $\nu$ , we get from (3.37) that

[TABLE]

where $S_{L}$ is the sum in (3.34) when the infinite sum is replaced by the sum to $L-1-d$ .

When $H(\eta|\mu)=\infty$ , we’ve already proven (3.38) so suppose $H(\eta|\mu)<\infty$ . Then the density of the absolutely continuous part of $\mu$ with respect to $d\theta$ is a.e. non–vanishing, so, by Rakhmanov’s Theorem, $\alpha_{j}(\mu)\to 0$ . Take $\nu=\eta$ so also, $\alpha_{j}(\eta)\to 0$ . Thus the right side of (3.41) goes to zero and we find that

[TABLE]

proving (3.38). ∎

4. The (1,0) Case

In this section, we’ll consider the case of a single singularity of order 1 and recover the sum rule of Simon (1.6). The calculations are so simple, we need not make the simplifying assumption that $\bar{\alpha}_{j}=\alpha_{j}$ that we’ll make in the later sections.

The normalized empirical measure is

[TABLE]

so, by (3.4) and (3.7)

[TABLE]

and

[TABLE]

In the CMV basis, $U_{jj}=-\alpha_{j-1}\bar{\alpha}_{j}$ where $\alpha_{-1}\equiv-1$ . Thus, the Verblunsky side of the sum rule is

[TABLE]

for a suitable constant, $C$ .

In (4.4), the sum rule involves limits of finite $N$ objects so here and below, sums should involve finite matrices and finite sums. But, as we explained above we are interested in the limits of such finite sums. So we’ll write sums up to infinity indicating what one will get after taking $N\to\infty$ at the end of the calculation.

Since

[TABLE]

we can rewrite (4.4) as (changed $C$ )

[TABLE]

That in this form the constant is $C=\frac{1}{2}-\log(2)$ follows from the requirement that this vanish if ${\alpha}={\alpha}(\eta)$ and the calculations in Section 3 that (3.10) is [math]. Thus, we have a LD proof of (1.6).

To get the gem (1.7), we need the $M=1$ case of

Proposition 4.1.

For any ${\alpha}$

[TABLE]

if and only if

[TABLE]

Remark.

Since

[TABLE]

for any $|\alpha|<1$ , the summand in (4.6) is non–negative so the sum either converges or diverges to $+\infty$ .

Proof.

By (4.8), we have that

[TABLE]

By (4.9), we have that (4.6) $\Rightarrow$ (4.7). On the other hand, if (4.7) holds, then $|\alpha_{j}|\to 0$ so, for all large $j$ , $|\alpha_{j}|^{2}\leq\frac{1}{2}$ so we can apply (4.10) to the tail of the sum in (4.6) and conclude that (4.7) $\Rightarrow$ (4.6) ∎

We thus have a quick proof of the gem of Simon [22, Section 2.8]:

Theorem 4.2.

With $w$ as in (1.5), $\int_{0}^{2\pi}(1-\cos\theta)\log w(\theta)\tfrac{d\theta}{2\pi}>-\infty$ if and only if

[TABLE]

5. The (1,1) Case

In terms of (1.8), this section will consider gems where the measure side is

[TABLE]

To figure out the normalization, we note that

[TABLE]

One can also figure this out by noting that the extreme sides of (5.2) are degree 2 Laurent polynomials in $e^{i\theta}$ vanishing at $\theta=0,\pi$ to second order with maximum $1$ on $\partial{\mathbb{D}}$ . For later use, we note that the same argument shows that for $k=1,2,\dots$

[TABLE]

for a constant $K_{k}$ .

Since $\int\cos(k\theta)d\theta=0$ , we see that the normalized $d\eta$ is

[TABLE]

so by (3.4) and (3.7), we have that

[TABLE]

We discussed $k=1$ in Section 4, we’ll discuss $k=2$ in this section (thereby recovering, using large deviations, a special case of a result of Simon–Zlatǒs [27]) and general $k$ in Section 7. Thus in this section, we’ll prove

Theorem 5.1.

Let ${\alpha}$ be real. Then

[TABLE]

Note that ${\text{\rm{Tr}}(V(U))}=\tfrac{1}{4}\text{\rm{Tr}}(U^{2}+U^{-2})=\tfrac{1}{2}\text{\rm{Tr}}(U^{2})$ if ${\alpha}$ is real. For such ${\alpha}$ , the CMV matrix has the form for $j\geq 1$ (see [22, eqn(4.2.14)]):

[TABLE]

There are also matrix elements that are two off–diagonal, but if $U_{j,j\pm 2}\neq 0$ , then $U_{j\pm 2,j}=0$ , so these terms don’t contribute to $\text{\rm{Tr}}(U^{2})$ (this is also clear from the ${\mathcal{L}}{\mathcal{M}}$ factorization and from the GGT representation). Thus

[TABLE]

where bdy is short for boundary and refers to some finite number of terms involving small indices (and, later, when it appears with a finite sum, involving finitely many terms involving large indices with the number of terms bounded as the upper index of the sum changes).

Therefore, using $\rho_{j}^{2}=1-\alpha_{j}^{2}$ and $G(\alpha_{j},\alpha_{j+1},\alpha_{j+2})=\alpha_{j+1}^{2}\alpha_{j}^{2}-2\rho_{j+1}^{2}\alpha_{j}\alpha_{j+2}$ , we see after some algebraic manipulations that the Verblunsky side of the gem, see (3.34), is

[TABLE]

We claim that up to boundary terms

[TABLE]

Accepting this for a moment, we can show that the conditions of Simon and Lukic (which agree in this case)

[TABLE]

imply the measure condition, that is (given the gem) that $\textrm{(S1-2)}\Rightarrow{\mathcal{I}}_{2}<\infty,{\mathcal{I}}_{4}<\infty,{\mathcal{L}}_{6}<\infty$ . For clearly, by (S1), (5.13) $<\infty$ and, by Hölder’s inequality, ${\mathcal{I}}_{4}$ is bounded by $C\lVert\alpha\rVert_{4}^{4}$ . Since $\lVert\alpha\rVert_{6}\leq\lVert\alpha\rVert_{4}$ (on account of $|\alpha|\leq 1$ ), ${\mathcal{L}}_{6}$ is finite by Proposition 4.1 with $M=2$ .

To see (5.13), define for $\ell=0,1,2,\dots$

[TABLE]

Proposition 5.2.

Let ${\mathcal{T}}_{1}$ and ${\mathcal{T}}_{2}$ be two functions on sequences of real $\alpha$ which are boundary terms plus a sum of the form (3.31) where $G$ is a quadratic function of its variables (i.e. a second degree homogeneous polynomial). Suppose for some $L$ , ${\mathcal{T}}_{r}$ has no terms of the form $\alpha_{j}\alpha_{j+\ell}$ with $\ell>L$ . Then up to boundary terms, each ${\mathcal{T}}_{r}$ is a linear combination of $\{{\mathcal{P}}_{\ell}\}_{\ell=0}^{L}$ and ${\mathcal{T}}_{1}={\mathcal{T}}_{2}$ up to boundary terms if they are the same linear combinations.

Remarks.

This result is obvious. More subtle is the fact that “if” in the last sentence can be replaced by “if and only if” but we won’t need that harder half of this.
Again, what is being stated involves limits of finite sums. The equalities only hold up to finite boundary terms. There are also boundary terms at the upper limit but those go to zero by Rakhmanov’s Theorem. One infinite sum converges if and only if the other one does.

Corollary 5.3.

${\mathcal{I}}_{2}$ * of (5.10) is given by (5.13) up to constants.*

Proof.

The RHS of (5.10) is, up to boundary terms ${\mathcal{P}}_{0}-{\mathcal{P}}_{2}$ . Expanding the square, the RHS of (5.13) is $\tfrac{1}{2}({\mathcal{P}}_{0}-2{\mathcal{P}}_{2}+{\mathcal{P}}_{0})={\mathcal{P}}_{0}-{\mathcal{P}}_{2}$ . ∎

Proof of Theorem 5.1.

We’ve already proven that (S1-2) imply that the integral in (5.6) is finite. So we need to go in the opposite direction. Therefore, we suppose the integral is finite.

By the abstract gems discussed in Section 3, we know that ${\mathcal{I}}_{2}+{\mathcal{I}}_{4}+{\mathcal{L}}_{6}$ is finite (in that the cutoff sums are uniformly bounded) with ${\mathcal{I}}_{2}$ given by (5.13). In this form ${\mathcal{I}}_{2}$ is positive and so is ${\mathcal{L}}_{6}$ as noted in the remark after Proposition 4.1. So we look at ${\mathcal{I}}_{4}$ which we write up to boundary terms as

[TABLE]

By Hölder’s inequality, $\sum_{j=1}^{J}|\alpha_{j-1}\alpha_{j}^{2}\alpha_{j+1}|\leq\sum_{j=0}^{J+1}|\alpha_{j}|^{4}$ , so up to boundary terms, ${\mathcal{I}}_{42}$ is positive and thus ${\mathcal{I}}_{2}+{\mathcal{I}}_{41}+{\mathcal{L}}_{6}$ is finite. Since each term is positive, they are all finite, i.e. ${\mathcal{I}}_{2}<\infty$ and ${\mathcal{I}}_{41}<\infty$ . ${\mathcal{I}}_{2}<\infty$ is (S1).

${\mathcal{I}}_{2}<\infty$ and $|\alpha_{j}|<1\Rightarrow\sum_{j=1}^{\infty}\alpha_{j}^{2}(\alpha_{j+1}-\alpha_{j-1})^{2}<\infty$ . ${\mathcal{I}}_{41}<\infty$ means $\sum_{j=1}^{\infty}\alpha_{j}^{2}(\alpha_{j+1}+\alpha_{j-1})^{2}<\infty$ . Since $(x+y)^{2}+(x-y)^{2}=2(x^{2}+y^{2})$ , we conclude that

[TABLE]

Since $|\alpha_{j-1}\alpha_{j}^{2}\alpha_{j+1}|\leq\tfrac{1}{2}(\alpha_{j-1}^{2}\alpha_{j}^{2}+\alpha_{j}^{2}\alpha_{j+1}^{2})$ , we see that $\sum_{j=1}^{\infty}|\alpha_{j-1}\alpha_{j}^{2}\alpha_{j+1}|<\infty$ . All the other terms in ${\mathcal{I}}_{2}+{\mathcal{I}}_{4}+{{\mathcal{L}}}_{6}$ are positive, so all are finite. In particular, $\tfrac{1}{2}\sum_{j=1}^{\infty}\alpha_{j}^{4}<\infty$ which is (S2). ∎

In particular, we see that the proof from Lukic conditions to convergence of the integral is much easier than the converse.

6. The (2,0) Case

Our goal in this section is to prove:

Theorem 6.1.

Let $\mu$ be as in (1.5) and assume that its Verblunsky sequence $\alpha$ is real. Then

[TABLE]

if and only if

[TABLE]

and

[TABLE]

Remarks.

In this case the Lukic and Simon conditions agree.
This result, indeed without the reality restriction is in Simon–Zlatǒs [27]. The main difference in our approach is the method of deriving the sum rules. Once one has the sum rules the arguments are related but we feel our presentation is more transparent.

To begin we need to normalize $\eta$ , i.e. determine $Z$ so that $Z^{-1}\int(1-\cos\theta)^{2}w(\theta)\frac{d\theta}{2\pi}=1$ . We’ll use

[TABLE]

as one can see by expanding the square or by using $1-\cos\theta=2\sin^{2}(\theta/2)$ . Thus

[TABLE]

Since $\int\cos(k\theta)\tfrac{d\theta}{2\pi}=\delta_{k0}$ , we see that

[TABLE]

so by (3.4) and (3.7)

[TABLE]

and thus when ${\alpha}$ is real

[TABLE]

We computed $\text{\rm{Tr}}(U)$ in (4.4) and $\text{\rm{Tr}}(U^{2})$ in (5.8). Thus the Verblunsky side of the sum rule, see (3.34), is

[TABLE]

In terms of the quantities ${\mathcal{P}}_{\ell}$ of (5.16)

[TABLE]

up to boundary terms. On the other hand, expanding the square, we see that up to boundary terms

[TABLE]

since $1+4+1=6$ and $4+4=8$ . Thus we see that up to boundary terms

[TABLE]

Proof of half of Theorem 6.1 that (6.1) $\Rightarrow$ (S1),(S2).

Since $\tfrac{1}{6}+\tfrac{1}{3}=\tfrac{1}{2}$ , up to boundary terms, ${\mathcal{I}}_{4}\geq 0$ by Hölder’s inequality. By the abstract sum rule, $\eqref{6.1}\Rightarrow{\mathcal{I}}_{2}+{\mathcal{I}}_{4}+{\mathcal{I}}_{6}$ is finite. Since each of these terms is positive (by (6.15) and Proposition 4.1), each is individually finite. ${\mathcal{I}}_{2}<\infty\Rightarrow$ (S1) and, by Proposition 4.1, ${\mathcal{L}}_{6}<\infty\Rightarrow$ (S2). ∎

Proof of Other Half of Theorem 6.1 that (S1),(S2) $\Rightarrow$ (6.1).

Clearly (S1) $\Rightarrow{\mathcal{I}}_{2}<\infty$ and (S2) ${\Rightarrow{\mathcal{L}}_{6}<\infty}$ by Proposition 4.1, so we need only control ${\mathcal{I}}_{4}$ . Hölder lets one control $\sum\kappa^{(1)}_{n}\kappa^{(2)}_{n}\kappa^{(3)}_{n}\kappa^{(4)}_{n}$ if ${\lVert\kappa^{(j)}\rVert_{p_{j}}}<\infty$ and $\tfrac{1}{p_{1}}+\tfrac{1}{p_{2}}+\tfrac{1}{p_{3}}+\tfrac{1}{p_{4}}\geq 1$ . Since $\tfrac{4}{6}<1$ , we can’t just look at products of four $\alpha$ ’s. However since $\tfrac{1}{6}+\tfrac{1}{6}+\tfrac{1}{6}+\tfrac{1}{2}=1$ , we can control products of three $\alpha$ ’s and one $(S-1)^{2}\alpha$ . By the Gagliardo-Nirenberg inequality, (2.12), (S1)+(S2) $\Rightarrow(S-1)\alpha\in\ell^{3}$ . Since $\tfrac{1}{6}+\tfrac{1}{6}+\tfrac{1}{3}+\tfrac{1}{3}=1$ , a product of two $\alpha$ ’s and two $(S-1)\alpha$ is also summable. So the goal is to write ${\mathcal{I}}_{4}$ as sums of these two terms. We write

[TABLE]

The ${\mathcal{I}}_{42}$ term is a sum of products of two $(S-1)\alpha$ terms and two $\alpha$ terms so by the above, it is a convergent sum by (S1),(S2). Let $\eta_{n}=\alpha_{n+1}-\alpha_{n}$ so

[TABLE]

and thus

[TABLE]

We’ve already seen that the sum in ${\mathcal{I}}_{42}$ is absolutely convergent. By (6.21), ${\mathcal{I}}_{41}$ is a sum of $(\ell^{6})^{2}(\ell^{3})^{2}$ and $(\ell^{6})^{3}\ell^{2}$ terms and so a convergent sum. Thus ${\mathcal{I}}_{4}<\infty$ . ∎

7. The $k$ th Roots of Unity Case

Fix $k\in\{1,2,3,\dots\}$ . In this section, we’ll consider the conditions

[TABLE]

By (5.3), this is the same as taking $\theta_{j}=\tfrac{2(j-1)\pi}{k},\,j=1,\dots,k$ so $\{e^{i\theta_{j}}\}_{j=1}^{k}$ are the $k$ th roots of unity. Of course, if $\omega=e^{i\theta_{2}}$ is a primitive $k$ th root of unity, then $S^{k}-1=\prod_{j=1}^{k}(S-\omega^{j})$ , so (7.2)/(7.3) are precisely the Simon (=Lukic) conditions for this case. In this section, we’ll prove

Theorem 7.1.

Suppose ${\alpha}$ obeys (S2). Then

[TABLE]

In particular, (S1-2) $\Rightarrow$ (7.1).

Remarks.

${\alpha}$ need not be assumed real.
This is a special case of a result of Golinskii–Zlatoš [12].

The key input to proving this will be

Proposition 7.2.

If $\text{\rm{Tr}}(U^{k})$ is written in terms of $\alpha$ ’s only, the term quadratic in $\alpha$ is

[TABLE]

Remark.

This proof will rely on the CMV representation of unitaries. It is an interesting exercise to give a different proof using the GGT representation and ideas of Section 9.

Proof.

By (3.21), $\text{\rm{Tr}}(U^{k})$ is a homogeneous polynomial of degree $2k$ in $\alpha,\bar{\alpha}$ and $\rho$ . To be left with quadratic terms after using $\rho^{2}=1-\bar{\alpha}\alpha$ , we need products with $2k-2\,\rho$ ’s and two of $\alpha$ and/or $\bar{\alpha}$ .

As the end of the proof of theorem 3.2 explains, one gets strings of increasing or decreasing $\rho$ ’s and $\alpha$ or $\bar{\alpha}$ at turn around points. The $2k-2\,\rho$ ’s must occur in a string of $k-1$ increasing and a second string of $k-1$ decreasing $\rho$ ’s. The form, (3.15), of $\Theta$ shows we get $-\alpha$ at the bottom turn around and $\bar{\alpha}$ at the top turn around, so the only quadratic terms are $(-\alpha_{n})\prod_{j=1}^{k-1}\rho_{n+j}(\bar{\alpha}_{n+k})$ .

Each diagonal matrix element $({\mathcal{C}}^{k})_{jj}$ has such a term for $j=n+1,n+2,\dots,n+k$ , so $k$ in all which yields (7.4). ∎

Proposition 7.3.

The quadratic term in the sum rule, (3.29), for (7.1) with $|\alpha_{n}|^{2}$ “borrowed” from $-\log(1-|\alpha_{n}|^{2})$ is (up to a boundary term)

[TABLE]

Proof.

The normalized $\eta$ is $(1-\cos k\theta)\tfrac{d\theta}{2\pi}$ , so by (5.5), the potential is $\tfrac{1}{k}\cos k\theta$ and $Q$ is

[TABLE]

Thus, since $k$ in (7.4) cancels the $k^{-1}$ in (7.6), the quadratic term including the borrowed $|\alpha_{n}|^{2}$ is

[TABLE]

which is (7.5). ∎

Proof of Proposition 7.3.

The Verblunsky side of the sum rule associated to (7.1) has quadratic term (7.5) and a remainder that is finite if ${\alpha}\in\ell^{4}$ . Thus the equivalence is immediate. ∎

8. Single $k$ th Order Singularity

We are interested here in measures which obey

[TABLE]

Here the Simon–Lukic conditions are

[TABLE]

Our main goal is to prove that

Theorem 8.1.

Suppose ${\alpha}\in\ell^{4}$ . Then

[TABLE]

Remark.

This is a special case of a result of Golinskii–Zlatoš [12].

To put this in perspective, we note that Lukic [19] has proven

Theorem 8.2 ([19]).

Suppose $(S-1){\alpha}\in\ell^{2}$ . Then

[TABLE]

These two extreme cases are consistent with $\eqref{6.6.1}\iff$ (S1-2) and suggest its truth.

The key to our proof will be to show that the quadratic term in the sum rule is $c_{k}\lVert(S-1)^{k}\alpha\rVert_{2}^{2}$ for an explicit $c_{k}$ . We’ve seen that $c_{1}=\tfrac{1}{2}$ (4.5) and $c_{2}=\tfrac{1}{6}$ (6.15). The reader might stop and try to figure out the general formula.

By (6.4)

[TABLE]

Thus the normalized $\eta$ is

[TABLE]

where

[TABLE]

Using the binomial expansion $0=(1-1)^{2k}$ , we have that

[TABLE]

Therefore, we may rewrite (8.5) as

[TABLE]

It follows from (5.5) that

[TABLE]

Recalling that we need to borrow $|\alpha_{n}|^{2}$ from $-\log(1-|\alpha_{n}|^{2})$ , and that the quadratic term in $\text{\rm{Tr}}(U^{\ell}+\bar{U}^{\ell})$ equals $-\ell\sum_{n=0}^{\infty}(\alpha_{n}\bar{\alpha}_{n+\ell}+\bar{\alpha}_{n}\alpha_{n+\ell})$ up to boundary terms, we see that the quadratic term in the sum rule is

[TABLE]

where now, instead of (5.16)

[TABLE]

On the other hand,

[TABLE]

where we use the fact a $\alpha_{n+j_{1}}\bar{\alpha}_{n+j_{2}}$ term will contribute to ${\mathcal{P}}_{\ell}$ if $\ell=|j_{1}-j_{2}|$ .

Proposition 8.3.

For any $k=0,1,2,\dots$ and $\ell=0,\dots,k$ , we have that

[TABLE]

Proof.

To pick $k-\ell$ elements from among $2k$ numbered objects, we can pick $j$ from the first $k$ and $k-\ell-j$ from the second. Thus

[TABLE]

Since $\binom{p}{q}=\binom{p}{p-q}$ , we have that $\binom{k}{k-\ell-j}=\binom{k}{\ell+j}$ and $\binom{2k}{k-\ell}=\binom{2k}{k+\ell}$ . We thus get (8.12). ∎

Proof of Theorem 8.1.

Picking $j-k=\ell$ in (8.9), we see that

[TABLE]

which by (8.11),(8.12) and (8.6) equals $c_{k}\lVert(S-1)^{k}\alpha\rVert_{2}^{2}$ . When $\alpha\in\ell^{4}$ , by Hölder’s inequality, all terms in the sum rule but the quadratic are finite. So the Verblunsky side of the sum rule is finite if and only if $\lVert(S-1)^{k}\alpha\rVert_{2}^{2}<\infty$ . By the sum rule, we conclude the result. ∎

9. The (2,1) Case

Our main result in this section is half the Lukic conjecture in the $(2,1)$ case, specifically:

Theorem 9.1.

Let $\mu$ be a probability measure on $\partial{\mathbb{D}}$ of the form (1.5) with real Verblunsky coefficients $\{\alpha_{j}\}_{j=0}^{\infty}$ obeying (1.16)–(1.18). Then the integral on the left side of (1.12) is finite.

Remark.

As noted, this is important because there are examples where Simon’s conditions (i.e. (1.16) and (1.18) without (1.17)) hold, but the integral in (1.12) is $-\infty$ .

We’ll compute the sum rule guaranteed by Section 3 to say ${\mathcal{I}}_{2}+{\mathcal{I}}_{4}+{\mathcal{I}}_{6}+{\mathcal{L}}_{8}<\infty\iff$ the integral in (1.13) is finite, see (9.29)-(9.34) for notation. Then we’ll show that (1.16)–(1.18) $\Rightarrow{\mathcal{I}}_{2}<\infty,\,{\mathcal{I}}_{4}<\infty,\,{\mathcal{I}}_{6}<\infty,\,{\mathcal{L}}_{8}<\infty$ . We start by computing the potential, $V$ , of (3.4) for the $(2,1)$ case. As noted (see (5.2)), we have that

[TABLE]

Similarly

[TABLE]

Thus

[TABLE]

by (9.1)–(9.2). Thus, since $\int\cos k\theta\tfrac{d\theta}{2\pi}=\delta_{k0}$ , the normalized $d\eta$ is

[TABLE]

Using (3.4) and (3.7), we conclude that

[TABLE]

so that if $\alpha$ is real then

[TABLE]

In earlier sections, we used the CMV matrix representation to compute $\text{\rm{Tr}}(U)$ and $\text{\rm{Tr}}(U^{2})$ . While initially we computed $\text{\rm{Tr}}(U^{3})$ in this way also, we realized the calculations are simpler in the GGT matrix representation. (GGT and CMV representations are discussed in Section 4.1 and 4.2 of Simon [22].) This is given by

[TABLE]

The explicit calculation is (Simon [22, (4.15)])

[TABLE]

In [22], this is calculated using $\langle\Phi_{n}^{*},P\rangle=\lVert\Phi_{n}\rVert^{2}P(0)$ if $\deg P\leq n$ . An easier alternative is to use the Szegő recursion ([22, (1.5.25)]) and inverse Szegő recursion ([22, 1.5.46])

[TABLE]

so

[TABLE]

which upon iterating yields

[TABLE]

with ${\mathcal{G}}$ given by (9.9).

When dealing with the GGT representation, it can be an issue that $\{\varphi_{n}\}_{n=0}^{\infty}$ is not a basis but the calculations need only be done for finite matrices where the OPs are a basis (or one can use the extended GGT basis of [22, Section 4.1] noting that diagonal matrix elements of ${\mathcal{G}}^{q}$ in the extra basis elements are zero).

Define ${\mathcal{G}}^{(\ell)}$ to be the $\ell$ th diagonal of ${\mathcal{G}}$ so ( $j,k=0,\dots,n-1$ )

[TABLE]

Of course, only ${\mathcal{G}}^{(\ell_{1})}\dots\mathcal{G}^{(\ell_{q})}$ with $\sum_{m=1}^{q}\ell_{m}=0$ have non–zero main diagonal and so if we expand ${\mathcal{G}}^{q}$ using (9.16), only those terms contribute to $\text{\rm{Tr}}({\mathcal{G}}^{q})$ so

[TABLE]

We can now understand why calculations are easier with the GGT than CMV matrix. In (9.17), the sums start at $\ell_{m}=-1$ while in the analog for CMV, we start at $\ell_{m}=-2$ , so at least for $q$ not too large, there are fewer terms with GGT. Moreover, the form of (9.14)–(9.15) is covariant under translation along the diagonal while the CMV matrix diagonals have an even–odd structure.

For $q=1$ , we must have $\ell_{1}=0$ and for $q=2$ , we have $(\ell_{1},\ell_{2})=(0,0),(1,-1)$ or $(-1,1)$ . Moreover, by cyclicity of the trace, the $(1,-1)$ and $(-1,1)$ terms are equal, i.e. $\text{\rm{Tr}}({\mathcal{G}}^{2})=\text{\rm{Tr}}\left(({\mathcal{G}}^{(0)})^{2}\right)+2\text{\rm{Tr}}\left({\mathcal{G}}^{(1)}{\mathcal{G}}^{(-1)}\right)$ . We thus recover (4.4) and (5.9) when $\alpha$ is real, that is up to boundary terms:

[TABLE]

For $\text{\rm{Tr}}({\mathcal{G}}^{3})$ , we have up to cyclic permutations, $(\ell_{1},\ell_{2},\ell_{3})=(0,0,0)$ (once), $(2,-1,-1),(0,1,-1),(0,-1,1)$ (each three times). Thus up to boundary terms:

[TABLE]

We also write

[TABLE]

so the coefficient side of the sum rule is ${\mathcal{I}}_{2}+{\mathcal{I}}_{4}+{\mathcal{I}}_{6}+{\mathcal{L}}_{8}$ where

[TABLE]

where we use the fact that adding a constant to all indices in a sum only changes the sum by a boundary term.

We start with ${\mathcal{I}}_{2}$ by using Proposition 5.2. In terms of the ${\mathcal{P}}_{j}$ of (5.16), up to boundary terms

[TABLE]

by (9.30). On the other hand, by the same calculation that gave (9.3), $(S-1)^{2}(S+1)=S^{3}-S^{2}-S+1$ so

[TABLE]

Thus

[TABLE]

We conclude by Proposition 5.2 that up to boundary terms

[TABLE]

and thus

[TABLE]

By Hölder’s inequality

[TABLE]

By Proposition 4.1

[TABLE]

Thus, we need to focus on ${\mathcal{I}}_{4}$ . Let $\beta\equiv(S+1)\alpha$ . By Theorem 2.6 we have that

[TABLE]

Here is the key first step:

Proposition 9.2.

(a) For any $m_{1},{m_{2},m_{3}},m_{4}$ , we have that

[TABLE]

(b) For any $m_{1},{m_{2},m_{3}},m_{4}$ , we have that

[TABLE]

(c) For any $m_{1},{m_{2},m_{3}},m_{4}$ , we have that

[TABLE]

is conditionally convergent.

Remarks.

We only need conditional summability so, since $\gamma_{j}=\alpha_{j+2}-\alpha_{j}$ , (c) implies the conditional summability of the sum in (9.43) without the $|\cdot|$ . However, we use (a) in the proof of (c).
To avoid having to worry about boundary terms at [math], we extend all sequences to $-\infty$ by setting $\alpha_{n}=0$ for $n\leq-1$ . This doesn’t effect conditional convergence of any sums. Since $\alpha\in\ell^{6}$ , all of $\alpha,\beta,\gamma$ go to zero as $n\to\pm\infty$ .

Proof.

(a) $\tfrac{1}{6}+\tfrac{1}{6}+\tfrac{1}{3}+\tfrac{1}{3}=1$ , so since $\alpha\in\ell^{6},\,\gamma\in\ell^{3}$ , Hölder’s inequality implies (9.43).

(b) $\tfrac{1}{6}+\tfrac{1}{6}+\tfrac{1}{6}+\tfrac{1}{2}=1$ , so since $\alpha\in\ell^{6},(S-1)\gamma\in\ell^{2}$ , Hölder’s inequality implies (9.44).

(c) The intuition is simple. The continuum analog is that if $f$ is $C^{1}$ on ${\mathbb{R}}$ , $f(x)\to 0$ as $|x|\to\infty$ , then $\int_{-R_{1}}^{R_{2}}f(x)^{3}f^{\prime}(x)dx={\frac{1}{4}}\int_{-R_{1}}^{R_{2}}[f^{4}]^{\prime}(x)dx$ has a zero limit. The sum in (9.45) is a discrete analog so the key will be a summation by parts.

Since we’ll be summing by parts, we need to know the appropriate discrete Leibniz rule. Let $p\in{\mathbb{Z}}\setminus\{0\}$ and $D=S^{p}-1$ so $(Da)_{n}=a_{n+p}-a_{n}$ . Then

[TABLE]

or $D(ab)=a(Db)+(Da)S^{p}b$ . By induction, one sees that

[TABLE]

Consider the sum in (9.45) first if $m_{1}=m_{2}=m_{3}=m_{4}=0$ . Let $D=S^{2}-1$ . By (9.47)

[TABLE]

Given two sequences, $\kappa$ and $\eta$ , write $\kappa\stackrel{{\scriptstyle.}}{{=}}\eta$ to mean $\kappa-\eta\in\ell^{1}$ . In (9.48), $D\alpha=\gamma$ so if we write $S^{2}\alpha=\alpha+\gamma$ , the $\gamma$ term produces products of two $\alpha$ ’s and two $\gamma$ ’s, so in $\ell^{1}$ by (a). Thus

[TABLE]

The conditional sum of $D(\alpha^{4})$ is finite and indeed zero since $\alpha\in\ell^{6}$ and

[TABLE]

Thus $4\gamma\alpha^{3}$ is conditionally summable.

Consider next the case $m_{1}=m_{2}=1,m_{3}=m_{4}=0$ . By (9.47) and the same argument that led to (9.49)

[TABLE]

since, as above, we can replace $\alpha$ by $S^{2}\alpha$ making an $\ell^{1}$ error in the four–fold product.

Telescoping as in (9.49), we have that $D((S\alpha)^{2}\alpha^{2})$ is conditionally summable. Note that whether a sequence is conditionally summable or not doesn’t change by a translation of index so we can replace $(S^{2}\alpha)^{2}(DS\alpha)S\alpha$ by $(S\alpha)^{2}\alpha(D\alpha)$ and conclude that

[TABLE]

is conditionally summable and thus $(S\alpha)^{2}\alpha D\alpha$ is conditionally summable proving the result when $m_{1}=m_{2}=1,m_{3}=m_{4}=0$ .

Now consider general $m_{j}$ . Since $(S-1)\gamma\in\ell^{2}$ , we can change $m_{4}$ to any value we want making an $\ell^{1}$ change. Similarly, by shifting by multiples of 2 units, we can change each of $m_{1},m_{2},m_{3}$ to [math] or $1$ . If they are all equal after this, set $m_{4}$ to the common value and get conditional convergence by the case (0,0,0,0). If the first three $m$ ’s have two equal and one unequal, set $m_{4}$ to the unequal value and get either (1,1,0,0) or (0,0,1,1). We’ve handled the first and by using the $S^{2}-1$ trick, (0,0,1,1) is the same as (0,0,-1,-1) and by covariance, that is the same as (1,1,0,0). ∎

Next, we recall the remarkable fact that if (1.16)+(1.18), then $(S-1)\alpha\in\ell^{4}\iff(S-1)^{2}\alpha\in\ell^{4}$ ! (see Theorem 2.6).

Proof of Theorem 9.1.

As we’ve seen, we need only show that ${\mathcal{I}}_{4}$ is conditionally convergent. We only used (1.16)+(1.18) so far, but not (1.17) which we’ll use in the form $(S-1)\alpha\in\ell^{4}$ .

We begin by noting that because of (c) of the last Proposition, $\sum\alpha_{j}^{3}(\alpha_{j+1}-\alpha_{j-1})$ is conditionally convergent. Using that index shifts modify sums only by boundary terms, we conclude that

[TABLE]

is conditionally convergent.

Since $[{(}(S-1)\alpha{)}_{j-1}]^{4}=[\alpha_{j}-\alpha_{j-1}]^{4}$ , using again that index shifts do not affect conditional convergence and (9.53), we see that $\lVert(S-1)\alpha\rVert_{4}^{4}<\infty$ implies that

[TABLE]

is conditionally convergent.

On the other hand, by (c) of the last Proposition, in (9.32) we can replace $\alpha_{j-2}$ by $\alpha_{j}$ and $\alpha_{j-3}$ by $\alpha_{j-1}$ without effecting conditional convergence. If we do that and use (9.53) again, we see that ${\mathcal{I}}_{4}$ is a conditionally convergent sum plus

[TABLE]

This is half the sum in (9.54) so (1.17) implies conditional convergence of the sum in $\widetilde{{\mathcal{I}}}_{4}$ . ∎

Bibliography30

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1]
2[2] G. Anderson, A. Guionnet and O. Zeitouni, An Introduction to Random Matrices , Cambridge University Press, 2010
3[3] G. Ben Arous and A. Guionnet, Large deviations for Wigner’s law and Voiculescu’s non-commutative entropy , Probab. Theory Rel. Fields, 108 (1997), 517–542.
4[4] J. Breuer, B. Simon and O. Zeitouni Large Deviations and Sum Rules for Spectral Theory – A Pedagogical Approach , J. Spec. Th., to appear
5[5] A. Dembo and O. Zeitouni Large Deviations Techniques and Applications , 2nd Edition, Springer, Berlin, 1998.
6[6] S. Denisov and S. Kupin, Asymptotics of the orthogonal polynomials for the Szegő class with a polynomial weight , J. Approx. Theory 139 (2006), 8–28.
7[7] J. Deuschel and D. Stroock, Large Deviations , Academic Press, Boston, 1989.
8[8] E. Gagliardo, Proprietà di alcune classi di funzioni in più variabili , Ric. Mat. 7 (1958), 102–137.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Large Deviations and the Lukic Conjecture

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

2. The Lukic Condition

Theorem 2.1**.**

Remark**.**

Proposition 2.2**.**

Corollary 2.3**.**

Proof.

Proof of Theorem 2.1.

Theorem 2.4**.**

Proof.

Theorem 2.5**.**

Remarks**.**

Proof.

Theorem 2.6**.**

Remarks**.**

Proof.

3. Sum Rules

Proposition 3.1**.**

Proof.

Theorem 3.2**.**

Remarks**.**

Proof.

Remark**.**

Theorem 3.3**.**

Remarks**.**

Proof.

Proposition 3.4**.**

Remark**.**

Theorem 3.5** (Abstract Gem).**

Remark**.**

Proof.

Theorem 3.6** (Abstract Sum Rule).**

Remark**.**

Proof.

4. The (1,0) Case

Proposition 4.1**.**

Remark**.**

Proof.

Theorem 4.2**.**

5. The (1,1) Case

Theorem 5.1**.**

Proposition 5.2**.**

Remarks**.**

Corollary 5.3**.**

Proof.

Proof of Theorem 5.1.

6. The (2,0) Case

Theorem 6.1**.**

Remarks**.**

Proof of half of Theorem 6.1 that (6.1) ⇒\Rightarrow⇒ (S1),(S2).

Proof of Other Half of Theorem 6.1 that (S1),(S2)⇒\Rightarrow⇒ (6.1).

7. The kkkth Roots of Unity Case

Theorem 7.1**.**

Remarks**.**

Proposition 7.2**.**

Remark**.**

Proof.

Proposition 7.3**.**

Proof.

Proof of Proposition 7.3.

8. Single kkkth Order Singularity

Theorem 8.1**.**

Remark**.**

Theorem 8.2** ([19]).**

Proposition 8.3**.**

Proof.

Proof of Theorem 8.1.

9. The (2,1) Case

Theorem 9.1**.**

Remark**.**

Theorem 2.1.

Remark.

Proposition 2.2.

Corollary 2.3.

Theorem 2.4.

Theorem 2.5.

Remarks.

Theorem 2.6.

Remarks.

Proposition 3.1.

Theorem 3.2.

Remarks.

Remark.

Theorem 3.3.

Remarks.

Proposition 3.4.

Remark.

Theorem 3.5 (Abstract Gem).

Remark.

Theorem 3.6 (Abstract Sum Rule).

Remark.

Proposition 4.1.

Remark.

Theorem 4.2.

Theorem 5.1.

Proposition 5.2.

Remarks.

Corollary 5.3.

Theorem 6.1.

Remarks.

Proof of half of Theorem 6.1 that (6.1) $\Rightarrow$ (S1),(S2).

Proof of Other Half of Theorem 6.1 that (S1),(S2) $\Rightarrow$ (6.1).

7. The $k$ th Roots of Unity Case

Theorem 7.1.

Remarks.

Proposition 7.2.

Remark.

Proposition 7.3.

8. Single $k$ th Order Singularity

Theorem 8.1.

Remark.

Theorem 8.2 ([19]).

Proposition 8.3.

Theorem 9.1.

Remark.

Proposition 9.2.

Remarks.