Model theory and metric convergence II: Averages of unitary polynomial actions

Eduardo Dueñez; José N. Iovino

arXiv:1812.01653·math.DS·June 20, 2019


TL;DR

This paper proves pointwise convergence of averages of polynomial sequences of unitary transformations in Hilbert spaces using model theory, extending to general abelian group actions and including a case study of the lamplighter group.

Contribution

It introduces a model-theoretic approach to establish convergence of polynomial unitary actions, covering Leibman sequences and generalizations to abelian groups.

Findings

(1) Proves pointwise convergence with uniform metastability rate for polynomial unitary averages.

(2) Extends convergence results to arbitrary Leibman sequences and actions of any abelian group.

(3) Demonstrates realization of the lamplighter group as a quadratic Leibman sequence.




Full text


Model theory and metric convergence II:

Averages of unitary polynomial actions

Eduardo Dueñez and José N. Iovino

Department of Mathematics

The University of Texas at San Antonio

One UTSA Circle

San Antonio, TX 78249-0664

U.S.A.


(Date: March 2, 2024)

Abstract.

We use model theory of metric structures to prove the pointwise convergence, with a uniform metastability rate, of averages of a polynomial sequence $\{T_{n}\}$ (in Leibman’s sense) of unitary transformations of a Hilbert space. As a special case, this applies to unitary sequences $\{U^{p(n)}\}$ where $p$ is a polynomial $\mathbb{Z}\to\mathbb{Z}$ and $U$ a fixed unitary operator; however, our convergence results hold for arbitrary Leibman sequences. As a case study, we show that the non-nilpotent “lamplighter group” $\mathbb{Z}\wr\mathbb{Z}$ is realized as the range of a suitable quadratic Leibman sequence. We also indicate how these convergence results generalize to arbitrary Følner averages of unitary polynomial actions of any abelian group $\mathbb{G}$ in place of $\mathbb{Z}$ .

Key words and phrases:

Mean Ergodic Theorem, PET induction, Leibman sequences, Henson structures

2010 Mathematics Subject Classification:

Primary: 37A30; Secondary: 03C98, 46Bxx, 28-xx

We thank Xavier Caicedo, Christopher Eagle and Franklin Tall for their encouragement and feedback, as well as the Banff International Research Station for hosting the June 2016 FRG “Topological Methods in Model Theory” where many ideas in the Appendix to this manuscript were first conceived.

This research was funded by NSF grant DMS-1500615.

Introduction

The first result on “mean” convergence of averages was von Neumann’s 1932 Mean Ergodic Theorem [vN32]:

Mean Ergodic Theorem (MET).

For any unitary operator $U$ on a Hilbert space $\mathcal{H}$ and any $x\in\mathcal{H}$ , the sequence $\operatorname{AV}_{\bullet}(x)=(\operatorname{AV}_{n}(x):n\in\mathbb{N})$ of pointwise averages

$$\operatorname{AV}_{n}(x)=\frac{1}{n}\sum_{i=1}^{n}U^{i}(x)$$

converges as $n\to\infty$ . The limit is equal to the orthogonal projection of $x$ on the space of vectors fixed by $U$ .
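As a finite-dimensional sanity check of the statement above, the following sketch takes $\mathcal{H}=\mathbb{R}^{3}$ and a hypothetical operator $U$ that fixes the first coordinate axis and rotates the orthogonal plane by an angle `theta`. The names `apply_U`, `average` and `distance`, as well as the chosen vector, are illustrative assumptions rather than objects from the paper. Under these assumptions the averages should approach the projection of $x$ onto the fixed axis.

```python
import math

def apply_U(v, theta):
    """Hypothetical unitary on R^3: fix the first axis, rotate the (y, z)-plane by theta."""
    x, y, z = v
    c, s = math.cos(theta), math.sin(theta)
    return (x, c * y - s * z, s * y + c * z)

def average(v, theta, n):
    """AV_n(v) = (1/n) * sum_{i=1}^{n} U^i(v), computed by iterating U."""
    total = [0.0, 0.0, 0.0]
    w = v
    for _ in range(n):
        w = apply_U(w, theta)  # w is now U^i(v)
        total = [t + wi for t, wi in zip(total, w)]
    return tuple(t / n for t in total)

def distance(u, v):
    """Euclidean distance between two vectors of R^3."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))

x = (2.0, 1.0, -3.0)
projection = (2.0, 0.0, 0.0)  # orthogonal projection of x onto the vectors fixed by U
for n in (10, 100, 1000, 10000):
    print(n, distance(average(x, 1.0, n), projection))
```

The printed distances should shrink roughly like $1/n$ in this example, because the rotating component averages out as a geometric sum.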

Historically, generalizations of von Neumann’s theorem have largely followed a path influenced by a measure-theoretic viewpoint that is completely absent from the formulation above as a statement about convergence in Hilbert spaces. We provide further historical background below. Leaving history and measure theory aside for the moment, one may suggest the following different possible directions of generalization for MET:

(1) Replace the sequence $(U^{i}:i\in\mathbb{N})$ with a “higher-degree” sequence $(U^{p(i)}:i\in\mathbb{N})$ where $p$ is a fixed polynomial.

(2) The sequence $(T_{i})=(U^{p(i)})$ above necessarily satisfies the commutativity condition $T_{i}\circ T_{j}=T_{j}\circ T_{i}$ for all $i,j$. To what extent can this commutativity requirement be removed?

(3) What conditions on a family $(T_{i})$ of unitary operators indexed by a semigroup other than $\mathbb{N}$ ensure the pointwise convergence of suitable averages?

Theorem 4 in this manuscript is arguably the most natural generalization of von Neumann’s result simultaneously in all three directions above. (For technical reasons, Theorem 4 is proved in the context of (polynomial) actions of groups rather than semigroups.) Theorem 1, stated below, is a very particular case of more general results (Theorems 2, 3 and 4). However, it is easiest to formulate and already generalizes MET all the way in direction (1) and beyond.

Theorem 1 (MET for abelian unitary polynomial actions of $\mathbb{Z}$ ).

Fix $d\in\mathbb{N}$. Let $\mathcal{H}$ be a Hilbert space, and let $U_{0},U_{1},\dots,U_{d}$ be pairwise-commuting unitary operators on $\mathcal{H}$. For every $x\in\mathcal{H}$, the sequence $\operatorname{AV}_{\bullet}(x)=(\operatorname{AV}_{n}(x):n\in\mathbb{N})$ of averages

$$\operatorname{AV}_{n}(x)=\frac{1}{n+1}\sum_{0\leq k\leq n}U_{0}\circ U_{1}^{k}\circ U_{2}^{\binom{k}{2}}\circ\dots\circ U_{d}^{\binom{k}{d}}(x)$$

converges as $n\to\infty$. (Here, $\binom{k}{j}=k(k-1)\cdots(k-j+1)/j!$ is the $j$-th binomial coefficient.)

In particular, if $p:\mathbb{Z}\to\mathbb{Z}$ is a polynomial of degree at most $d$ and $U$ is a unitary operator on $\mathcal{H}$, then $\left(\sum_{0\leq k\leq n}U^{p(k)}(x)/(n+1):n\in\mathbb{N}\right)$ converges.

Furthermore, there exists a universal metastability rate (depending only on $d$ ) that applies uniformly to all sequences of averages of arbitrary $x$ in the unit ball of an arbitrary Hilbert space $\mathcal{H}$ under arbitrary unitary operators $U_{0},U_{1},\dots,U_{d}$ on $\mathcal{H}$ .
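One standard way to see why the polynomial case is covered by the theorem is Newton's forward-difference formula: an integer-valued polynomial $p$ of degree at most $d$ can be written as $p(k)=\sum_{j}c_{j}\binom{k}{j}$ with integer coefficients $c_{j}=(\Delta^{j}p)(0)$, so that $U^{p(k)}$ has the required form with $U_{j}=U^{c_{j}}$. The sketch below computes these coefficients for an example polynomial; the helper names `binom` and `newton_coefficients` are illustrative and do not come from the paper.

```python
from math import factorial

def binom(k, j):
    """Generalized binomial coefficient k(k-1)...(k-j+1)/j! for any integer k and j >= 0."""
    numerator = 1
    for t in range(j):
        numerator *= k - t
    # Exact division: a product of j consecutive integers is a multiple of j!.
    return numerator // factorial(j)

def newton_coefficients(p, d):
    """Return [c_0, ..., c_d] with p(k) = sum_j c_j * binom(k, j), via forward differences at 0."""
    values = [p(k) for k in range(d + 1)]
    coeffs = []
    for _ in range(d + 1):
        coeffs.append(values[0])
        values = [b - a for a, b in zip(values, values[1:])]
    return coeffs

def p(k):
    return k * k  # example polynomial of degree d = 2

c = newton_coefficients(p, 2)
print(c)
# Check the identity on a range of integers, including negative ones.
print(all(p(k) == sum(cj * binom(k, j) for j, cj in enumerate(c)) for k in range(-10, 11)))
```

For $p(k)=k^{2}$ this gives $k^{2}=k+2\binom{k}{2}$, which corresponds to the choice $U_{0}=I$, $U_{1}=U$, $U_{2}=U^{2}$ in the theorem.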

The notion of uniformly metastable convergence above was first introduced in ergodic theory by Tao [Tao08, Tao12]. It is a main theme of our prior manuscript [DnI17], but plays only a minor role here.
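For orientation, metastability can be recalled in Tao's formulation (stated here as background; the precise definition used in the authors' prior manuscript may be phrased differently): a sequence $(a_{n})$ is metastable if for every $\varepsilon>0$ and every $F:\mathbb{N}\to\mathbb{N}$ there is some $N$ with $\left\|a_{m}-a_{n}\right\|\leq\varepsilon$ for all $m,n\in[N,F(N)]$. A metastability rate bounds such an $N$ in terms of $\varepsilon$ and $F$ only. The sketch below searches for a witness $N$ for real-valued sequences; the function name `metastability_witness` and the test sequences are hypothetical, and $F(N)\geq N$ is assumed.

```python
def metastability_witness(a, eps, F, max_N=10_000):
    """Least N < max_N with |a(m) - a(n)| <= eps for all m, n in [N, F(N)], or None.

    Assumes F(N) >= N, so that the window [N, F(N)] is nonempty.
    """
    for N in range(max_N):
        window = [a(n) for n in range(N, F(N) + 1)]
        if max(window) - min(window) <= eps:
            return N
    return None

def harmonic(n):
    return 1.0 / (n + 1)   # a convergent sequence

def alternating(n):
    return (-1) ** n       # a divergent sequence

print(metastability_witness(harmonic, 0.1, lambda N: 2 * N + 5))
print(metastability_witness(alternating, 0.5, lambda N: N + 1, max_N=1000))
```

The second search finds no witness, since every window $[N,N+1]$ contains both values $\pm 1$. For the trivial choice $F(N)=N$ every sequence passes, which is why the definition quantifies over all $F$.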

Taking a step in direction (2), pairwise commutativity is not a necessary assumption: the sequence of averages under a family $(T_{i})$ converges provided $i\mapsto T_{i}$ is a Leibman polynomial sequence in the group $\mathrm{U}_{\mathcal{H}}$ of unitary operators on $\mathcal{H}$ (Theorems 2 and 3), and the range of such a sequence need not generate an abelian group. The definition of Leibman polynomial sequence (Definition 2.1) is motivated by the familiar fact that polynomials $\mathbb{R}\to\mathbb{R}$ of degree at most $d$ are characterized as those functions whose $(d+1)$-fold iterated finite differences are equal to zero. The same essential definition gives the notion of Leibman polynomial mapping from an arbitrary group $\mathbb{G}$ into $\mathrm{U}_{\mathcal{H}}$ [Lei02]. Theorem 4 generalizes von Neumann’s result in direction (3) for Leibman polynomials $(T_{i}:i\in\mathbb{G})\subset\mathrm{U}_{\mathcal{H}}$ on abelian groups $\mathbb{G}$ endowed with a notion of averaging provided by a countable Følner net.
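The finite-difference idea can be tried out numerically in the simplest (abelian, one-dimensional) case. The sketch below assumes that the difference operator has the multiplicative form $(\Delta^{i}T)_{j}=T_{i+j}\cdot T_{j}^{*}$ and that a sequence is polynomial of degree at most $d$ when every $(d+1)$-fold iterated difference is the identity; Definition 2.1 is not reproduced in this section, so this reading is an assumption. Unitary operators are modeled by unit complex numbers (equivalently, rotations of the real plane), and all function names are illustrative.

```python
import cmath

def binom2(k):
    """Binomial coefficient k(k-1)/2, valid for every integer k."""
    return k * (k - 1) // 2

def make_sequence(a0, a1, a2):
    """T_k = u0 * u1^k * u2^binom(k,2) with u_j = exp(i * a_j), acting on a 1-dimensional space."""
    return lambda k: cmath.exp(1j * (a0 + a1 * k + a2 * binom2(k)))

def delta(i, T):
    """Finite difference (Delta^i T)_j = T_{i+j} * T_j^*; for scalars the adjoint is the conjugate."""
    return lambda j: T(i + j) * T(j).conjugate()

T = make_sequence(0.3, 1.1, 0.7)
steps = range(-3, 4)

# Three iterated differences of this quadratic sequence are the identity (the number 1) ...
third = max(abs(delta(i2, delta(i1, delta(i0, T)))(j) - 1)
            for i0 in steps for i1 in steps for i2 in steps for j in steps)
# ... while two iterated differences are generally not.
second = abs(delta(1, delta(1, T))(0) - 1)
print(third, second)
```

The first printed number should be at the level of floating-point rounding, while the second is visibly nonzero. The noncommutative situation treated in the paper is not captured by this scalar model.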

Continuing our historical remarks, the formulation of von Neumann’s result above hides its conceptual genesis via the study of convergence of averages of square-integrable functions $f\in\mathscr{L}^{2}(\Omega)$ on a probability space $(\Omega,\mu)$ under the action of a measure-preserving transformation $T$ of $\Omega$ . In this setting, MET asserts that the sequence $\operatorname{AV}_{\bullet}(f)$ of averages

$$\operatorname{AV}_{n}(f)=\frac{1}{n}\sum_{i=1}^{n}f\circ T^{i}$$

converges in $\mathscr{L}^{2}(\Omega)$ (after all, $f\mapsto f\circ T$ is a unitary transformation of $\mathscr{L}^{2}(\Omega)$ ). This particular case of von Neumann’s result explains why it is called a convergence result “in mean”, i.e., in the mean-square (“ $\mathscr{L}^{2}$ ”) sense. (By contrast, Birkhoff’s Ergodic Theorem asserts the almost-everywhere pointwise convergence of the averages $\operatorname{AV}_{n}(f)$ for any $f\in\mathscr{L}^{1}(\Omega)$ [Bir31].) The $\mathscr{L}^{2}$ setting entails no loss of generality since every Hilbert space $\mathcal{H}$ is realized as a space of square-integrable functions. However, this viewpoint is artificial for purposes of studying convergence under unitary actions (at least insofar as simple actions are concerned, in contrast to multiple actions mentioned below).

Although generalizations of MET in direction (1) seem very natural, we are not aware of direct proofs of Theorem 1, but only of indirect proofs as a byproduct of results on mean convergence of “multiple” ergodic averages. Starting in the 1970s, Furstenberg pioneered the ergodic study of actions of multiple simultaneous transformations; equivalently, the study of convergence of “multiple averages” of the product of two or more measurable bounded functions on a probability space $\Omega$ as acted upon by powers of measure-preserving transformations. As an application of multiple averages, Furstenberg obtained a purely ergodic proof of Szemerédi’s Theorem on the existence of arbitrarily long arithmetic progressions in positive-density subsets of the integers [Fur77, Sze75]. However, Furstenberg’s seminal results from the seventies did not extend von Neumann’s theorem in any of the directions (1)–(3). It was Bergelson who, in 1987, first extended some of Furstenberg’s results to multiple ergodic averages of (plus quam linear) polynomial powers of a fixed measure-preserving transformation acting on products of functions [Ber87]. When specialized to simple measure-preserving actions, Bergelson’s results are a step toward generalizing von Neumann’s MET in direction (1). However, there is no purely Hilbert-theoretical formulation of Bergelson’s weak mixing hypothesis: even the convergence of pointwise averages of $(U^{p(n)})$ stated in Theorem 1 only follows unconditionally from 2005 results for multiple ergodic averages of Host and Kra, and of Leibman (which depend on no mixing assumptions) [HK05, Lei05].

To our knowledge, Walsh’s theorem [Wal12] on mean convergence of nilpotent ergodic averages is the first result in the literature from which Theorem 1 follows as a corollary. (Pointwise convergence of averages of $(U^{n}\circ V^{n^{2}})$ under the assumption $U\circ V=V\circ U$ is a special case of 2009 results of Austin [Aus15a, Aus15b].) Thus, Walsh’s theorem actually implies the convergence of averages asserted in the more general Theorem 2, but only under the additional explicit hypothesis that $(T_{i})$ generates a nilpotent subgroup of $\mathrm{U}_{\mathcal{H}}$ . However, our methods do not require a nilpotence hypothesis, but only the more intrinsic property that $(T_{i})$ be a Leibman sequence in the sense of Definition 2.11 (or in Leibman’s more general sense of polynomial mapping used in Theorem 4). In Section 2.2, we construct a quadratic Leibman sequence whose range generates the non-nilpotent “lamplighter group” $\mathbb{Z}\wr\mathbb{Z}$ .

Generalizations of Walsh’s theorem by Austin and Zorin-Kranich imply steps in direction (3) [Aus16, ZK16]. However, Theorems 2, 3 and 4 appear to be new in the general form stated. Nevertheless, given the close relation of our results to others in the existing literature, the main novelty is our “soft” direct approach to proving pointwise convergence of polynomial averages in Hilbert spaces using the framework of Henson metric structures. Our viewpoint is heavily influenced by Tao’s outline [Tao12] of a nonstandard proof à la Robinson of Walsh’s theorem (although we use only standard real numbers, and none of Robinson’s apparatus as such). A significant part of the manuscript consists of natural definitions and basic results on model-theoretic notions of integration and convergence that parallel classical ones; nevertheless, we capture, refine, and in some cases extend such results in Henson’s framework.

Section 1 contains the rather long definition of the Henson class of PET structures over $\mathbb{Z}$. Section 2 introduces the notion of Leibman polynomial sequence; it also exhibits a quadratic Leibman sequence whose range generates the non-nilpotent group $\mathbb{Z}\wr\mathbb{Z}$. In Section 3, we state and prove Theorems 2 and 3 on metastable convergence of polynomial unitary averages for Leibman sequences (over $\mathbb{Z}$), and also explain how Theorem 1 follows as an immediate corollary. In Section 4, we state and prove the most general of our ergodic convergence results in the form of Theorem 4, which generalizes MET in all three directions (1)–(3).

A number of foundational results are contained in the Appendix, which bears a close relation to our prior manuscript [DnI17]. These results pertain to measure theory and integration of real functions, as well as abstract notions of integration of functions taking values in Banach spaces. In this way we obtain a Dominated Convergence Theorem for notions of integration in an ad hoc Henson class of Banach integration frameworks (Theorem 5). We also show that the compactness of Henson’s logic implies a Uniform Metastability Principle for convergence in models of any Henson theory (Proposition A.10). Via this principle, all our results on convergence of averages admit refinements to convergence with metastability rates that are universal. These are gratis refinements thanks to the model-theoretic approach.

1. PET Structures

1.1. Classical PET Structures

Notation 1.1.

Below we list a number of formal symbols $\mathbb{R},\mathbb{Z},\mathbb{N},\mathcal{H},\dots$ that will eventually become sort descriptors for a Henson language of metric structures. However, throughout this subsection, these symbols have the following classical interpretations:

•

$\mathbb{R},\mathbb{Z},\mathbb{N}$ shall denote the sets of real numbers, integers and naturals.

•

$\mathcal{H}$ shall denote a real Hilbert space.

•

$\mathfrak{B}$ shall denote the real Banach algebra $\mathfrak{B}(\mathcal{H},\mathcal{H})$ of bounded operators on $\mathcal{H}$ .

•

$\mathcal{A}_{\mathbb{Z}}$ shall denote the Boolean algebra of all subsets of $\mathbb{Z}$ .

•

$\mathfrak{M}$ shall denote the real Banach space of signed finite measures on $\mathbb{Z}$ (i.e., on the measure space $(\mathbb{Z},\mathcal{A}_{\mathbb{Z}})$ ).

•

$\mathscr{L}^{\infty}_{\mathbb{Z},\mathbb{R}}$ shall denote the Banach space $\mathscr{L}^{\infty}(\mathbb{Z},\mathbb{R})$ of bounded real functions on $\mathbb{Z}$ .

•

$\mathscr{L}^{\infty}_{\mathbb{Z},\mathcal{H}}$ shall denote the Banach space $\mathscr{L}^{\infty}(\mathbb{Z},\mathcal{H})$ of bounded functions $\mathbb{Z}\to\mathcal{H}$ .

•

$\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ shall denote the Banach space $\mathscr{L}^{\infty}(\mathbb{Z},\mathfrak{B})$ of bounded functions $\mathbb{Z}\to\mathfrak{B}$ .

•

$\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{M}}$ shall denote the Banach space $\mathscr{L}^{\infty}(\mathbb{Z},\mathfrak{M})$ of bounded functions $\mathbb{Z}\to\mathfrak{M}$ .

•

$\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathbb{R}}$ shall denote the Banach space $\mathscr{L}^{\infty}(\mathbb{Z}\times\mathbb{Z},\mathbb{R})$ of bounded real functions on $\mathbb{Z}\times\mathbb{Z}$ .

•

$\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathcal{H}}$ shall denote the Banach space $\mathscr{L}^{\infty}(\mathbb{Z}\times\mathbb{Z},\mathcal{H})$ of bounded functions $\mathbb{Z}\times\mathbb{Z}\to\mathcal{H}$ .

•

$\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ shall denote the Banach space $\mathscr{L}^{\infty}(\mathbb{Z}\times\mathbb{Z},\mathfrak{B})$ of bounded functions $\mathbb{Z}\times\mathbb{Z}\to\mathfrak{B}$ .

From a model-theoretic viewpoint, the sets $\mathbb{R},\mathbb{N},\mathbb{Z},\dots$ denoted by the formal symbols above are the sorts of a metric Henson structure $\mathscr{M}$ . (Discrete sorts $\mathbb{N}$ , $\mathbb{Z}$ , $\mathcal{A}_{\mathbb{Z}}$ are still viewed as metric spaces endowed with the discrete metric.) In addition, $\mathscr{M}$ is endowed with a number of distinguished elements (“constants”) and continuous functions between sorts. The distinguished elements include:

•

All elements of $\mathbb{N}$ and $\mathbb{Z}$ .

•

All rational numbers in $\mathbb{R}$ .

•

The zero element of each real Banach space above ( $\mathcal{H},\mathfrak{B},\mathfrak{M},\mathscr{L}^{\infty}_{\mathbb{Z},\mathbb{R}},\dots$ ).

•

The identity operator $I\in\mathfrak{B}$ .

•

The zero (empty set $\emptyset$ ) and unity (improper subset $\mathbb{Z}\subseteq\mathbb{Z}$ ) of the Boolean algebra $\mathcal{A}_{\mathbb{Z}}$ .

The distinguished functions between sorts include:

•

The discrete metric in each of the discrete sorts $\mathbb{Z}$, $\mathbb{N}$, $\mathcal{A}_{\mathbb{Z}}$.

•

The operations of addition, subtraction, multiplication, absolute value, and lattice operations (binary minimum and maximum) on $\mathbb{R}$ .

•

The order $\leq$ of $\mathbb{N}$ , identified with its characteristic function $\llbracket\cdot\!\leq\!\cdot\rrbracket:\mathbb{N}\times\mathbb{N}\to\{0,1\}$ .

•

The membership relation from $\mathbb{Z}$ to $\mathcal{A}_{\mathbb{Z}}$ , identified with its characteristic function $\llbracket{\cdot}\boldsymbol{\in}{\cdot}\rrbracket:\mathbb{Z}\times\mathcal{A}_{\mathbb{Z}}\to\{0,1\}$ .

•

The group operations (unary negation, binary addition and subtraction) of $\mathbb{Z}$ .

•

The operations of union, intersection and complementation on $\mathcal{A}_{\mathbb{Z}}$ .

•

The Hilbert space operations (addition, scalar multiplication, and inner product $(x,y)\mapsto x\cdot y$ ) on $\mathcal{H}$ . For convenience, also the norm $\left\|x\right\|=\sqrt{x\cdot x}$ .

•

The operations of addition and scalar multiplication, and the Banach norm $\left\|\cdot\right\|$, on each Banach sort $\mathfrak{B},\mathfrak{M},\mathscr{L}^{\infty}_{X,Y}$.

For $f\in\mathscr{L}^{\infty}_{X,Y}$ , the Banach norm is $\left\|f\right\|=\sup_{x\in X}\left\|f(x)\right\|$ , where $\left\|f(x)\right\|$ is the norm of $f(x)$ as an element of Banach sort $Y$ . The Banach norm on $\mathfrak{B}$ is $\left\|T\right\|=\sup\{\left\|T(x)\right\|:x\in\mathcal{H},\left\|x\right\|\leq 1\}$ . The Banach norm on $\mu\in\mathfrak{M}$ is “total variation”: Recall that $\mu$ has an atomic decomposition $\mu=\sum_{i\in\mathbb{Z}}c_{i}\delta_{i}$ where $\delta_{i}$ is the unit mass at $i$ and $c_{i}=\mu(\{i\})$ . With this notation, $\left\|\mu\right\|=\sum_{i}|c_{i}|$ .

(To abbreviate the long list of distinguished functions, above and in what follows we use $X$ to denote either of the “domain” discrete sets $\mathbb{Z}$ , $\mathbb{Z}^{2}$ of the various sorts $\mathscr{L}^{\infty}$ , and $Y$ to denote the “codomain” Banach sorts $\mathbb{R}$ , $\mathcal{H}$ , $\mathfrak{B}$ , $\mathfrak{M}$ .)

The list of distinguished functions continues as follows:

•

The operations $\mathscr{L}^{\infty}_{X,\mathcal{H}}\times\mathscr{L}^{\infty}_{X,\mathcal{H}}\to\mathscr{L}^{\infty}_{X,\mathbb{R}}$ induced by (pointwise) application of the inner product of $\mathcal{H}$ .

•

The unary operation of pointwise absolute value $\left|\cdot\right|$ and the binary lattice operations (pointwise $\max$ and $\min$ ) on sorts $\mathscr{L}^{\infty}_{X,\mathbb{R}}$ .

•

The unary operation $\left|\cdot\right|$ of measure of total variation and the binary lattice operations (“pointwise” $\max$ and $\min$ ) on $\mathfrak{M}$ (i.e., $\left|\mu\right|=\sum_{i}\left|a_{i}\right|\delta_{i}$ , $\max(\mu,\nu)=\sum_{i}\max\{a_{i},b_{i}\}\delta_{i}$ , and $\min(\mu,\nu)=\sum_{i}\min\{a_{i},b_{i}\}\delta_{i}$ if $\mu=\sum_{i}a_{i}\delta_{i}$ and $\nu=\sum_{i}b_{i}\delta_{i}$ ).

•

The operation of pointwise magnitude $\left|\cdot\right|:\mathscr{L}^{\infty}_{X,Y}\to\mathscr{L}^{\infty}_{X,\mathbb{R}}$ , namely $|f|:x\mapsto\left\|f(x)\right\|$ for any $f\in\mathscr{L}^{\infty}_{X,Y}$ .

•

The unary adjoint operation $T\mapsto T^{*}$ on $\mathfrak{B}$ , and the corresponding induced operations (pointwise adjoint) on sorts $\mathscr{L}^{\infty}_{X,\mathfrak{B}}$ .

•

The binary operation $(S,T)\mapsto S\circ T$ of composition on $\mathfrak{B}$ , and the corresponding induced operations of pointwise composition on sorts $\mathscr{L}^{\infty}_{X,\mathfrak{B}}$ .

•

The inclusions:

–

$\mathbb{Z}\hookrightarrow\mathcal{A}_{\mathbb{Z}}:i\mapsto\{i\}$ .

–

$\mathcal{A}_{\mathbb{Z}}\hookrightarrow\mathscr{L}^{\infty}_{\mathbb{Z},\mathbb{R}}:A\mapsto\chi_{A}$ where $\chi_{A}$ is the characteristic function of the subset $A\subseteq\mathbb{Z}$ .

–

$\mathbb{Z}\hookrightarrow\mathfrak{M}$ given by $i\mapsto\delta_{i}$ (the unit point mass at $i$ ).

–

$Y\hookrightarrow\mathscr{L}^{\infty}_{X,Y}$, with $y\in Y$ identified with the constant function $y(\blacksquare):x\mapsto y$ in $\mathscr{L}^{\infty}_{X,Y}$.

–

The right inclusion map $\mathscr{L}^{\infty}_{\mathbb{Z},Y}\hookrightarrow\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}$ whereby $f\in\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ is identified with $f(\blacksquare,\cdot):(w,x)\mapsto f(x)$ ; also, the analogous left inclusion map identifying $f$ with $f(\cdot,\blacksquare):(w,x)\mapsto f(w)$ .

•

The function-evaluation maps

–

$(T,x)\mapsto T(x)$ from $\mathfrak{B}\times\mathcal{H}$ to $\mathcal{H}$ .

–

$(f,x)\mapsto f(x)$ from $\mathscr{L}^{\infty}_{X,Y}\times X$ to $Y$.

Also, the maps $\mathscr{L}^{\infty}_{X,\mathfrak{B}}\times\mathscr{L}^{\infty}_{X,\mathcal{H}}\to\mathscr{L}^{\infty}_{X,\mathcal{H}}$ induced by pointwise evaluation.

•

The partial evaluation maps:

–

Left evaluation $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}\times\mathbb{Z}\to\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ , namely $(F,i)\mapsto F(i,\cdot)$ where $F(i,\cdot):j\mapsto F(i,j)$ .

–

Right evaluation $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}\times\mathbb{Z}\to\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ , namely $(F,j)\mapsto F(\cdot,j)$ where $F(\cdot,j):i\mapsto F(i,j)$ .

(Note that the left evaluation map allows us to identify $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}$ with the space $\mathscr{L}^{\infty}(\mathbb{Z},\mathscr{L}^{\infty}_{\mathbb{Z},Y})$ of all bounded functions $\mathbb{Z}\to\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ —thus making a potential sort $\mathscr{L}^{\infty}(\mathbb{Z},\mathscr{L}^{\infty}_{\mathbb{Z},Y})$ superfluous. We also have a different identification of $\mathscr{L}^{\infty}(\mathbb{Z},\mathscr{L}^{\infty}_{\mathbb{Z},Y})$ with $\mathscr{L}^{\infty}(\mathbb{Z}^{2},Y)$ via right evaluation.)

•

The Følner-measure map $\sigma:\mathbb{N}\to\mathfrak{M}$ , where

$$\sigma_{n}=\frac{1}{n+1}\sum_{0\leq i\leq n}\delta_{i}\qquad\text{for all }n\in\mathbb{N}.$$

( $\sigma_{n}$ is the average of unit point masses at the points $0,1,2,\dots,n$ .)

•

The translation action of $\mathbb{Z}$ on $\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ . We regard this action as a function $\mathscr{L}^{\infty}_{\mathbb{Z},Y}\to\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}=\mathscr{L}^{\infty}(\mathbb{Z},\mathscr{L}^{\infty}_{\mathbb{Z},Y})$ (with the latter identification by partial evaluation on the left). The action is denoted $f\mapsto\prescript{}{\bullet}{f}$ where $\prescript{}{\bullet}{f}\in\mathscr{L}^{\infty}(\mathbb{Z},\mathscr{L}^{\infty}_{\mathbb{Z},Y})$ is the function $i\mapsto\prescript{}{i}{f}$ with $\prescript{}{i}{f}\in\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ the function $j\mapsto f(i+j)$ .

•

The shear transformation $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}\to\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}$ , namely $F\mapsto\widetilde{F}$ where $\widetilde{F}:(i,j)\mapsto F(i,i+j)$ .

•

The translation action of $\mathbb{Z}$ on $\mathfrak{M}$ , regarded as a mapping $\mathfrak{M}\to\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{M}}$ and denoted $\mu\mapsto\prescript{}{\bullet}{\mu}$ where $\prescript{}{\bullet}{\mu}\in\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{M}}$ is the mapping $i\mapsto\prescript{}{i}{\mu}$ , with $\prescript{}{i}{\mu}\in\mathfrak{M}$ the measure $\mu$ shifted by $-i$ , namely

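$$\prescript{}{i}{\mu}=\sum_{j\in\mathbb{Z}}a_{j}\delta_{j-i}\qquad\text{if }\mu=\sum_{j\in\mathbb{Z}}a_{j}\delta_{j},$$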

which is classically characterized by the property that $\langle{f},{\mu}\rangle=\langle{\prescript{}{i}{f}},{\prescript{}{i}{\mu}}\rangle$ for all $f\in\mathscr{L}^{\infty}_{\mathbb{Z},\mathbb{R}}$ and $i\in\mathbb{Z}$ .

•

The involutions $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}\to\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}$ induced by the involution $(i,j)\mapsto(j,i)$ of $\mathbb{Z}^{2}$ .

•

The integration operations

–

$\mathscr{L}^{\infty}_{\mathbb{Z},Y}\times\mathfrak{M}\to Y:(f,\mu)\mapsto\langle{f},{\mu}\rangle=\sum_{i\in\mathbb{Z}}c_{i}f(i)$ for $\mu=\sum_{i}c_{i}\delta_{i}$ .

–

(Left integral) $\mathfrak{M}\times\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}\to\mathscr{L}^{\infty}_{\mathbb{Z},Y}:(\mu,F)\mapsto\llangle{\mu},{F}\rrangle$ , where $\llangle{\mu},{F}\rrangle\in\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ is the function $j\mapsto\langle{F(\cdot,j)},{\mu}\rangle=\sum_{i}c_{i}F(i,j)$ .

–

(Right integral) $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}\times\mathfrak{M}\to\mathscr{L}^{\infty}_{\mathbb{Z},Y}:(F,\mu)\mapsto\llangle{F},{\mu}\rrangle$ , where $\llangle{F},{\mu}\rrangle\in\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ is the function $i\mapsto\langle{F(i,\cdot)},{\mu}\rangle=\sum_{j}c_{j}F(i,j)$ .

–

$\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}\times\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{M}}\to\mathscr{L}^{\infty}_{\mathbb{Z},Y}:(F,\mu_{\bullet})\mapsto\llangle{F},{\mu_{\bullet}}\rrangle$ where $\llangle{F},{\mu_{\bullet}}\rrangle\in\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ is the function $j\mapsto\langle{F(\cdot,j)},{\mu_{j}}\rangle$ (i.e., the operation induced by “pointwise integration” when $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}$ is identified with $\mathscr{L}^{\infty}(\mathbb{Z},\mathscr{L}^{\infty}_{\mathbb{Z},Y})$ via left partial evaluation).

For visual convenience, we may use integral notation and write $\int\!f\,d\mu$ or $\int\!f(i)\,d\mu(i)$ for $\langle{f},{\mu}\rangle$ , and $\int\!F(i,\cdot)\,d\mu(i)$ for $\llangle{\mu},{F}\rrangle$ (resp., $\int\!F(\cdot,j)\,d\mu(j)$ for $\llangle{F},{\mu}\rrangle$ ).
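The measure-related operations listed above can be made concrete in a small model. The sketch below represents a finitely supported signed measure on $\mathbb{Z}$ as a dictionary from points to masses and takes $Y=\mathbb{R}$; both simplifications are assumptions made for illustration (elements of $\mathfrak{M}$ may have infinite support, and $Y$ may be any of the Banach sorts). All function names are hypothetical.

```python
def sigma(n):
    """Folner measure sigma_n: mass 1/(n+1) at each of the points 0, 1, ..., n."""
    return {i: 1.0 / (n + 1) for i in range(n + 1)}

def total_variation(mu):
    """Norm of mu = sum_i c_i * delta_i, namely sum_i |c_i|."""
    return sum(abs(c) for c in mu.values())

def pair(f, mu):
    """Integral <f, mu> = sum_i c_i * f(i) for a finitely supported mu."""
    return sum(c * f(i) for i, c in mu.items())

def shift_function(i, f):
    """Translate of f by i: the function j -> f(i + j)."""
    return lambda j: f(i + j)

def shift_measure(i, mu):
    """Measure mu shifted by -i: the mass at j moves to j - i."""
    return {j - i: c for j, c in mu.items()}

def left_integral(mu, F):
    """<<mu, F>>: the function j -> sum_i c_i * F(i, j)."""
    return lambda j: pair(lambda i: F(i, j), mu)

def right_integral(F, mu):
    """<<F, mu>>: the function i -> sum_j c_j * F(i, j)."""
    return lambda i: pair(lambda j: F(i, j), mu)

f = lambda i: i * i
mu = {-1: 0.5, 2: -1.5, 4: 2.0}
print(pair(f, sigma(3)))  # average of f over the points 0, 1, 2, 3
print(pair(f, mu), pair(shift_function(5, f), shift_measure(5, mu)))
```

The last line prints the same value twice, illustrating the invariance $\langle f,\mu\rangle=\langle\prescript{}{i}{f},\prescript{}{i}{\mu}\rangle$ that characterizes the shifted measure.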

Remarks 1.2.

•

There are redundancies on the list of functions above. For instance, the $\mathscr{L}^{2}$ -norm on $\mathcal{H}$ is implicitly defined by its inner product: $\left\|x\right\|^{2}=x\cdot x$ . As a less trivial example, the action of $\mathbb{Z}$ on $\mathscr{L}^{\infty}_{\mathbb{Z},Y}$ is obtained from the right inclusion $\mathscr{L}^{\infty}_{\mathbb{Z},Y}\hookrightarrow\mathscr{L}^{\infty}_{\mathbb{Z}^{2},Y}$ followed by the shear transformation. However, for reasons of exposition we make no effort to present a minimal list of distinguished functions. The model-theoretic approach fundamentally requires that all sorts, functions and constants that are relevant to the problem at hand be part of the structures under study.

•

The nonstrict order relations ( $\leq$ and $\geq$ ) of $\mathbb{R}$ are the only predicate symbols of a Henson language. However, any discrete predicate $P$ may be identified with a $\{0,1\}$ -valued function $\chi_{P}$ (the characteristic function of the truth set of $P$ ), so the usual interpretation of $P(x)$ (resp., of $\neg P(x)$ ) agrees with the interpretation of the Henson formula $\chi_{P}(x)\geq 1/2$ (resp., of $\chi_{P}(x)\leq 1/2$ ).
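Both remarks can be checked mechanically on examples. The sketch below models functions on $\mathbb{Z}$ and $\mathbb{Z}^{2}$ as Python callables (an illustrative assumption, with hypothetical function names) and verifies that the right inclusion followed by the shear reproduces the translation action, and that a discrete predicate agrees with the threshold conditions on its characteristic function.

```python
def right_inclusion(f):
    """Identify f with the two-variable function (w, x) -> f(x)."""
    return lambda w, x: f(x)

def shear(F):
    """Shear transformation: (i, j) -> F(i, i + j)."""
    return lambda i, j: F(i, i + j)

def translate(i, f):
    """Translation action: the function j -> f(i + j)."""
    return lambda j: f(i + j)

def left_evaluate(F, i):
    """Left partial evaluation: the function j -> F(i, j)."""
    return lambda j: F(i, j)

f = lambda n: n ** 3 - 2 * n
composite = shear(right_inclusion(f))
agree = all(left_evaluate(composite, i)(j) == translate(i, f)(j)
            for i in range(-5, 6) for j in range(-5, 6))
print(agree)

def chi_even(n):
    """Characteristic function of the predicate 'n is even'."""
    return 1 if n % 2 == 0 else 0

# P(n) holds exactly when chi_P(n) >= 1/2, and fails exactly when chi_P(n) <= 1/2.
print(chi_even(4) >= 0.5, chi_even(7) <= 0.5)
```

Since a characteristic function only takes the values 0 and 1, the two threshold conditions are mutually exclusive and exhaustive, which is what allows the order relations of $\mathbb{R}$ to stand in for arbitrary discrete predicates.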

Definition 1.3 (Classical PET structure over $\mathbb{Z}$ ).

A classical PET structure (over $\mathbb{Z}$ ) is a triple $\mathscr{M}=(\mathbf{S},\mathbf{C},\mathbf{F})$ where

[TABLE]

is a collection of sorts, $\mathbf{C}$ is a collection of distinguished elements (constants), and $\mathbf{F}$ is a collection of distinguished functions between sorts, provided these sorts, constants and functions are obtained in the manner prescribed by Notation 1.1.

1.2. Abstract PET structures

Definition 1.4 (Henson signature and language for PET structures over $\mathbb{Z}$ ).

The Henson signature for PET structures over $\mathbb{Z}$ consists of three ingredients:

•

A collection of formal symbols, called sort descriptors (or sort names) in one-to-one correspondence with the collection $\mathbf{S}$ of sorts of a classical PET structure. For definiteness, the collection of descriptors is taken to be

[TABLE]

its members regarded as purely formal symbols.

•

A collection of lexical constant symbols containing a unique symbol $\mathtt{c}$ for each of the distinguished elements in Definition 1.3, with each such symbol endowed with a sort descriptor $s$ naming that sort to which the element $c$ named by $\mathtt{c}$ belongs per Definition 1.3.

•

A collection of lexical function symbols containing a unique symbol $\mathtt{f}$ for each of the functions named in Definition 1.3, with each such symbol endowed with a sort-specification of the form $s_{1}\times\dots\times s_{n}\to s_{0}$ where $s_{0},s_{1},\dots,s_{n}$ are sort descriptors chosen in accordance with the specification of the domain (Cartesian product of sorts named by $s_{1},\dots,s_{n}$ ) and codomain (sort named by $s_{0}$ ) of the function $f$ named by the symbol $\mathtt{f}$ .

The Henson language $\mathcal{L}$ for PET structures over $\mathbb{Z}$ is the Henson language (of positive bounded formulas) whose signature is the one just described [HI02, Iov14, DnI17].

Definition 1.5 (PET structure over $\mathbb{Z}$ ).

Let $\mathcal{L}$ be the Henson language for PET structures. Let $\mathbf{PET}$ be the class of all classical PET structures over $\mathbb{Z}$ per Definition 1.3, and let ${\operatorname{Th}_{\mathbf{PET}}}$ be the $\mathcal{L}$ -theory of $\mathbf{PET}$ in Henson’s logic of approximate satisfaction of positive bounded formulas. An (abstract) PET structure over $\mathbb{Z}$ is a model of ${\operatorname{Th}_{\mathbf{PET}}}$ .

The class $\overline{\mathbf{PET}}$ of abstract PET structures obviously extends $\mathbf{PET}$ .

Remarks 1.6.

•

In principle, one may provide an explicit axiomatization in positive bounded Henson formulas of the class $\mathbf{PET}$ . However, given the large number of sorts and functions in a PET structure this task is impractical. We refer the reader to our prior manuscript in which we provide explicit Henson axiomatizations of certain classes of structures somewhat more general than $\mathbf{PET}$ [DnI17]. Nevertheless, it should be clear that the Henson theory ${\operatorname{Th}_{\mathbf{PET}}}$ is uniform in the sense that it imposes bounds on constants as well as local bounds and local moduli of uniform continuity on distinguished functions. Moreover, ${\operatorname{Th}_{\mathbf{PET}}}$ obviously is identical to the theory $\operatorname{Th}_{\overline{\mathbf{PET}}}$ of all abstract PET structures.

•

The Følner map $\sigma:\mathbb{N}\to\mathfrak{M}$ per Notation 1.1 implies a particular choice of a “notion of averaging” over $\mathbb{Z}$ that is built into ${\operatorname{Th}_{\mathbf{PET}}}$ . Nonequivalent definitions of the PET class over $\mathbb{Z}$ and of $\overline{\mathbf{PET}}$ are obtained by changing this choice (e.g., letting $\sigma_{n}=1/(2n+1)\sum_{-n\leq i\leq n}\delta_{i}$ in classical structures), but Theorems 2 and 3 on PET structures over $\mathbb{Z}$ remain true under such alternate choice (in fact, they are special cases of the more general Theorem 4).

•

If $\mathscr{M}$ is a PET structure, then the $\mathbb{R}$-named sort $\mathbb{R}^{\mathscr{M}}$ of $\mathscr{M}$, under the corresponding operations $+_{\mathbb{R}}^{\mathscr{M}},-_{\mathbb{R}}^{\mathscr{M}},\dots$, is (isomorphic to) the standard real numbers; we shall identify $\mathbb{R}^{\mathscr{M}}$ with $\mathbb{R}$. Correspondingly, the “Hilbert sort” $\mathcal{H}^{\mathscr{M}}$ of $\mathscr{M}$ is a classical real Hilbert space. Typically, the $\mathbb{N}$-named sort $\mathcal{N}=\mathbb{N}^{\mathscr{M}}$ of $\mathscr{M}$ is a proper extension of the set $\mathbb{N}$ of standard natural numbers (when the latter is identified with the set of interpretations $\mathtt{m}^{\mathscr{M}}$ of the constant symbols $\mathtt{m}$ of $\mathcal{L}$, one for each standard natural $m$), and similarly $\mathcal{Z}=\mathbb{Z}^{\mathscr{M}}$ extends $\mathbb{Z}$ in general. (The language $\mathcal{L}$ has constants naming only the standard integers and natural numbers, but no nonstandard elements of the sorts $\mathbb{N}^{\mathscr{M}}$, $\mathbb{Z}^{\mathscr{M}}$.) While $\mathfrak{B}^{\mathscr{M}}$ may be identified (via the evaluation map $\mathfrak{B}\times\mathcal{H}\to\mathcal{H}$) with an algebra of bounded operators on $\mathcal{H}^{\mathscr{M}}$, it need not contain all bounded operators. The sort $\mathcal{A}_{\mathbb{Z}}^{\mathscr{M}}$ may be identified (via $\llbracket\cdot\in\cdot\rrbracket$) with a Boolean algebra of some, but not necessarily all, subsets of $\mathbb{Z}^{\mathscr{M}}$, while $(\mathscr{L}^{\infty}_{\mathbb{Z},\mathbb{R}})^{\mathscr{M}}$ may be identified (via the evaluation map) with a space of (not necessarily all) bounded functions $\mathcal{Z}\to\mathbb{R}$.

One of the subtlest differences between classical and abstract PET structures is the fact that $\mathfrak{M}^{\mathscr{M}}$ typically consists of measures that are finitely but not countably additive on $\mathbb{Z}^{\mathscr{M}}$ (in particular, such measures need not have atomic decompositions as in the classical case). Fortunately, this difference turns out not to be critical, at least if one works in saturated PET structures: In this setting, the interplay between sorts $\mathbb{Z}^{\mathscr{M}}$ , $\mathcal{A}_{\mathbb{Z}}^{\mathscr{M}}$ and $(\mathscr{L}^{\infty}_{\mathbb{Z},\mathbb{R}})^{\mathscr{M}}$ comes to the rescue via analogues of Loeb measure and Loeb integration [DnI17]. Appendix A.2 contains a basic discussion of Loeb structures.

2. Leibman sequences

2.1. Classical Leibman sequences

Leibman introduced the notion of polynomial sequences in a group $G$ [Lei98]. Leibman’s polynomial sequences in $G$ generalize sequences (indexed by $\mathbb{Z}$) of the form $(g^{p(j)}:j\in\mathbb{Z})$ where $p:\mathbb{Z}\to\mathbb{Z}$ is a polynomial and $g\in G$ is fixed. Fix such a sequence ${T_{\bullet}}=(T_{j}:j\in\mathbb{Z})$ where $T_{j}=g^{p(j)}$. For fixed $i\in\mathbb{Z}$, the sequence $\Delta^{\!i}{T_{\bullet}}=(T_{i+j}\circ T_{j}^{*}:j\in\mathbb{Z})$ of “step-$i$ discrete differences” of ${T_{\bullet}}$ is of the form $(g^{q(j)})$ where $q=\Delta^{\!i}p:j\mapsto p(i+j)-p(j)$ is a polynomial of degree less than that of $p$ (or possibly the zero polynomial). This motivates Leibman’s recursive definition of polynomial sequence as follows.

Definition 2.1 (Discrete difference and Leibman sequence).

Let $G$ be a multiplicative group with identity $I$ . When convenient, the inverse $g^{-1}$ of an element $g$ of $G$ will be denoted $g^{*}$ . Let $G^{\mathbb{Z}}$ be the group of all $\mathbb{Z}$ -sequences $T:j\mapsto T_{j}$ from $\mathbb{Z}$ into $G$ under the operation of pointwise multiplication induced from $G$ , and endow $G^{\mathbb{Z}}$ with the translation action $\prescript{}{i}{T}$ of $\mathbb{Z}$ , namely $(\prescript{}{i}{T})_{j}=T_{i+j}$ . For $i\in\mathbb{Z}$ , the discrete-difference operator is the function $T\mapsto\Delta^{\!i}T:=\prescript{}{i}{T}\cdot T^{*}$ from $G^{\mathbb{Z}}$ to $G^{\mathbb{Z}}$ ; it is uniquely characterized by the identity

$$(\Delta^{\!i}T)_{j}=T_{i+j}\cdot(T_{j})^{*}\qquad\text{for all } j\in\mathbb{Z}.$$

(Since $(T^{*})_{j}=(T_{j})^{*}$ , parentheses may be omitted without ambiguity.) We will also omit parentheses when writing iterated discrete differences; thus, $\Delta^{\!i}\Delta^{\!j}T$ means $\Delta^{\!i}(\Delta^{\!j}T)$ .

Let $\mathbbm{I}$ denote the constant sequence $j\mapsto I$ . Given $d\in\mathbb{N}$ , a Leibman sequence in $G$ of degree at most $d$ is any $T\in G^{\mathbb{Z}}$ all of whose $(d+1)$ -fold iterated discrete differences are trivial, i.e.,

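$$\Delta^{\!i_{1}}\Delta^{\!i_{2}}\cdots\Delta^{\!i_{d+1}}T=\mathbbm{I}\qquad\text{for all } i_{1},i_{2},\dots,i_{d+1}\in\mathbb{Z}.$$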

A Leibman sequence is a Leibman sequence of some degree $d$; its degree $\deg_{\mathrm{L}}\!T$ is the least such $d$. (We define formally $\deg_{\mathrm{L}}\!\mathbbm{I}=-\infty$.) A Leibman sequence $T$ of degree at most $0$ is called translation-invariant or constant; it is of the form $T=g\mathbbm{I}$ for some $g\in G$ (i.e., $T_{k}=g$ for all $k\in\mathbb{Z}$).
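The following Python sketch illustrates Definition 2.1 on the motivating example $T_{j}=g^{p(j)}$. Everything concrete in it is an assumption made for illustration only: the group is the abelian group of units modulo the prime $101$, the polynomial $p$ is cubic, and the helper names (`diff`, `degree_at_most`, `poly_sequence`) are hypothetical. The check runs over a finite window of steps and indices, so it can refute a degree bound but only gives evidence for it, not a proof.

```python
from itertools import product

P = 101  # a prime; G is the multiplicative group of units mod P (abelian, illustrative choice)

def mul(x, y):
    return (x * y) % P

def inv(x):
    return pow(x, P - 2, P)  # inverse via Fermat's little theorem, valid because P is prime

def diff(i, T):
    """Step-i discrete difference: (Delta^i T)_j = T_{i+j} * (T_j)^{-1}."""
    return lambda j: mul(T(i + j), inv(T(j)))

def degree_at_most(T, d, window=range(-3, 4)):
    """Check that every (d+1)-fold iterated difference is the identity on a finite window."""
    for steps in product(window, repeat=d + 1):
        S = T
        for i in steps:
            S = diff(i, S)
        if any(S(j) != 1 for j in window):
            return False
    return True

def poly_sequence(g, p):
    """The sequence j -> g^{p(j)}; exponents are reduced mod P-1, which is harmless in this group."""
    return lambda j: pow(g, p(j) % (P - 1), P)

# A cubic polynomial p, so T should have Leibman degree 3 (2 generates the units mod 101).
T = poly_sequence(2, lambda j: j**3 - 2 * j + 5)
```

On this window, `degree_at_most(T, 3)` holds while `degree_at_most(T, 2)` fails, matching the expectation that each discrete difference lowers the degree of $p$ by one.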

Remarks 2.2.

•

The definition of Leibman sequence above is indirect and recursive; it involves only the group structures of $(\mathbb{Z},+)$ and $(G,\cdot)$ , but not the product of $\mathbb{Z}$ as one might otherwise expect from the usual construction of polynomials starting with monomials built from multiplication.

•

It can be shown (by an application of the usual method of finite differences) that if $T$ is a Leibman sequence of degree at most $d$ in an abelian group $G$ , then there exist $g_{0},g_{1},\dots,g_{d}\in G$ such that

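$$T_{k}=g_{0}\,g_{1}^{k\choose 1}g_{2}^{k\choose 2}\cdots g_{d}^{k\choose d}\qquad\text{for all } k\in\mathbb{Z},$$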

where ${k\choose j}=k(k-1)\cdots(k-j+1)/j!$ is the $j$-th binomial coefficient. (See Proposition 2.6 below for the case of sequences in an abelian group $G$ that are at most quadratic.) One may regard $g_{0},g_{1},\dots,g_{d}$ as the “coefficients” of the Leibman polynomial $T$. In particular, this abelian setting comprises all families $(g^{p(k)})$ where $p$ is a polynomial $\mathbb{Z}\to\mathbb{Z}$ and $g\in G$ is fixed. Theorem 1 states the convergence of ergodic averages in the abelian case; by contrast, Theorems 2, 3 and 4 assume only that $T$ is a unitary Leibman sequence per Definition 2.11, with no additional explicit commutativity hypotheses.

•

Translations commute with inversion and with discrete differences, i.e., $(\prescript{}{j}{T})^{*}=\prescript{}{j}{(}T^{*})$ and $\prescript{}{j}{(}\Delta^{\!i}T)=\Delta^{\!i}(\prescript{}{j}{T})$ . (The latter equality depends on the commutativity of addition on $\mathbb{Z}$ .) In particular, Leibman degree is invariant under translation. However, the discrete difference operators do not commute with adjoints, so Leibman degree is not invariant under taking adjoints. Correspondingly, $T^{*}$ need not be a Leibman polynomial if $T$ is.
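The binomial-coefficient description of abelian Leibman sequences in the remarks above can be tried out numerically. The sketch below is a minimal illustration under stated assumptions: the abelian group is $(\mathbb{Z},+)$ written additively, so the product $g_{0}g_{1}^{k\choose 1}\cdots g_{d}^{k\choose d}$ becomes the sum $\sum_{j}g_{j}{k\choose j}$; the function names and the sample coefficients are hypothetical.

```python
from math import factorial

def binom(k, j):
    """Generalized binomial coefficient k(k-1)...(k-j+1)/j! for any integer k and j >= 0."""
    num = 1
    for t in range(j):
        num *= k - t
    # The division is exact: j! divides every product of j consecutive integers.
    return num // factorial(j)

def leibman_from_coefficients(gs):
    """Additive analogue of T_k = g_0 g_1^{C(k,1)} ... g_d^{C(k,d)} in the group (Z, +)."""
    return lambda k: sum(g * binom(k, j) for j, g in enumerate(gs))

def diff(i, T):
    """Additive discrete difference: (Delta^i T)_j = T_{i+j} - T_j."""
    return lambda j: T(i + j) - T(j)

# Hypothetical coefficients g_0 = 4, g_1 = -1, g_2 = 3 (degree at most 2).
T = leibman_from_coefficients([4, -1, 3])
```

In this example the coefficients are recovered as $g_{0}=T_{0}$, $g_{1}=\Delta^{\!1}T_{0}$ and $g_{2}=\Delta^{\!1}\Delta^{\!1}T_{0}$, in line with the method of finite differences mentioned in the remark.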

Leibman sequences of degree at most $1$ are easily characterized:

Proposition 2.3.

Given any fixed choice of $a,b\in G$ , there exists a unique Leibman sequence $T$ of degree at most $1$ satisfying $b=T_{0}$ and $a=\Delta^{\!1}T_{0}$ , namely $T:k\mapsto a^{k}b$ .

The straightforward proof of Proposition 2.3 is left to the reader.
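As a sanity check of Proposition 2.3 in a nonabelian setting, the following sketch takes $G$ to be the symmetric group $S_{3}$, with permutations stored as tuples. The choice of group, the elements `a` and `b`, and all helper names are assumptions made for illustration. It builds $T:k\mapsto a^{k}b$ and exposes the discrete difference so that one can confirm that $\Delta^{\!1}T$ is the constant $a$ and that all second differences are trivial.

```python
def compose(p, q):
    """Product p*q of permutations given as tuples: (p*q)(x) = p[q[x]]."""
    return tuple(p[x] for x in q)

def inverse(p):
    out = [0] * len(p)
    for x, y in enumerate(p):
        out[y] = x
    return tuple(out)

def power(p, k):
    """p^k for any integer k (negative k uses the inverse)."""
    base = p if k >= 0 else inverse(p)
    acc = tuple(range(len(p)))
    for _ in range(abs(k)):
        acc = compose(acc, base)
    return acc

I = (0, 1, 2)
a = (1, 0, 2)  # a transposition
b = (1, 2, 0)  # a 3-cycle; a and b do not commute

def T(k):
    """The degree-at-most-1 sequence k -> a^k b of Proposition 2.3."""
    return compose(power(a, k), b)

def diff(i, S):
    """Step-i discrete difference: (Delta^i S)_j = S_{i+j} * (S_j)^{-1}."""
    return lambda j: compose(S(i + j), inverse(S(j)))
```

Here $\Delta^{\!j}T_{k}=a^{j+k}b\,b^{*}a^{-k}=a^{j}$ does not depend on $k$, which is why a further difference yields the identity even though $a$ and $b$ do not commute.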

2.2. Quadratic Leibman sequences

In this section we characterize classical Leibman sequences that are quadratic, i.e., of degree at most $2$ . In particular, we construct a quadratic Leibman sequence whose range generates the non-nilpotent “lamplighter” group $\mathbb{Z}\wr\mathbb{Z}$ (Corollary 2.9). Throughout this section, $G$ will denote a multiplicative group with identity $I$ ; the inverse $g^{-1}$ of $g\in G$ is denoted $g^{*}$ when convenient.

In what follows, we fix a quadratic Leibman sequence $T$. One may suspect that $T$ is uniquely characterized by three constants, say $a=\Delta^{\!1}\Delta^{\!1}T_{0}$, $b=\Delta^{\!1}T_{0}$ and $c=T_{0}$; this is easily shown to be true (see Proposition 2.4 below). However, in contrast to Proposition 2.3, the constants $a,b,c$ are not arbitrary: the requirement that they correspond to a bona fide Leibman sequence $T$ imposes nontrivial relations between $a$ and $b$, which must generate a factor of $\mathbb{Z}\wr\mathbb{Z}$ by Propositions 2.4 and 2.8.

Proposition 2.4.

Given a quadratic Leibman sequence $T$ in a group $G$ , the elements $a=\Delta^{\!1}\Delta^{\!1}T_{0}=T_{2}T_{1}^{*}T_{0}T_{1}^{*}$ and $b=\Delta^{\!1}T_{0}=T_{1}T_{0}^{*}$ satisfy the commutation relations

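$$a\cdot\prescript{[b^{k}]}{}{\!}a=\prescript{[b^{k}]}{}{\!}a\cdot a\qquad\text{for all } k\in\mathbb{Z},\tag{2.1}$$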

where $\prescript{[h]}{}{\!}g:=hgh^{*}$ is the conjugate of $g$ by $h$ . Conversely, given $a,b,c\in G$ such that the above relations hold for $a$ and $b$ , there exists a unique quadratic Leibman sequence $T$ satisfying $\Delta^{\!1}\Delta^{\!1}T_{0}=a$ , $\Delta^{\!1}T_{0}=b$ and $T_{0}=c$ .

Note that the commutation relations (2.1) do not involve $c$ at all.

2.2.1. Proof of Proposition 2.4

Free variables $i,j,k,l,m,n,p,x,y,z$ will denote elements of $\mathbb{Z}$ throughout. The iterated discrete differences $\Delta^{\!i}T,\Delta^{\!i}\Delta^{\!j}T,\Delta^{\!i}\Delta^{\!j}\Delta^{\!k}T$ of $T$ will be denoted $\delta(i),\delta(i,j),\delta(i,j,k)$ , respectively.

First, let $T$ be a quadratic Leibman sequence; thus, $\delta(i,j,k)=\mathbbm{I}$ for all $i,j,k$ , by definition of Leibman degree. It follows that $\prescript{}{i}{\delta}(j,k)=\delta(j,k)$ , i.e., $\delta(j,k)$ is constant.

Denote by $\prescript{[S]}{}{\!}R$ the conjugate $SRS^{*}$ of $R$ by $S$ ( $R,S\in G^{\mathbb{Z}}$ ). Straightforward algebra shows that the “cocycle identity”

[TABLE]

holds for arbitrary $T\in G^{\mathbb{Z}}$ . Under the assumption that $T$ is quadratic, all terms $\delta(\cdot,\cdot)$ in the identity above are constant, so the cocycle identity improves itself to one with an extra free parameter $m$ :

[TABLE]

Let $\llbracket R,S\rrbracket=R^{*}S^{*}RS$ be the commutator of $R,S$ . Using the cocycle identity (2.3) to expand $\delta(i,j+k+l)$ in two different ways, we find $\llbracket\delta(i,j),\prescript{}{m}{\delta}(k+l)^{*}\cdot\prescript{}{n}{\delta}(k)\cdot\prescript{}{p}{\delta}(l)\rrbracket=\mathbbm{I}$ . Using the relation $\delta(x)^{*}=\prescript{}{x}{\delta}(-x)$ , this identity may be rewritten

[TABLE]

Let $H$ be the subgroup of $G^{\mathbb{Z}}$ generated by all elements $\prescript{}{k}{\delta}(x)$ , and $K$ the subgroup of $H$ generated by all elements $\prescript{}{k}{\delta}(x)\cdot\prescript{}{l}{\delta}(y)\cdot\prescript{}{m}{\delta}(z)$ with $x+y+z=0$ . It follows from the commutation relations (2.4) that $K$ is a subgroup of the centralizer of $\delta(i,j)$ in $G^{\mathbb{Z}}$ . It is easy to see that $K$ is a normal subgroup of $H$ , and $x\mapsto\delta(x)\pmod{K}$ is a homomorphism $\mathbb{Z}\to H/K$ . Note that $\prescript{[\delta(l)^{k}]}{}{\delta}(m,n)=\delta(l)^{k}\cdot\prescript{}{m}{\delta}(n)\cdot\delta(n)^{-1}\cdot\delta(l)^{-k}\equiv\delta(kl)\cdot\delta(n)\cdot\delta(-n)\cdot\delta(-kl)\equiv\mathbbm{I}\pmod{K}$ , so the following commutation identity follows:

[TABLE]

Putting $i=j=l=m=n=1$ (with $k$ arbitrary) and evaluating at $0$, we obtain the commutation relations (2.1).

Conversely, let $a,b,c$ satisfy the commutation relations (2.1). We claim that a unique $T\in G^{\mathbb{Z}}$ exists satisfying

[TABLE]

Let $T_{0}=c$ and $T_{1}=bc$ , so the first two conditions above hold. For $k\geq 0$ , the condition $T_{k+2}T_{k+1}^{*}T_{k}T_{k+1}^{*}=\delta(1,1)_{k}=a$ is equivalent to the forward recurrence $T_{k+2}=aT_{k+1}T_{k}^{*}T_{k+1}$ , while for $k\leq 0$ , it is equivalent to the backward recurrence $T_{k}=T_{k+1}T_{k+2}^{*}aT_{k+1}$ . Using both recurrences with the initial values $T_{0}=c$ , $T_{1}=bc$ , we obtain a unique $T\in G^{\mathbb{Z}}$ satisfying the conditions (2.6).

To prove that $T$ is indeed a quadratic Leibman sequence, it remains to show that $\delta(i,j)$ is constant for all $i,j$ . This is done inductively, starting from (2.6), which implies that $\delta(1,1)$ is constant. The details follow.

Lemma 2.5.

Let $A=\delta(1,1)=a\mathbbm{I}$ and $B=\delta(1)$ . For $R\in G^{\mathbb{Z}}$ , let the naive degree $\deg R$ of $R$ be the sum of the exponents of all occurrences of $B$ when $R$ is written as a word in the alphabet $A^{1},A^{-1},B^{1}$ , $B^{-1}$ . Then naive degree is invariant under translations: $\deg R=\deg(\prescript{}{j}{\!}R)$ for all $j\in\mathbb{Z}$ . Define

[TABLE]

Let $\langle A,B\rangle$ be the subgroup of $G^{\mathbb{Z}}$ generated by $A$ and $B$. The elements $\{\!\!\{{R}\}\!\!\}$ for $R\in\langle A,B\rangle$ are constant and commute with each other pairwise. Moreover, $\{\!\!\{{R}\}\!\!\}$ depends only on $\deg R$; in fact, $\{\!\!\{{R}\}\!\!\}=\{\!\!\{{B^{\deg R}}\}\!\!\}$. (The naive degree need not be well defined as an integer, but it is well defined as an integer modulo $N$ if $N$ is the least positive integer, if any, such that $I$ has an expression as a word of naive degree $N$, in addition to its expression as the empty word of naive degree $0$. Thus, naive degree induces a well-defined notion of degree (modulo $N$) with respect to which the identity $\{\!\!\{{R}\}\!\!\}=\{\!\!\{{B^{\deg R}}\}\!\!\}$ holds.)

Proof.

Let $\mathcal{W}=\mathcal{W}(X,Y)$ denote a word in four formal symbols $X,X^{*},Y,Y^{*}$ . If $P,Q,R\in G^{\mathbb{Z}}$ are such that the word $\mathcal{W}$ evaluates to $R$ (using the group operation of $G^{\mathbb{Z}}$ ) under the substitutions $X=P$ , $X^{*}=P^{-1}$ , $Y=Q$ , $Y^{*}=Q^{-1}$ , we write $R=\mathcal{W}(P,Q)$ . For an arbitrary such word $\mathcal{W}$ , let $R=\mathcal{W}(A,B)$ . The equalities $A=\prescript{}{j}{A}$ and $\prescript{}{j}{B}=A^{j}B$ imply

[TABLE]

for a new word $\mathcal{W}^{\prime}=\mathcal{W}^{\prime}(X,Y)$ using exactly as many of each of the symbols $Y$, $Y^{*}$ as $\mathcal{W}$ (but possibly more of $X$, $X^{*}$). (To be precise, $\mathcal{W}(A,A^{j}B)$ is a word $\mathcal{W}^{\prime}(A,B)$ where $\mathcal{W}^{\prime}(X,Y)$ is obtained from $\mathcal{W}(X,Y)$ by performing the substitutions $Y=X^{j}Y$, $Y^{*}=Y^{*}X^{-j}$, where powers $X^{k}$, $X^{-k}$ (for $k\geq 0$) are interpreted as the $k$-words $X\dots X$ and $X^{*}\dots X^{*}$, respectively.) It follows that naive degree is invariant under translations.

We have $\prescript{}{j}{\!}\{\!\!\{{\!R}\}\!\!\}=\prescript{}{j}{(}RAR^{*})=\prescript{}{j}{R}\cdot\prescript{}{j}{A}\cdot\prescript{}{j}{R^{*}}=\{\!\!\{{\prescript{}{j}{\!}R}\}\!\!\}$ since $\prescript{}{j}{A}=A$ . Because of the translation invariance of naive degree, once the identity $\{\!\!\{{R}\}\!\!\}=\{\!\!\{{B^{\deg R}}\}\!\!\}$ is proved, it shall follow that $\{\!\!\{{R}\}\!\!\}$ is constant, since $\prescript{}{j}{\{\!\!\{{R}\}\!\!\}}=\{\!\!\{{\prescript{}{j}{R}}\}\!\!\}=\{\!\!\{{B^{\deg(\prescript{}{j}{R})}}\}\!\!\}=\{\!\!\{{B^{\deg R}}\}\!\!\}=\{\!\!\{{R}\}\!\!\}$ .

By identity (2.7), proving $\{\!\!\{{R}\}\!\!\}\{\!\!\{{S}\}\!\!\}=\{\!\!\{{S}\}\!\!\}\{\!\!\{{R}\}\!\!\}$ for all $R,S\in G^{\mathbb{Z}}$ reduces to showing $\{\!\!\{{R}\}\!\!\}_{0}\cdot\{\!\!\{{S}\}\!\!\}_{0}=\{\!\!\{{S}\}\!\!\}_{0}\cdot\{\!\!\{{R}\}\!\!\}_{0}$ . Abusing notation, define $\{\!\!\{{g}\}\!\!\}=\prescript{[g]}{}{a}=gag^{*}$ for $g\in G$ . The remainder of the proof thus reduces to proving

(1)

$\{\!\!\{{g}\}\!\!\}$ depends only on the naive degree $\deg g$ of $g\in G$ (defined as the sum of the exponents of $b$ in an expression $g=\mathcal{W}(a,b)$ of $g$ as a word $\mathcal{W}$ on $a^{1},a^{-1},b^{1}$, and $b^{-1}$); in fact, $\{\!\!\{{g}\}\!\!\}=\{\!\!\{{b^{n}}\}\!\!\}$ where $n=\deg g$, and

(2)

the elements $\{\!\!\{{g}\}\!\!\}=\prescript{[g]}{}{\!}a$ for $g$ in the subgroup $\langle a,b\rangle$ generated by $a$ and $b$ commute pairwise.

Since $\{\!\!\{{g}\}\!\!\}$ commutes with $\{\!\!\{{h}\}\!\!\}$ iff $\{\!\!\{{g^{*}h}\}\!\!\}$ commutes with $\{\!\!\{{I}\}\!\!\}=a$ , property (2) follows from

(2’)

$\{\!\!\{{g}\}\!\!\}$ commutes with $a$ if $g\in\langle a,b\rangle$ .

Note that $g\in\langle a,b\rangle$ satisfies properties (1) and (2’) iff any one of $g^{*},ag,a^{*}g$ does.

For $m\geq 0$ , let $\langle a,b\rangle_{m}$ be the set of elements of $\langle a,b\rangle$ that are words $\mathcal{W}(a,b)$ using no more than $m$ symbols $b,b^{*}$ . By induction on $m$ , we prove assertions (1) and (2’) for $g\in\langle a,b\rangle_{m}$ . (This will prove the assertions for all $g\in\langle a,b\rangle=\bigcup_{m}\langle a,b\rangle_{m}$ .) First, $\langle a,b\rangle_{0}$ consists of powers $a^{k}$ having naive degree zero. Since $a$ commutes with $a^{k}$ , it follows that $\{\!\!\{{a^{k}}\}\!\!\}=\prescript{[a^{k}]}{}{a}=a$ ; thus, assertions (1) and (2’) hold for $m=0$ . Next, assume both assertions hold for some fixed $m\geq 0$ . Let $g\in\langle a,b\rangle_{m+1}$ be arbitrary. Without loss of generality (possibly multiplying $g$ by powers of $a$ or $a^{*}$ on the left) we may assume that $g=b^{\pm 1}h$ with $h\in\langle a,b\rangle_{m}$ . If $g=b^{\pm 1}h$ , then $\deg g=\deg h\pm 1$ , and it follows from the inductive hypothesis that $\{\!\!\{{g}\}\!\!\}=\{\!\!\{{b^{\pm 1}h}\}\!\!\}=\prescript{[b^{\pm 1}]}{}{\!}{\{\!\!\{{h}\}\!\!\}}=\prescript{[b^{\pm 1}]}{}{\!}{\{\!\!\{{b^{\deg h}}\}\!\!\}}=\{\!\!\{{b^{\pm 1}b^{\deg{h}}}\}\!\!\}=\{\!\!\{{b^{\deg g}}\}\!\!\}$ . Note that $\{\!\!\{{b^{k}}\}\!\!\}=\prescript{[b^{k}]}{}{\!}a$ commutes with $a$ by the hypothesis of Proposition 2.4; hence, so does $\{\!\!\{{g}\}\!\!\}$ . This shows that assertions (1) and (2’) hold for $m+1$ , completing the proof of Lemma 2.5. ∎
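Lemma 2.5 can be observed concretely in the lamplighter group, which appears later in this section. The sketch below is only an illustration under stated assumptions: elements of $\mathbb{Z}\wr\mathbb{Z}$ are modeled as pairs (finitely supported function $\mathbb{Z}\to\mathbb{Z}$, shift), with $a$ the unit lamp at position $0$ and $b$ the unit shift; words are encoded as strings in which `A` and `B` stand for $a^{-1}$ and $b^{-1}$. This encoding and all function names are hypothetical.

```python
def normalize(d):
    """Store a finitely supported function Z -> Z as a sorted tuple of nonzero entries."""
    return tuple(sorted((pos, val) for pos, val in d.items() if val != 0))

def mul(x, y):
    """(f, n) * (g, m) = (f + g shifted by n, n + m)."""
    (f, n), (g, m) = x, y
    out = dict(f)
    for pos, val in g:
        out[pos + n] = out.get(pos + n, 0) + val
    return (normalize(out), n + m)

def inv(x):
    f, n = x
    return (normalize({pos - n: -val for pos, val in f}), -n)

IDENTITY = ((), 0)
a = (((0, 1),), 0)  # lamp at position 0
b = ((), 1)         # unit shift
LETTERS = {"a": a, "A": inv(a), "b": b, "B": inv(b)}

def evaluate(word):
    """Evaluate a word in the letters a, A = a^{-1}, b, B = b^{-1}."""
    g = IDENTITY
    for ch in word:
        g = mul(g, LETTERS[ch])
    return g

def naive_degree(word):
    """Sum of the exponents of b in the word."""
    return word.count("b") - word.count("B")

def bracket(g):
    """The conjugate g a g^{-1}, written {{g}} in Lemma 2.5."""
    return mul(mul(g, a), inv(g))

WORDS = ["", "ab", "baB", "bbaAB", "BaBab", "abAbbaB"]
```

In this model $\{\!\!\{g\}\!\!\}$ is the lamp at position $\deg g$, so it depends only on the naive degree, and all such elements commute because they lie in the abelian subgroup of lamp configurations.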

Continuing the proof of Proposition 2.4, let $A=\delta(1,1)$ and $B=\delta(1)$ as above, and let $\{\!\!\{{\langle A,B\rangle}\}\!\!\}$ be the subgroup of $\langle A,B\rangle$ generated by elements $\{\!\!\{{R}\}\!\!\}$ with $R\in\langle A,B\rangle$ . By Lemma 2.5, $\{\!\!\{{\langle A,B\rangle}\}\!\!\}$ is an abelian group of constants. By induction, one shows first that for all $k,l$ we have $\prescript{}{k}{\delta}(l)\in\langle A,B\rangle$ , and subsequently that $\delta(k,l)\in\{\!\!\{{\langle A,B\rangle}\}\!\!\}$ (using Lemma 2.5 and the cocycle identity (2.2) to induct on $l$ , then the identity $\delta(i+j,l)=\prescript{}{j}{\delta}(i,l)\delta(j,l)$ to induct on $k$ , plus simple manipulations to extend to negative $k,l$ ). Thus, $\delta(k,l)$ is constant for all $k,l$ . This implies that $\mathbbm{I}=\delta(j,k,l)=\Delta^{\!j}\Delta^{\!k}\Delta^{\!l}T$ for all $j,k,l$ , showing that $T$ is a quadratic Leibman sequence and concluding the proof of Proposition 2.4.

2.2.2. Some consequences of Proposition 2.4

First, we give some definitions. Given a sequence $(x_{i}:i\in\mathbb{Z})$ in any multiplicative group $\mathbb{G}$ , there is a natural notion of product $\prod_{i=k}^{l}x_{i}$ of the terms $x_{i}$ “as $i$ ranges from $k$ to $l$ ”; it is characterized by the properties

(1)

$\prod_{i=k}^{k-1}x_{i}=1_{\mathbb{G}}$ (the identity of $\mathbb{G}$), and

(2)

$\prod_{i=k}^{l+1}x_{i}=\left(\prod_{i=k}^{l}x_{i}\right)\cdot x_{l+1}$

for all $k,l\in\mathbb{Z}$ .

Informally, terms $x_{i}$ are multiplied left-to-right in succession. For fixed $k$ , one obtains the familiar definitions

$$\prod_{i=k}^{l}x_{i}=x_{k}\cdot x_{k+1}\cdots x_{l}\qquad\text{for } l\geq k,$$

but also the less familiar

$$\prod_{i=k}^{l}x_{i}=x_{k-1}^{-1}\cdot x_{k-2}^{-1}\cdots x_{l+1}^{-1}\qquad\text{for } l<k-1.$$

There is a corresponding notion of product $\operatorname{\prescript{\mathrm{op}}{}{\prod}}$ evaluated in the opposite group $\mathbb{G}^{\mathrm{op}}$ of $\mathbb{G}$ : iterated products are computed right-to-left instead, namely

(1)

$\operatorname{\prescript{\mathrm{op}}{}{\prod}}_{i=k}^{k-1}x_{i}=1_{\mathbb{G}}$, and

(2)

$\operatorname{\prescript{\mathrm{op}}{}{\prod}}_{i=k}^{l+1}x_{i}=x_{l+1}\cdot\left(\operatorname{\prescript{\mathrm{op}}{}{\prod}}_{i=k}^{l}x_{i}\right)$

for all $k,l\in\mathbb{Z}$. (One may alternatively define $\operatorname{\prescript{\mathrm{op}}{}{\prod}}_{i=k}^{l}x_{i}:=\bigl{(}\prod_{i=k}^{l}x_{i}^{-1}\bigr{)}^{-1}$.)
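The two-sided products just defined, including the case $l<k-1$, can be sketched in code as follows. This is a minimal illustration, assuming that the group is $S_{3}$ (permutations as tuples) and that the sequence `x` is a hypothetical $\mathbb{Z}$-indexed sequence of period $3$; the function names are likewise illustrative. For $l<k-1$, property (2) forces the product to be $x_{k-1}^{-1}x_{k-2}^{-1}\cdots x_{l+1}^{-1}$, which is what the second branch computes.

```python
def compose(p, q):
    """Product p*q of permutations given as tuples: (p*q)(x) = p[q[x]]."""
    return tuple(p[x] for x in q)

def inverse(p):
    out = [0] * len(p)
    for x, y in enumerate(p):
        out[y] = x
    return tuple(out)

IDENTITY = (0, 1, 2)
GENS = [(1, 0, 2), (1, 2, 0), (0, 2, 1)]

def x(i):
    """A hypothetical Z-indexed sequence in S_3, periodic of period 3."""
    return GENS[i % 3]

def prod(seq, k, l):
    """Left-to-right product of seq(i) as i ranges from k to l, for arbitrary integers k, l."""
    acc = IDENTITY
    if l >= k - 1:
        for i in range(k, l + 1):        # seq(k) * seq(k+1) * ... * seq(l)
            acc = compose(acc, seq(i))
    else:
        for i in range(k - 1, l, -1):    # seq(k-1)^{-1} * ... * seq(l+1)^{-1}
            acc = compose(acc, inverse(seq(i)))
    return acc

def op_prod(seq, k, l):
    """The same product evaluated in the opposite group (right-to-left)."""
    acc = IDENTITY
    if l >= k - 1:
        for i in range(k, l + 1):        # seq(l) * ... * seq(k)
            acc = compose(seq(i), acc)
    else:
        for i in range(k - 1, l, -1):    # seq(l+1)^{-1} * ... * seq(k-1)^{-1}
            acc = compose(inverse(seq(i)), acc)
    return acc
```

Both characterizing properties, as well as the alternative description of the opposite product as the inverse of the product of the inverses, can be verified on any finite range of $k$ and $l$.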

Proposition 2.6.

If $a,b,c$ are elements of a group $G$ satisfying the commutation relations (2.1), the unique quadratic Leibman sequence $T$ satisfying $\Delta^{\!1}\Delta^{\!1}T_{0}=a$ , $\Delta^{\!1}T_{0}=b$ and $T_{0}=c$ is given by the expression

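$$T_{j}=\operatorname{\prescript{\mathrm{op}}{}{\prod}}_{i=1}^{j}\bigl(a^{i-1}b\bigr)\cdot c\qquad\text{for all } j\in\mathbb{Z}.\tag{2.8}$$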

In particular, if $a$ and $b$ commute, then $T_{j}=a^{j\choose 2}b^{j}c$ where ${j\choose 2}=j(j-1)/2$ for all $j\in\mathbb{Z}$ .

Proof.

From the identities $\delta(j+1)=\prescript{}{j}{\delta}(1)\cdot\delta(j)$ and $\prescript{}{j}{\delta}(1)=\delta(j,1)\delta(1)=\delta(1,1)^{j}\cdot\delta(1)=A^{j}B$, we obtain $\prescript{}{j}{T}\cdot T^{*}=\delta(j)=\operatorname{\prescript{\mathrm{op}}{}{\prod}}_{i=1}^{j}(A^{i-1}B)$. Thus, $\prescript{}{j}{T}=\operatorname{\prescript{\mathrm{op}}{}{\prod}}_{i=1}^{j}(A^{i-1}B)\cdot T$. Equation (2.8) follows by evaluating the latter identity at $0$.

If $a$ and $b$ commute, then $T_{j}=\prod_{i=1}^{j}a^{i-1}\cdot\prod_{i=1}^{j}b\cdot c=a^{\sum_{i=1}^{j}(i-1)}\cdot b^{j}\cdot c$ , where $\sum_{i=1}^{j}(i-1)=j(j-1)/2={j\choose 2}$ . (For $j\leq 0$ , the sum $\sum_{i=1}^{j}$ is understood in the obvious sense analogous to the definition of $\prod_{i=1}^{j}$ above.) ∎
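The commutative case of Proposition 2.6 lends itself to a quick numerical check. In the sketch below the group is assumed, purely for illustration, to be the units modulo the prime $101$, and the constants $a=2$, $b=3$, $c=5$ are hypothetical. The closed form $T_{j}=a^{j\choose 2}b^{j}c$ is compared against the defining data $T_{0}=c$, $\Delta^{\!1}T_{0}=b$, $\Delta^{\!1}\Delta^{\!1}T=a\mathbbm{I}$ and against the forward recurrence $T_{k+2}=aT_{k+1}T_{k}^{*}T_{k+1}$ from the proof of Proposition 2.4.

```python
P = 101  # prime modulus; G is the (abelian) group of units mod P, an illustrative choice

def power(g, e):
    """g^e for any integer e; the exponent is reduced mod P-1 (Fermat's little theorem)."""
    return pow(g, e % (P - 1), P)

a, b, c = 2, 3, 5  # hypothetical constants; any units mod P would do

def T(j):
    """Closed form T_j = a^{C(j,2)} b^j c with C(j,2) = j(j-1)/2, valid for all integers j."""
    return power(a, j * (j - 1) // 2) * power(b, j) * c % P

def diff(i, S):
    """Step-i discrete difference: (Delta^i S)_j = S_{i+j} * (S_j)^{-1}."""
    return lambda j: S(i + j) * power(S(j), -1) % P
```

Note that $j(j-1)$ is always even, so the exponent ${j\choose 2}$ is an integer for negative $j$ as well.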

Definition 2.7.

The restricted wreath product $\mathbb{Z}\wr\mathbb{Z}$ of $\mathbb{Z}$ with itself is called the lamplighter group. It is realized as a group on generators $(\alpha_{k}:k\in\mathbb{Z})$ and $\beta$ subject to the relations

[TABLE]

The subgroups $H=\langle\beta\rangle$ and $K=\langle\alpha_{k}:k\in\mathbb{Z}\rangle$ of $\mathbb{Z}\wr\mathbb{Z}$ are abelian. Together, they generate $\mathbb{Z}\wr\mathbb{Z}$ , and it is easy to show that each is its own centralizer in $\mathbb{Z}\wr\mathbb{Z}$ ; therefore, $\mathbb{Z}\wr\mathbb{Z}$ has trivial center. In particular, $\mathbb{Z}\wr\mathbb{Z}$ is not nilpotent. It is, however, solvable, being a semidirect product of the two abelian groups $H$ , $K$ .

Proposition 2.8.

Given a quadratic Leibman sequence $T$ in a group $G$ , its discrete differences $\Delta^{\!i}T,\Delta^{\!i}\Delta^{\!j}T$ ( $i,j\in\mathbb{Z}$ ) generate a subgroup $\langle\Delta^{\!\!\circ}T\rangle$ of $G^{\mathbb{Z}}$ isomorphic to a factor of $\mathbb{Z}\wr\mathbb{Z}$ . Actually, the group $\langle\Delta^{\!\!\circ}T\rangle$ is already generated by $A=\Delta^{\!1}\!\Delta^{\!1}T$ and $B=\Delta^{\!1}T$ . The values $\Delta^{\!i}T_{m},\Delta^{\!i}\Delta^{\!j}T_{m}$ ( $i,j,m\in\mathbb{Z}$ ) of these discrete differences generate a subgroup $\langle\Delta^{\!\!\circ}T\rangle_{\!\bullet}$ of $G$ also isomorphic to a factor of $\mathbb{Z}\wr\mathbb{Z}$ . In fact, $a=A_{0}$ and $b=B_{0}$ already generate $\langle\Delta^{\!\!\circ}T\rangle_{\!\bullet}$ . Furthermore, there exists a quadratic Leibman sequence such that both $\langle\Delta^{\!\!\circ}T\rangle$ and $\langle\Delta^{\!\!\circ}T\rangle_{\!\bullet}$ are isomorphic to $\mathbb{Z}\wr\mathbb{Z}$ itself.

Proof.

The proof of Proposition 2.4 shows that $\langle\Delta^{\!\!\circ}T\rangle$ is generated by $A$ and $B$ . A fortiori, $\langle\Delta^{\!\!\circ}T\rangle$ is generated by $B$ and $A_{[k]}:=\prescript{[B^{k}]}{}{\!}{A}$ for $k\in\mathbb{Z}$ (since $A_{[0]}=A$ ). The special case $i=j=l=m=n=1$ (with $k$ arbitrary) of equation (2.5) gives the commutation relations

[TABLE]

whence the following relations are easily proved using induction and the definition of $A_{[k]}$ :

[TABLE]

In general, further relations between $A$ and $B$ may hold; nevertheless, we see that the generators $A_{[k]}$ ( $k\in\mathbb{Z}$ ) and $B$ of $\langle\Delta^{\!\!\circ}T\rangle$ satisfy the defining relations (2.9), (2.10) of $\mathbb{Z}\wr\mathbb{Z}$ , so $\langle\Delta^{\!\!\circ}T\rangle$ is isomorphic to a factor of $\mathbb{Z}\wr\mathbb{Z}$ . Evaluation at zero is a homomorphism $G^{\mathbb{Z}}\to G$ that restricts to a surjection $\langle\Delta^{\!\!\circ}T\rangle\to\langle\Delta^{\!\!\circ}T\rangle_{\!\bullet}$ , so $\langle\Delta^{\!\!\circ}T\rangle_{\!\bullet}$ is isomorphic to a factor of $\langle\Delta^{\!\!\circ}T\rangle$ , and thus of $\mathbb{Z}\wr\mathbb{Z}$ , generated by $a=A_{0}$ and $b=B_{0}$ .

Reciprocally, let $(\alpha_{k}:k\in\mathbb{Z})$ and $\beta$ be the canonical generators of $\mathbb{Z}\wr\mathbb{Z}$ , i.e., these elements obey only relations implied by (2.9) and (2.10). It follows from Proposition 2.4 that there is a unique quadratic Leibman sequence $T$ in $\mathbb{Z}\wr\mathbb{Z}$ satisfying $\Delta^{\!1}\Delta^{\!1}T_{0}=\alpha_{0}$ , $\Delta^{\!1}T_{0}=\beta$ and $T_{0}=I$ . For this sequence $T$ we have $\langle\Delta^{\!\!\circ}T\rangle_{\!\bullet}\simeq\mathbb{Z}\wr\mathbb{Z}$ , and hence $\langle\Delta^{\!\!\circ}T\rangle\simeq\mathbb{Z}\wr\mathbb{Z}$ also. ∎

Corollary 2.9.

There exists a quadratic Leibman sequence $T$ with $T_{0}=I$ whose range generates a non-nilpotent group.

Proof.

The quadratic Leibman sequence $T$ constructed in the proof of Proposition 2.8 has $T_{0}=I$ , and its range generates $\mathbb{Z}\wr\mathbb{Z}$ , which is not nilpotent. ∎
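The construction behind Corollary 2.9 can be explored computationally. The sketch below is an illustration under stated assumptions, not the authors' code: $\mathbb{Z}\wr\mathbb{Z}$ is modeled as pairs (finitely supported function $\mathbb{Z}\to\mathbb{Z}$, shift), with $\alpha_{k}$ the unit lamp at position $k$ and $\beta$ the unit shift, so that $\beta\alpha_{k}\beta^{-1}=\alpha_{k+1}$ and the $\alpha_{k}$ commute. The sequence $T$ with $a=\alpha_{0}$, $b=\beta$, $c=I$ is generated by the forward and backward recurrences from the proof of Proposition 2.4; all function names are hypothetical.

```python
from functools import lru_cache

def normalize(d):
    """Store a finitely supported function Z -> Z as a sorted tuple of nonzero entries."""
    return tuple(sorted((pos, val) for pos, val in d.items() if val != 0))

def mul(x, y):
    """(f, n) * (g, m) = (f + g shifted by n, n + m)."""
    (f, n), (g, m) = x, y
    out = dict(f)
    for pos, val in g:
        out[pos + n] = out.get(pos + n, 0) + val
    return (normalize(out), n + m)

def inv(x):
    f, n = x
    return (normalize({pos - n: -val for pos, val in f}), -n)

IDENTITY = ((), 0)
BETA = ((), 1)

def alpha(k):
    return (((k, 1),), 0)

a, b, c = alpha(0), BETA, IDENTITY

@lru_cache(maxsize=None)
def T(k):
    """T_0 = c, T_1 = bc, extended by the forward and backward recurrences."""
    if k == 0:
        return c
    if k == 1:
        return mul(b, c)
    if k >= 2:  # T_k = a T_{k-1} T_{k-2}^* T_{k-1}
        return mul(mul(mul(a, T(k - 1)), inv(T(k - 2))), T(k - 1))
    # k < 0: T_k = T_{k+1} T_{k+2}^* a T_{k+1}
    return mul(mul(mul(T(k + 1), inv(T(k + 2))), a), T(k + 1))

def diff(i, S):
    """Step-i discrete difference: (Delta^i S)_j = S_{i+j} * (S_j)^{-1}."""
    return lambda j: mul(S(i + j), inv(S(j)))

def closed_form(j):
    """For j >= 0: (a^{j-1} b)(a^{j-2} b)...(a^0 b) c, the right-to-left product of Proposition 2.6."""
    acc = c
    for i in range(1, j + 1):
        acc = mul(mul((normalize({0: i - 1}), 0), b), acc)
    return acc
```

For instance, $T_{3}=\alpha_{0}^{2}\alpha_{1}\beta^{3}$ in this model. Checking that all third differences are trivial on a finite window is consistent with $T$ being quadratic, although only Proposition 2.4 establishes this for all indices.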

Remark 2.10.

Bergelson and Leibman used the lamplighter group to construct counterexamples showing that multiple recurrence and multiple convergence results that hold for ergodic actions generating nilpotent groups do fail for non-nilpotent groups [BL04]. In contrast to the case of multiple ergodic averages, all (simple) ergodic convergence results in the present manuscript—including Theorems 2 and 4—hold under the sole hypothesis that the family $T_{\bullet}$ is a Leibman sequence, which already in the quadratic setting includes cases in which the range of $T_{\bullet}$ is non-nilpotent, per Corollary 2.9 above.

2.3. Leibman sequences in PET structures

Definition 2.11 (Discrete difference and abstract Leibman sequence).

Let $\mathscr{M}$ be a PET structure over $\mathbb{Z}$ . The discrete-difference operator is the function $\Delta^{\!\bullet}:\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}\to\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ uniquely characterized by the identity

[TABLE]

Alternatively, for $i\in\mathcal{Z}$ , the left evaluation at $i$ of $\Delta^{\!\bullet}T$ is $\Delta^{\!i}T=\prescript{}{i}{T}\circ T^{*}$ .

Let $\mathbbm{I}=I(\blacksquare)$ denote the constant family $i\mapsto I$ in $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ . Given $d\in\mathbb{N}$ , a unitary Leibman sequence of degree at most $d$ is any $T\in\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ satisfying

[TABLE]

that takes values in $\mathrm{U}_{\mathcal{H}}$, i.e., also satisfying $T^{*}\circ T=\mathbbm{I}=T\circ T^{*}$. A Leibman sequence is a Leibman sequence of some degree $d$; its degree $\deg_{\mathrm{L}}\!T$ is the least such $d$. (We define formally $\deg_{\mathrm{L}}\!\mathbbm{I}=-\infty$.)

Remarks 2.12.

•

Note that abstract Leibman sequences per Definition 2.11 are “internal”, i.e., obtained from elements of $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ —that are otherwise only incidentally regarded as functions $\mathcal{Z}\to\mathfrak{B}$ via evaluation; they may be regarded as taking values in the group

[TABLE]

of (internal) unitary transformations of the Hilbert space $\mathcal{H}$ . (For this reason, we sometimes refer to these Leibman sequences as unitary.) Accordingly, the defining property of a Leibman sequence amounts to the requirement that $d+1$ discrete-difference operations, possibly involving nonstandard elements $j\in\mathcal{Z}$ , always transform $T$ into $\mathbbm{I}$ , i.e., the constant function $i\mapsto I$ for all $i\in\mathcal{Z}$ , not merely for all $i\in\mathbb{Z}$ . The proofs of Theorems 2 and 3 below crucially depend on the richer structure of $\mathcal{Z}$ in saturated PET structures—even if ultimately the results are valid in all PET structures, including classical ones whose Leibman sequences are bona fide functions $\mathbb{Z}\to\mathrm{U}_{\mathcal{H}}\subset\mathfrak{B}$ .

•

We have $\Delta^{\!\bullet}T=\prescript{}{\bullet}{T}\circ(T^{*}_{\blacksquare,\cdot})$ ; hence, discrete differentiation is obtained from the $\mathbb{Z}$ -action on $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ , the right inclusion $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}\hookrightarrow\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ , plus the pointwise operations of composition and taking adjoint; thus, $\Delta^{\!\bullet}$ may as well be regarded as a distinguished function of any PET structure $\mathscr{M}$ .

• The predicate “ $T$ is a unitary Leibman sequence of degree at most $d$ ” is captured by a single Henson formula $\lambda_{d}(T)$ [Footnote 7: Expressions of the type $x=y$ such as those in (2.11) are not Henson formulas sensu stricto, but may be regarded as abbreviations of formulas $\mathrm{d}(x,y)\leq 0$ (or $\left\|y-x\right\|\leq 0$ in Banach sorts).], namely

[TABLE]

• Since any group $G$ is realized as a subgroup of a suitable unitary group $\mathrm{U}_{\mathcal{H}}$ [Footnote 8: One may identify $G$ with its faithful homomorphic image under the translation action $G\curvearrowright\mathscr{L}^{2}(G)$ , which realizes $G$ as a group of unitary transformations of $\mathcal{H}=\mathscr{L}^{2}(G)$ .], it follows that any classical Leibman sequence is realized as a Leibman sequence in a classical PET structure. In view of Proposition 2.8 and Corollary 2.9, we have instances of pointwise ergodic convergence per Theorem 2 in the setting of quadratic Leibman sequences $(T_{n})$ of unitary operators generating a non-nilpotent group of unitary transformations. To our knowledge, this is the first explicit example of pointwise convergence of averages of a non-nilpotent (in fact, not even virtually nilpotent [Footnote 9: A group is virtually nilpotent if it has a finite-index nilpotent subgroup.]) family of unitary transformations.

3. An ergodic theorem for unitary polynomial actions of $\mathbb{Z}$

Through the end of this section, $\mathcal{L}$ will be the Henson language for PET structures over $\mathbb{Z}$ . All structures will be in the class $\overline{\mathbf{PET}}$ of abstract PET structures over $\mathbb{Z}$ .

3.1. The sequence of ergodic averages

Convention 3.1.

Henceforth, the standalone symbols $\mathbb{R},\mathbb{Z},\mathbb{N}$ shall denote the usual sets of real, integer and natural numbers. If $\mathscr{M}$ is a PET structure over $\mathbb{Z}$ , we shall use interpretation of constants (and the density of $\mathbb{Q}$ in $\mathbb{R}$ ) to identify $\mathbb{R}$ with the sort $\mathbb{R}^{\mathscr{M}}$ , and also $\mathbb{Z}$ and $\mathbb{N}$ with subsets of $\mathcal{Z}=\mathbb{Z}^{\mathscr{M}}$ and $\mathcal{N}=\mathbb{N}^{\mathscr{M}}$ , respectively. [Footnote 10: The identification of $\mathbb{N}$ with a subset of $\mathbb{Z}$ is neither necessary nor beneficial. Theorem 4 below considers ergodic averages relative to Følner nets indexed by any countable directed set $\mathbb{D}$ (in place of $\mathbb{N}$ ) over an arbitrary abelian group $\mathbb{G}$ (in place of $\mathbb{Z}$ ).] By an abuse of notation, when the structure $\mathscr{M}$ is clear from context, we may omit the superscript and write $\mathcal{H}$ , $\mathfrak{B}$ , $\mathcal{A}_{\mathbb{Z}}$ , $\mathscr{L}^{\infty}_{\mathbb{Z},\mathbb{R}}$ , … to denote the sorts $\mathcal{H}^{\mathscr{M}}$ , $\mathfrak{B}^{\mathscr{M}}$ , $\mathcal{A}_{\mathbb{Z}}^{\mathscr{M}}$ , $(\mathscr{L}^{\infty}_{\mathbb{Z},\mathbb{R}})^{\mathscr{M}}$ , … of $\mathscr{M}$ .

Definition 3.2 (Ergodic averages).

Let $\mathscr{M}$ be a PET structure over $\mathbb{Z}$ and let $T\in\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ . Via the evaluation $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}\times\mathcal{Z}\to\mathfrak{B}$ , one may regard $T$ as a function $i\mapsto T_{i}$ . For $n\in\mathcal{N}$ , the $n$ -th average of $T$ is

[TABLE]

The sequence of averages of $T$ is $\operatorname{AV}_{\bullet}T=(\operatorname{AV}_{n}T:n\in\mathbb{N})$ .

Similarly, for $x\in\mathcal{H}$ , the $n$ -th average of $x$ under $T$ is $\operatorname{AV}_{n}\!T(x)$ . The sequence of averages of $x$ under $T$ is $\operatorname{AV}_{\bullet}\!T(x)=(\operatorname{AV}_{n}\!T(x):n\in\mathbb{N})$ .

We remark that $\operatorname{Th}_{\mathbf{PET}}$ ensures the validity of the identities

[TABLE]

for (standard) $n\in\mathbb{N}$ ; however, averages as defined above are non-classical if $n\in\mathcal{N}\setminus\mathbb{N}$ . On the other hand, the sequence $(\operatorname{AV}_{n}T:n\in\mathbb{N})$ has classical terms, and the study of its convergence is purely classical a priori.
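For standard $n$ the averages can be computed directly. The following Python sketch does so in the simplest classical setting, a one-dimensional Hilbert space, where a unitary operator is just a complex number of modulus $1$ . It assumes the normalization $\operatorname{AV}_{n}T=\frac{1}{n+1}\sum_{i=0}^{n}T_{i}$ , which matches the intervals $\{0,1,\dots,n\}$ used in the proof of Lemma 3.4; the names `ergodic_average` and `quadratic_phase` are hypothetical and chosen only for illustration.

```python
import cmath
import math


def ergodic_average(T, n):
    """Return (1/(n+1)) * sum_{i=0}^{n} T(i) for a scalar-valued sequence T."""
    return sum(T(i) for i in range(n + 1)) / (n + 1)


def quadratic_phase(theta):
    """Scalar unitary sequence k -> exp(2*pi*i*theta*k^2) (Leibman degree at most 2)."""
    return lambda k: cmath.exp(2j * math.pi * theta * k * k)
```

For instance, `abs(ergodic_average(quadratic_phase(math.sqrt(2)), n))` can be tabulated for increasing `n`; by Weyl's equidistribution theorem these averages tend to $0$ when `theta` is irrational, but a finite computation only suggests, and does not prove, convergence.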

Theorem 2 (Poly-MET/ $\mathbb{Z}$ : Mean Ergodic Theorem for unitary polynomial actions of $\mathbb{Z}$ ).

Let $\mathscr{M}$ be a PET structure over $\mathbb{Z}$ , and let $T\in(\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}})^{\mathscr{M}}$ be a Leibman sequence of unitary operators on the Hilbert space $\mathcal{H}=\mathcal{H}^{\mathscr{M}}$ . For every $x\in\mathcal{H}$ , the sequence $\operatorname{AV}_{\bullet}T(x)=(\operatorname{AV}_{n}T(x):n\in\mathbb{N})$ of averages of $x$ under $T$ converges in the norm topology of $\mathcal{H}$ .

Theorem 2 admits the following uniformly metastable strengthening.

Theorem 3 (Metastable Poly-MET/ $\mathbb{Z}$ ).

Fix $d\in\mathbb{N}$ . There exists a universal metastability rate $E_{\bullet}^{d}$ , depending only on $d$ , that applies uniformly to all sequences $\operatorname{AV}_{\bullet}T(x)$ of averages of arbitrary $x$ in the unit ball of the Hilbert-space sort $\mathcal{H}$ under any Leibman sequence $T$ in $\mathrm{U}_{\mathcal{H}}$ of degree at most $d$ in any PET structure $\mathscr{M}$ over $\mathbb{Z}$ .

The rest of this section is devoted to proving Theorems 2 and 3.

3.2. Proof preliminaries

Lemma 3.3 (Dominated Convergence Theorem in PET structures).

Let $\mathcal{L}$ be the language of PET structures. Let $\mathscr{M}$ be any saturated PET structure. Let $\varphi_{\bullet}=(\varphi_{n}\colon n\in\mathbb{N})$ be a bounded sequence in $\mathscr{L}^{\infty}_{\mathbb{Z},\mathcal{H}}$ . For all $x\in\mathbb{Z}^{\mathscr{M}}$ assume that the sequence $\varphi_{\bullet}(x)=(\varphi_{n}(x):n\in\mathbb{N})$ in $\mathcal{H}^{\mathscr{M}}$ is convergent. Then, for arbitrary $\mu\in\mathfrak{M}^{\mathscr{M}}$ , the sequence $\langle{\varphi_{\bullet}},{\mu}\rangle=\big{(}\langle{\varphi_{n}},{\mu}\rangle:n\in\mathbb{N})$ in $\mathcal{H}^{\mathscr{M}}$ is convergent.

(We will only require the special case of Lemma 3.3 in which $\varphi_{\bullet}$ is of the form $n\mapsto\operatorname{AV}_{n}\!T$ with $T\in\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ .)

Proof.

A saturated PET structure $\mathscr{M}$ is a Banach integration framework (with Banach sort $\mathcal{H}^{\mathscr{M}}$ and measure-space sort $\mathbb{Z}^{\mathscr{M}}$ ) as defined in Appendix A.4. Thus, Lemma 3.3 follows from Theorem 5 whose statement and proof are in Appendix A.5. ∎

To state the next lemma we need a definition. Let the reverse difference operator $\nabla^{\bullet}:\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}\to\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ be the mapping $T\mapsto\nabla^{\bullet}T$ characterized by the property that $\nabla^{\bullet}T$ evaluates to the function $(i,j)\mapsto T_{j}\circ T^{*}_{i+j}$ . We write $\nabla^{i}T$ to denote the left evaluation of $\nabla^{\bullet}T$ , i.e., $\nabla^{i}T$ evaluates to the mapping $j\mapsto T_{j}\circ T^{*}_{i+j}$ . Note that $\nabla^{i}T$ is the translate by $i$ of the (forward) difference of $T$ with step $-i$ , i.e., $\nabla^{i}T=\prescript{}{i}{\Delta}^{\![-i]}T$ holds for all $i\in\mathcal{Z}$ . Just like the forward difference operator $\Delta^{\!\bullet}$ , the reverse difference operator $\nabla^{\bullet}$ is explicitly definable in any PET structure since it is obtained by composing functions of the structure (the $\mathbb{Z}$ -action $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}\to\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ , the shear map on $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ , the pointwise adjoint operation $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}\to\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ , the left inclusion $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}\hookrightarrow\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ , and the pointwise composition $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}\times\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}\to\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ ); thus, $\nabla^{\bullet}$ may as well be considered a distinguished function of any PET structure.

Lemma 3.4.

Let $\mathscr{M}$ be a PET structure such that $\mathcal{N}=\mathbb{N}^{\mathscr{M}}$ contains a nonstandard natural number $M\in\mathcal{N}\setminus\mathbb{N}$ . For every standard natural $n\in\mathbb{N}$ and $T\in\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ :

[TABLE]

In less cryptic notation, the equation above reads:

[TABLE]

Every step of the proof below is justified by an axiom of $\operatorname{Th}_{\mathbf{PET}}$ . We prefer to use informal integral notation to make the argument transparent.

Proof.

Note that $(\operatorname{AV}_{\!M}T)^{*}=\operatorname{AV}_{\!M}(T^{*})$ (in fact, $\langle{T},{\mu}\rangle^{*}=\langle{T^{*}},{\mu}\rangle$ for all $\mu\in\mathfrak{M}$ ). For $m,n\in\mathbb{N}$ we have:

[TABLE]

where $\prescript{}{j}{(}\sigma_{m})$ is the translation of $\sigma_{m}$ by $j$ . Since $\left\|\nabla^{i}T_{j}\right\|=\left\|\prescript{}{i}{\Delta}^{\![-i]}T_{j}\right\|\leq\left\|T\right\|^{2}$ for all $i,j$ :

[TABLE]

Given fixed $\epsilon>0$ and $n\in\mathbb{N}$ , let $m_{\epsilon,n}$ be the smallest natural number satisfying $m_{\epsilon,n}\geq 2n/\epsilon$ . Clearly, $\|\sigma_{m}-\prescript{}{j}{(}\sigma_{m})\|\leq 2n/(m+1)\leq\epsilon$ for $0\leq j\leq n\leq m+1$ whenever $m\geq m_{\epsilon,n}$ . (For $m$ large and $n$ small, this inequality captures the “approximate invariance” of the long interval $\{0,1,\dots,m\}$ of $\mathbb{Z}$ under small translations, i.e., the Følner property of the collection of such intervals.) Then we have

[TABLE]

Since $M\in\mathcal{N}\setminus\mathbb{N}$ satisfies $M\geq m_{\epsilon,n}$ for all $n\in\mathbb{N}$ and $\epsilon>0$ , the assertion in Lemma 3.4 follows. ∎
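The Følner estimate used in the proof can be checked exactly for standard $m$ . The sketch below assumes that $\sigma_{m}$ is the uniform probability measure on $\{0,1,\dots,m\}$ and that the norm on measures is the total-variation ( $\ell^{1}$ ) norm, so that $\|\sigma_{m}\|=1$ ; all function names are hypothetical.

```python
import math
from fractions import Fraction


def sigma(m):
    """Uniform probability measure on {0, ..., m}, stored as a dict point -> mass."""
    return {i: Fraction(1, m + 1) for i in range(m + 1)}


def translate(measure, j):
    """Translate a finitely supported measure by j (the sign convention does not affect the norm)."""
    return {i + j: w for i, w in measure.items()}


def total_variation(mu, nu):
    """l^1 distance between two finitely supported measures."""
    support = set(mu) | set(nu)
    return sum(abs(mu.get(i, 0) - nu.get(i, 0)) for i in support)


def smallest_m(eps, n):
    """Smallest natural number m with m >= 2n/eps, for a positive rational eps."""
    return math.ceil(Fraction(2 * n) / Fraction(eps))
```

With exact rational arithmetic, `total_variation(sigma(m), translate(sigma(m), j))` equals $2j/(m+1)$ for $0\leq j\leq m+1$ , which is at most $\epsilon$ once $m\geq m_{\epsilon,n}$ and $j\leq n$ .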

We offer some remarks on the proof of Lemma 3.4 above, which is the crux of our approach to proving Theorem 2. Despite its rather short length, it sheds light on the various sorts and distinguished functions in PET structures. There are no double integrals as such but rather iterated integrals $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}\to\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ and $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}\to\mathfrak{B}$ —the order of the integration (first on the left and then on the right variable, or vice versa) is immaterial as ensured by the PET axiom

[TABLE]

The validity of the substitution $i\mapsto i+j$ in the inner integral is justified by the compatibility between the shear transformation on $\mathscr{L}^{\infty}_{\mathbb{Z}^{2},\mathfrak{B}}$ and the action of $\mathbb{Z}$ on $\mathfrak{M}$ :

[TABLE]

Other steps in the proof admit similar formal justifications by axioms of PET structures.
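A finite analogue of Lemma 3.4 can be tested numerically. The sketch below works with scalar unitary sequences (one-dimensional $\mathcal{H}$ ), replaces the nonstandard $M$ by a large standard $m$ , and uses $(\operatorname{AV}_{m}T)^{*}=\operatorname{AV}_{m}(T^{*})$ together with $\nabla^{i}T_{j}=T_{j}\circ T^{*}_{i+j}$ . Under these assumptions the two sides of the lemma differ by at most $2n/(m+1)$ , which is the Følner estimate from the proof; the function names are hypothetical.

```python
import cmath
import math


def average(f, n):
    """(1/(n+1)) * sum_{k=0}^{n} f(k)."""
    return sum(f(k) for k in range(n + 1)) / (n + 1)


def reverse_difference(T, i):
    """Scalar version of nabla^i T, namely j -> T_j * conj(T_{i+j})."""
    return lambda j: T(j) * T(i + j).conjugate()


def lemma_3_4_defect(T, n, m):
    """|AV_n T * conj(AV_m T) - (average over 0 <= i <= m of AV_n(nabla^i T))|."""
    lhs = average(T, n) * average(T, m).conjugate()
    rhs = average(lambda i: average(reverse_difference(T, i), n), m)
    return abs(lhs - rhs)
```

The defect shrinks as `m` grows with `n` fixed, which is the finite shadow of the exact identity obtained at a nonstandard $M$ .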

Lemma 3.5.

Let $\mathscr{M}$ be a saturated Henson structure with an ordered sort $(\mathcal{N},\leq)$ extending $(\mathbb{N},\leq)$ , and let $\varphi_{\bullet}$ be a sequence explicitly defined by an $\mathcal{L}[S]$ -term $\varphi(\mathtt{n})$ of sort $s$ (i.e., $\varphi_{\bullet}$ is the sequence $(\varphi(n):n\in\mathbb{N})$ in $\mathscr{M}$ , where $\mathtt{n}$ is a variable of sort $\mathcal{N}$ and $S$ is some set of parameters of the universe of $\mathscr{M}$ ). Then every subsequential limit of $\varphi_{\bullet}$ is of the form $\varphi(M)$ for some $M\in\mathcal{N}\setminus\mathbb{N}$ . If $\varphi(M)=\varphi(N)$ for all $M,N\in\mathcal{N}\setminus\mathbb{N}$ , then the sequence $\varphi_{\bullet}=(\varphi(n):n\in\mathbb{N})$ converges. In that case, the common value $\varphi(M)$ is the limit $\lim_{n\to\infty}\varphi(n)$ .

The proof of Lemma 3.5 is a routine application of saturation left to the reader.

Lemma 3.6.

Let $\mathcal{X}$ , $\mathcal{Y}$ be metric spaces, and let $\varphi:(x,n)\mapsto\varphi_{n}(x)$ be a function from $\mathcal{X}\times\mathbb{N}$ to $\mathcal{Y}$ such that $\varphi_{n}(\cdot):\mathcal{X}\to\mathcal{Y}$ is $1$ -Lipschitz for each $n\in\mathbb{N}$ (i.e., $d(\varphi_{n}(x),\varphi_{n}(y))\leq d(x,y)$ for $x,y\in\mathcal{X}$ ). Let $S$ be a dense subset of $\mathcal{X}$ such that $\varphi_{\bullet}(x)$ converges for all $x\in S$ . Then $\varphi_{\bullet}(x)$ converges for all $x\in\mathcal{X}$ .

We omit the straightforward proof of Lemma 3.6.

3.3. Proof of Theorem 2

For each fixed Leibman degree $d\in\mathbb{N}$ , we first prove Theorem 2 for unitary Leibman sequences of degree at most $d$ in any saturated PET structure $\mathscr{M}$ . The descent argument on the degree $d$ is characteristic of Bergelson’s PET induction [Ber87].

The assertion is trivial for $T=\mathbbm{I}$ . If $T$ (is pointwise unitary and) has Leibman degree $\deg_{\mathrm{L}}T=0$ , we have $T_{i}\circ T^{*}_{0}=\Delta^{\!i}T_{0}=I$ , hence $T_{i}=T_{0}$ for all $i\in\mathcal{Z}$ , so $T$ is constant. Thus, the sequences $\operatorname{AV}_{\bullet}T$ and $\operatorname{AV}_{\bullet}T(x)$ are also constant (all terms are equal to $T_{0}$ and $T_{0}(x)$ , respectively), so Theorem 2 follows for Leibman polynomials of degree at most $0$ .

Assume now that the assertion in Theorem 2 is proved for all $T\in\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ having Leibman degree less than some positive integer $d$ . Fix $T\in\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ with $\deg_{\mathrm{L}}(T)=d$ .

Lemma 3.7.

If $x=(\operatorname{AV}_{\!M}\!T)^{*}(y)$ for some $M\in\mathcal{N}\setminus\mathbb{N}$ and $y\in\mathcal{H}$ , then $\operatorname{AV}_{\bullet}T(x)$ converges.

Proof.

Note that $\operatorname{AV}_{\bullet}T(x)$ is bounded by $\left\|T\right\|\left\|x\right\|$ . By Lemma 3.4, we have $\operatorname{AV}_{n}T(x)=\operatorname{AV}_{n}T\circ\operatorname{AV}_{\!M}T^{*}(y)=\langle{\llangle{\nabla^{\bullet}T(y)},{\sigma_{n}}\rrangle},{\sigma_{M}}\rangle$ for $n\in\mathbb{N}$ . For $i\in\mathcal{Z}$ we have $\nabla^{i}T=\prescript{}{i}{\Delta}^{\![-i]}T$ . By the invariance of Leibman degree under translation and the assumption $\deg_{\mathrm{L}}(T)\leq d$ , we have $\deg_{\mathrm{L}}(\nabla^{i}T)=\deg_{\mathrm{L}}(\prescript{}{i}{\Delta}^{\![-i]}T)<d$ , and hence $\operatorname{AV}_{\bullet}(\nabla^{i}T)(y)$ is convergent for all $i\in\mathcal{Z}$ by the inductive hypothesis. An application of Lemma 3.3 concludes the proof. ∎

The space $\operatorname{Struct}$ of structured elements of $\mathcal{H}$ (relative to $T$ ) is the closure of the linear span of all elements of the form $(\operatorname{AV}_{\!M}\!T)^{*}(y)$ for $M\in\mathcal{N}\setminus\mathbb{N}$ and $y\in\mathcal{H}$ . By linearity and Lemmas 3.6 & 3.7, $\operatorname{AV}_{\bullet}T(x)$ converges for all structured elements $x$ . (The $1$ -Lipschitz condition follows from the inequalities $\|\!\operatorname{AV}_{n}\!T(x)-\operatorname{AV}_{n}\!T(y)\|\leq\|\!\operatorname{AV}_{n}\!T\|\|y-x\|$ and $\|\!\operatorname{AV}_{n}\!T\|=\|\!\left\langle T,\sigma_{n}\right\rangle\!\|\leq\|T\|\cdot\|\sigma_{n}\|=1\cdot 1=1$ .)

The space $\operatorname{Rand}$ of pseudorandom elements of $\mathcal{H}$ (relative to $T$ ) is the orthogonal complement of $\operatorname{Struct}$ in $\mathcal{H}$ . By the fundamental theorem of linear algebra and the definition of structured elements, an element $x\in\mathcal{H}$ is pseudorandom precisely when $\operatorname{AV}_{\!M}\!T(x)=0$ for all $M\in\mathcal{N}\setminus\mathbb{N}$ . By Lemma 3.5, $\operatorname{AV}_{\bullet}T(x)$ converges to zero in this case.

Combining the pseudorandom and structured cases using linearity, we deduce that all averages $\operatorname{AV}_{\bullet}\!T(x)$ converge. This concludes the inductive step of the proof of Theorem 2 in any saturated PET structure $\mathscr{M}$ .

To conclude the proof for any PET structure over $\mathbb{Z}$ , let $\mathscr{M}$ be any (not necessarily saturated) PET structure, and let $\widetilde{\mathscr{M}}$ be a saturated elementary $\mathcal{L}$ -extension of $\mathscr{M}$ in Henson’s logic. For fixed degree $d\in\mathbb{N}$ and $T\in(\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}})^{\mathscr{M}}$ , the property that $T$ is a unitary Leibman sequence of degree at most $d$ is $\mathcal{L}$ -axiomatizable, hence it is true in $\widetilde{\mathscr{M}}$ when $T$ is regarded as an element of $(\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}})^{\widetilde{\mathscr{M}}}$ . For $x\in\mathcal{H}^{\widetilde{\mathscr{M}}}$ we have proved that $\operatorname{AV}_{\bullet}\!T(x)$ converges since $\deg_{\mathrm{L}}(T)\leq d$ . A fortiori, $\operatorname{AV}_{\bullet}\!T(x)$ converges for $x\in\mathcal{H}^{\mathscr{M}}$ . This concludes the proof of Theorem 2 in full generality.

3.4. Proof of Theorem 3

Let $\widetilde{\mathcal{L}}$ expand the language $\mathcal{L}$ of PET structures with new constants $(\mathtt{T},\mathtt{x},\mathtt{y}_{n}:n<\omega)$ , with $\mathtt{T}$ of sort $\mathscr{L}^{\infty}_{\mathbb{Z},\mathfrak{B}}$ , and $\mathtt{x}$ and all $\mathtt{y}_{n}$ of sort $\mathcal{H}$ . For fixed $d\in\mathbb{N}$ , consider the $\widetilde{\mathcal{L}}$ -theory

[TABLE]

where $\lambda_{d}(\mathtt{T})$ is formula (2.11) stating that the interpretation of $\mathtt{T}$ is Leibman of degree at most $d$ . Note that $\Lambda_{d}$ is a uniform theory: $\lambda_{d}(\mathtt{T})$ implies $\|\mathtt{T}\|\leq 1$ , hence also $\left\|\mathtt{y}_{n}\right\|\leq 1$ . Every model $\widetilde{\mathscr{M}}$ of $\Lambda_{d}$ is an expansion of a PET structure $\mathscr{M}$ having the form $\widetilde{\mathscr{M}}=(\mathscr{M},T,x,\operatorname{AV}_{\bullet}T(x))$ . By Theorem 2, all sequences $(\mathtt{y}_{n}^{\widetilde{\mathscr{M}}})=\operatorname{AV}_{\bullet}T(x)$ are convergent. An application of Proposition A.10 finishes the proof of Theorem 3.

3.5. Proof of Theorem 1

A classical Leibman sequence $T_{\bullet}=(T_{k}:k\in\mathbb{Z})$ with $\deg_{\mathrm{L}}(T)\leq d$ in an abelian subgroup $K$ of the group $\mathrm{U}_{\mathcal{H}}$ of unitary operators on a Hilbert space $\mathcal{H}$ is easily shown to have the form $T_{k}=U_{0}\circ U_{1}^{k}\circ U_{2}^{k\choose 2}\circ\dots\circ U_{d}^{k\choose d}$ where $U_{j}=\Delta^{j}T_{0}$ and ${k\choose j}=k(k-1)\dots(k-j+1)/j!$ are binomial coefficients for $j=0,1,\dots,d$ . Since the functions $k\mapsto{k\choose j}$ ( $j=0,1,\dots,d$ ) are a $\mathbb{Z}$ -basis for polynomial mappings $p:\mathbb{Z}\to\mathbb{Z}$ of degree at most $d$ , Theorem 1 is an immediate corollary of Theorems 2 and 3.
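The claim that the binomial functions $k\mapsto{k\choose j}$ form a $\mathbb{Z}$ -basis for integer-valued polynomials can be made concrete with Newton's forward-difference formula $p(k)=\sum_{j}(\Delta^{j}p)(0){k\choose j}$ . The following Python sketch computes these integer coordinates; it is an illustration with hypothetical function names, not part of the proof.

```python
from math import factorial


def binom(k, j):
    """Generalized binomial coefficient k(k-1)...(k-j+1)/j! for any integer k and j >= 0."""
    product = 1
    for t in range(j):
        product *= k - t
    # Exact division: j! divides any product of j consecutive integers.
    return product // factorial(j)


def binomial_coordinates(p, d):
    """Integers c_0..c_d with p(k) = sum_j c_j * binom(k, j), via forward differences at 0."""
    values = [p(k) for k in range(d + 1)]
    coords = []
    for _ in range(d + 1):
        coords.append(values[0])
        values = [b - a for a, b in zip(values, values[1:])]
    return coords


def evaluate(coords, k):
    """Evaluate sum_j coords[j] * binom(k, j)."""
    return sum(c * binom(k, j) for j, c in enumerate(coords))
```

For example, $p(k)=k^{2}$ has coordinates $(0,1,2)$ , i.e., $k^{2}={k\choose 1}+2{k\choose 2}$ , so in the commutative setting $U^{k^{2}}=U^{{k\choose 1}}\circ(U^{2})^{{k\choose 2}}$ has the displayed form with $U_{0}=I$ , $U_{1}=U$ and $U_{2}=U^{2}$ .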

4. A Mean Ergodic Theorem for unitary polynomial actions of abelian groups

To formulate our most general result on convergence of averages, we replace $\mathbb{N}$ with an arbitrary countable directed set $(\mathbb{D},\leq)$ and $\mathbb{Z}$ with an arbitrary abelian group $(\mathbb{G},+)$ endowed with a countable Følner $\mathbb{D}$ -net $\mathcal{F}_{\bullet}=(\mathcal{F}_{j}:j\in\mathbb{D})$ of nonempty finite subsets of $\mathbb{G}$ . These assumptions are sufficient to ensure that the proofs of natural generalizations of Theorems 2 and 3 carry through in this more general context, mutatis mutandis, from those given in Section 3.

Theorem 4 (Poly-MET: Mean Ergodic Theorem for unitary polynomial actions of an abelian group).

Fix an abelian group $(\mathbb{G},+)$ and a Følner net $\mathcal{F}_{\bullet}=(\mathcal{F}_{j}:j\in\mathbb{D})$ of subsets of $\mathbb{G}$ , indexed by a countable directed set $(\mathbb{D},\leq)$ . Let $\mathcal{H}$ be a Hilbert space. Let $T:\mathbb{G}\to\mathrm{U}_{\mathcal{H}}$ be a polynomial mapping, in Leibman’s sense, into the group $\mathrm{U}_{\mathcal{H}}$ of unitary transformations of $\mathcal{H}$ . For every $x\in\mathcal{H}$ , the $\mathbb{D}$ -net $\operatorname{AV}_{\bullet}T(x)=(\operatorname{AV}_{i}T(x):i\in\mathbb{D})$ in $\mathcal{H}$ , of averages relative to $\mathcal{F}_{\bullet}$ :

[TABLE]

of $x$ under $T$ , converges in the norm topology of $\mathcal{H}$ .

In fact, given fixed choices of $\mathbb{G}$ , $\mathbb{D}$ , $\mathcal{F}_{\bullet}$ and $d\in\mathbb{N}$ , there exists a rate of metastability

[TABLE]

(with $E_{\epsilon,\eta}\in\mathcal{P}^{*}_{\!\mathrm{fin}}(\mathbb{D})$ for each $\epsilon,\eta$ ) that applies universally to all sequences $\operatorname{AV}_{\bullet}T(x)$ for any element $x$ in the unit ball of any Hilbert space $\mathcal{H}$ and any Leibman polynomial $T:\mathbb{G}\to\mathrm{U}_{\mathcal{H}}$ of degree at most $d$ .

Remark 4.1.

The definition of Leibman polynomial mapping $T:\mathbb{G}\to\mathrm{U}_{\mathcal{H}}$ is a straightforward generalization of that of Leibman sequence $\mathbb{Z}\to\mathrm{U}_{\mathcal{H}}$ [Lei02]. The discrete difference $\Delta^{\!g}T$ of $T$ with step $g\in\mathbb{G}$ is the mapping $h\mapsto T_{g+h}\circ T^{*}_{h}$ . Define $\deg_{\mathrm{L}}(T)\leq 0$ if $\Delta^{\!g}T=\mathbbm{I}$ for all $g\in\mathbb{G}$ (where $\mathbbm{I}$ is the constant mapping $g\mapsto I$ ). Recursively, let $\deg_{\mathrm{L}}(T)\leq d+1$ mean that $\deg_{\mathrm{L}}(\Delta^{\!g}T)\leq d$ for all $g\in\mathbb{G}$ . Then $T$ is a Leibman mapping if $\deg_{\mathrm{L}}(T)\leq d$ for some $d$ ; the least such $d$ is $\deg_{\mathrm{L}}(T)$ , although we adopt the convention $\deg_{\mathrm{L}}(\mathbbm{I})=-\infty$ .
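The recursive definition of Leibman degree can be explored numerically. The sketch below treats the scalar case $\mathbb{G}=\mathbb{Z}$ , $\mathcal{H}=\mathbb{C}$ , where unitary operators are complex numbers of modulus $1$ and $\Delta^{\!g}T$ is $h\mapsto T_{g+h}\overline{T_{h}}$ . It only tests the defining identities on a finite window of steps and base points, so a positive answer is evidence rather than proof; the names `difference`, `has_degree_at_most` and `polynomial_phase` are hypothetical.

```python
import cmath
import math
from itertools import product


def difference(T, g):
    """Delta^g T : h -> T_{g+h} * conj(T_h) (scalar unitary case)."""
    return lambda h: T(g + h) * T(h).conjugate()


def has_degree_at_most(T, d, window=range(-3, 4), tol=1e-9):
    """Check on a finite window that every (d+1)-fold difference of T is identically 1."""
    for steps in product(window, repeat=d + 1):
        S = T
        for g in steps:
            S = difference(S, g)
        if any(abs(S(h) - 1) > tol for h in window):
            return False
    return True


def polynomial_phase(theta, degree):
    """Scalar unitary sequence k -> exp(2*pi*i*theta*k^degree)."""
    return lambda k: cmath.exp(2j * math.pi * theta * k ** degree)
```

For the quadratic phase $T_{k}=e^{2\pi i\theta k^{2}}$ one finds $\Delta^{\!g'}\Delta^{\!g}T\equiv e^{4\pi i\theta gg'}$ , a constant, so three differences always give $1$ while two in general do not.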

We only provide an outline of the proof of Theorem 4 since it is formally identical to the arguments in Sections 3.2–3.4. The definition of classical PET structure over $\mathbb{G}$ is completely analogous to that of PET structure over $\mathbb{Z}$ in Section 1.1: simply replace all instances of $\mathbb{Z}$ by $\mathbb{G}$ and those of $\mathbb{N}$ by $\mathbb{D}$ . The Følner net $\mathcal{F}_{\bullet}$ is captured indirectly via the Følner measure map $\sigma:\mathbb{D}\to\mathfrak{M}$ , where

[TABLE]

The Henson language $\mathcal{L}$ for the class of classical PET structures over $\mathbb{G}$ is clear. Any model of the theory $\operatorname{Th}_{\mathbf{PET}}^{\mathbb{D},\mathbb{G}}$ of such classical structures is an (abstract) PET structure over $\mathbb{G}$ . (Note that the language $\mathcal{L}$ depends on both $\mathbb{G}$ and $\mathbb{D}$ ; the theory $\operatorname{Th}_{\mathbf{PET}}^{\mathbb{D},\mathbb{G}}$ further depends on the choice of the Følner net $\mathcal{F}_{\bullet}$ .)

Analogues of Theorems 2 and 3 hold in the class of all PET structures over $\mathbb{G}$ . The scheme of proof is exactly the same. The countability of $\mathbb{D}$ is an essential hypothesis in Theorem 5, which enters the proof via an analogue of Lemma 3.3.

The analogue of Lemma 3.4 uses the same definition of reverse difference $(\nabla^{g}T)_{h}=T_{h}\circ T^{*}_{g+h}$ . Its proof is adapted using the definition of Følner net as we now indicate: By definition, given $\epsilon>0$ and $g\in\mathbb{G}$ there is $k=k_{g,\epsilon}\in\mathbb{D}$ such that the symmetric difference $\mathcal{F}_{l}\triangle(g+\mathcal{F}_{l})$ has cardinality at most $\epsilon\cdot\#\mathcal{F}_{l}$ for all $l\geq k$ . Thus, letting $K_{i,\epsilon}=\max\{k_{g,\epsilon}:g\in\mathcal{F}_{i}\}$ , we have $\|\sigma_{j}-\prescript{}{g}{(}\sigma_{j})\|\leq\epsilon$ for all $g\in\mathcal{F}_{i}$ provided $j\geq K_{i,\epsilon}$ . This yields an analogue of Lemma 3.4 stating that $(\operatorname{AV}_{j}T)\circ(\operatorname{AV}_{K}T)^{*}=\int\operatorname{AV}_{\!j}(\nabla^{g}T)\,d\sigma_{K}(g)$ holds whenever $j\in\mathbb{D}$ and $K\in\mathbb{D}^{\mathscr{M}}$ satisfies $K\geq i$ for all $i\in\mathbb{D}$ .
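As a concrete instance of the Følner condition, the sketch below takes $\mathbb{G}=\mathbb{Z}^{2}$ , $\mathbb{D}=\mathbb{N}$ and the square boxes $\mathcal{F}_{l}=\{0,\dots,l\}^{2}$ . These choices are an assumption made purely for illustration, and the function names are hypothetical. The computed ratio $\#(\mathcal{F}_{l}\triangle(g+\mathcal{F}_{l}))/\#\mathcal{F}_{l}$ coincides with $\|\sigma_{l}-\prescript{}{g}{(}\sigma_{l})\|$ when $\sigma_{l}$ is the uniform probability measure on $\mathcal{F}_{l}$ .

```python
from fractions import Fraction


def box(l):
    """F_l = {0..l} x {0..l}, a finite subset of the abelian group Z^2."""
    return {(a, b) for a in range(l + 1) for b in range(l + 1)}


def translate(F, g):
    """The translate g + F of a finite subset F of Z^2."""
    return {(a + g[0], b + g[1]) for (a, b) in F}


def invariance_defect(F, g):
    """#(F symmetric-difference (g + F)) / #F as an exact fraction."""
    return Fraction(len(F ^ translate(F, g)), len(F))
```

For fixed `g` the defect decreases to $0$ as `l` grows, so for each $\epsilon>0$ a threshold $k_{g,\epsilon}$ as in the text can be found by direct search.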

Lemma 3.5 continues to hold provided one replaces the nonstandard natural numbers $M,N$ with nonstandard elements $J,K$ of $\mathbb{D}^{\mathscr{M}}$ that satisfy $J,K\geq i$ for all standard elements $i\in\mathbb{D}$ .

The arguments in Sections 3.3 and 3.4 apply verbatim once the lemmas in Section 3.2 have been adapted, completing the proof of Theorem 4.

Appendix A. A Dominated Convergence Theorem for notions of integration in Banach spaces

This appendix bears a close connection to our prior manuscript on measure, integration and metastable convergence in Henson structures [DnI17]. Our main goal is proving Lemma 3.3. Rather than doing so in the specific context of PET structures, we prove a more general result (Theorem 5) about sequences of integrals of functions on a finite measure space taking values in a Banach space. This requires a number of preliminary steps.

A.1. Integration structures

We recall the class of integration structures (with underlying finite positive measure) introduced in our earlier manuscript, to which we refer the reader for details [DnI17]. These are saturated models of the Henson theory $\operatorname{Th}_{\int}$ of integration with respect to a positive finite measure on structures with (classical) sorts $\mathbb{R}$ , $\Omega$ , $\mathcal{A}_{\Omega}$ , $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ where $\mathbb{R}$ is the set of real numbers, $\mathcal{A}_{\Omega}$ is a $\sigma$ -algebra of subsets of $\Omega$ , and $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ is the set of bounded $\mathcal{A}_{\Omega}$ -measurable (everywhere-defined) real functions on $\Omega$ . (Here $\Omega$ , $\mathcal{A}_{\Omega}$ are discrete while $\mathbb{R}$ , $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ are real Banach spaces.) This theory contains all Henson formulas involving the functions and distinguished constants below that are valid in such structures:

• Constants: Rational numbers $r\in\mathbb{R}$ , zero vector in the Banach sort $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ , an arbitrary point (“anchor”) $\omega_{0}\in\Omega$ , the empty set $\emptyset\in\mathcal{A}_{\Omega}$ , and the improper subset $\Omega\in\mathcal{A}_{\Omega}$ .

• Functions:

  – Arithmetic operations (addition and multiplication), absolute value and lattice operations (binary min and max) on $\mathbb{R}$ ;

  – The characteristic function $\llbracket{\cdot}\boldsymbol{\in}{\cdot}\rrbracket:\Omega\times\mathcal{A}_{\Omega}\to\{0,1\}\subseteq\mathbb{R}$ of the membership relation $\in$ on $\Omega\times\mathcal{A}_{\Omega}$ ;

  – Banach operations (addition, scalar multiplication) and norm on $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ (namely, $\|f\|=\sup_{x\in\Omega}|f(x)|$ for $f\in\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ ; note that an almost-everywhere null function $f$ has positive norm per this definition unless $f=0$ everywhere);

  – The evaluation map $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}\times\Omega\to\mathbb{R}:(f,x)\mapsto f(x)$ ;

  – The Banach lattice operations (binary min and max) on $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ ;

  – The unary operation of pointwise absolute value $f\mapsto\left|f\right|$ on $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ where $\left|f\right|\in\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ is the function $x\mapsto|f(x)|$ ;

  – The Boolean algebra operations of union, intersection and relative complement $S\mapsto S^{\complement}=\Omega\setminus S$ on $\mathcal{A}_{\Omega}$ ;

  – The characteristic-function map $\chi:\mathcal{A}_{\Omega}\to\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}:S\mapsto\chi_{S}$ ;

  – A positive finite measure $\mu$ on $\Omega$ ;

  – The integration operator $I:\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}\to\mathbb{R}:f\mapsto\int_{\Omega}f\,d\mu$ .

Let $\mathcal{L}$ be any Henson language including sort symbols $\mathbb{R},\Omega,\mathcal{A}_{\Omega},\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ as well as constant and function symbols matching the lists above, and let $\operatorname{Th}_{\int}$ be the $\mathcal{L}$ -theory of such structures $\mathscr{M}=(\mathbf{S},\mathbf{F},\mathbf{C})$ , where $\mathbf{S}$ is the list of sorts, $\mathbf{F}$ the collection of distinguished functions, and $\mathbf{C}$ the set of distinguished elements of $\mathscr{M}$ . An (abstract) pre-integration structure is a model of $\operatorname{Th}_{\int}$ . An integration structure is a saturated model of $\operatorname{Th}_{\int}$ . If $\mathscr{M}$ is any pre-integration structure (whether saturated or not), then via interpretation of constants, the membership relation $\llbracket{\cdot}\boldsymbol{\in}{\cdot}\rrbracket$ , and the evaluation map, we may identify $\mathbb{R}^{\mathscr{M}}$ with $\mathbb{R}$ , ${\mathcal{A}_{\Omega}^{\mathscr{M}}}$ with a Boolean algebra of subsets of $\Omega^{\mathscr{M}}$ , and $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ with a set of functions $\Omega\to\mathbb{R}$ . However, $\mathcal{A}_{\Omega}$ need not be a $\sigma$ -algebra. Accordingly, $\mu^{\mathscr{M}}$ is typically just a finitely (not countably) additive measure on $(\Omega^{\mathscr{M}},{\mathcal{A}_{\Omega}^{\mathscr{M}}})$ , while elements $f\in\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ are identified with uniformly bounded functions on $\Omega$ that may only be approximately $\mathcal{A}_{\Omega}$ -measurable. [Footnote 11: A function $f$ on $\Omega$ is approximately $\mathcal{A}_{\Omega}$ -measurable if for all rational $r<s$ there exists $A\in\mathcal{A}_{\Omega}$ such that $f^{-1}((-\infty,r))\subseteq A\subseteq f^{-1}((-\infty,s])$ , a property axiomatizable by countably many Henson formulas in the logic of approximate satisfaction ([DnI17], Proposition 4.4).] Nevertheless, in earlier work we have shown how the classical (i.e., $\sigma$ -additive) theory of integration of bounded measurable functions over a finite measure space and the corresponding version of the Dominated Convergence Theorem are recovered essentially verbatim in saturated Henson integration structures via an analogous construction to that of Loeb measure in nonstandard analysis [DnI17].

A.2. Loeb structures

Definition A.1 (Loeb structure).

Let ${\operatorname{Th}_{\mathrm{Loeb}}}$ be the reduct of the $\mathcal{L}$ -theory $\operatorname{Th}_{\int}$ of integration structures with a positive measure to the language $\mathcal{L}^{\prime}$ obtained by removing from $\mathcal{L}$ the symbol for sort $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ as well as all functions and constants involving $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ (such as the symbol $I$ for the integral). A model of ${\operatorname{Th}_{\mathrm{Loeb}}}$ is a pre-Loeb structure. (Note that $\mathscr{M}$ may be an $\widetilde{\mathcal{L}}$ -structure for a language $\widetilde{\mathcal{L}}$ properly extending the language $\mathcal{L}^{\prime}$ of Loeb structures, and thus have other sorts, functions and constants prescribed by $\widetilde{\mathcal{L}}$ but not by $\mathcal{L}^{\prime}$ .)

A Loeb structure is a saturated pre-Loeb structure.

Note that, for the present discussion, we are requiring the measure $\mu^{\mathscr{M}}$ in a pre-Loeb structure $\mathscr{M}$ to be positive.

If $\mathscr{M}$ is any pre-Loeb structure, the set underlying a given $A\in{\mathcal{A}_{\Omega}^{\mathscr{M}}}$ is

[TABLE]

We may (externally) identify $A$ with $[A]$ since

[TABLE]

is a sentence in ${\operatorname{Th}_{\mathrm{Loeb}}}$ . [Footnote 12: Although Henson’s languages have no conditional connective “ $\rightarrow$ ”, when $P$ is a discrete predicate (i.e., a term taking only the values $0,1$ ), a non-Henson formula such as $P(x)\rightarrow\varphi(x)$ can be semantically identified with the Henson formula $(P(x)\leq 1/2)\vee\varphi(x)$ . (By contrast, the converse $\varphi(x)\rightarrow P(x)$ is not semantically equivalent to a Henson formula in general.) When both $P,Q$ are discrete, a biconditional $P(x)\leftrightarrow Q(x)$ can similarly be rewritten as a Henson formula. The assertion “ $R$ is discrete” is captured by the Henson formula $(\forall x)(R(x)=0\vee R(x)=1)$ , where “ $R(x)=r$ ” is itself an abbreviation for “ $(R(x)\leq r)\wedge(R(x)\geq r)$ ”.]

Definition A.2 (Loeb measure and Loeb-measurable sets).

Let $\mathscr{M}$ be a pre-Loeb structure with positive measure $\mu=\mu^{\mathscr{M}}$ .

A set $S\subseteq\Omega^{\mathscr{M}}$ is $\mathcal{A}_{\Omega}$ -measurable (or just measurable) if $S=[A]$ for some $A\in{\mathcal{A}_{\Omega}^{\mathscr{M}}}$ (i.e., if “ $S\in{\mathcal{A}_{\Omega}^{\mathscr{M}}}$ ”—modulo the identification of $S=[A]$ with $A$ itself).

A set $S\subseteq\Omega^{\mathscr{M}}$ is $\mu$ -measurable (or Loeb-measurable (modulo $\mu$ )) if for every $\epsilon>0$ there exist measurable $A,B\in{\mathcal{A}_{\Omega}^{\mathscr{M}}}$ such that $[A]\subseteq S\subseteq[B]$ and $\mu(B-A)\leq\epsilon$ .

The Loeb measure of a Loeb-measurable set $S$ is

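$\mu_{\mathrm{L}}(S)=\sup\{\mu(A):A\in\mathcal{A}_{\Omega}^{\mathscr{M}},\ [A]\subseteq S\}=\inf\{\mu(B):B\in\mathcal{A}_{\Omega}^{\mathscr{M}},\ [B]\supseteq S\}$ .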

The Loeb algebra of $\mathscr{M}$ is the collection $\llbracket\mathcal{A}\rrbracket_{\mu}$ of all Loeb-measurable subsets of $\Omega^{\mathscr{M}}$ .

Note that $\llbracket\mathcal{A}\rrbracket_{\mu}$ is an external collection of subsets of $\Omega^{\mathscr{M}}$ . It depends on $\mathscr{M}$ and has no intrinsic definition otherwise. It is easy to check that $\llbracket\mathcal{A}\rrbracket_{\mu}$ is an algebra of sets (i.e., closed under finite unions and intersections as well as complements). In fact, as soon as $\mathscr{M}$ is at least $\omega$ -saturated (i.e., types over a countable set of parameters are realized), $\llbracket\mathcal{A}\rrbracket_{\mu}$ is a $\sigma$ -algebra that is complete for $\mu_{\mathrm{L}}$ in the sense that any subset of a $\mu_{\mathrm{L}}$ -null set is itself $\mu_{\mathrm{L}}$ -null ([DnI17], Proposition 3.4). On the other hand, no degree of saturation ensures that $\llbracket\mathcal{A}\rrbracket_{\mu}$ is closed under unions of subfamilies of size $\omega_{1}$ or more.

A.3. Integration frameworks

We need to introduce the notion of (real) integration framework, which generalizes integration structures as presented in section A.1. Roughly speaking, an integration framework is a saturated model of the theory of the operations of integration with respect to arbitrary finite (positive or signed) measures on a measure space.

Consider the reduct $\widetilde{\mathscr{M}}$ of a classical pre-integration structure $\mathscr{M}$ , obtained by removing from $\mathscr{M}$ the distinguished measure $\mu$ and all the functions involving $\mu$ (including the integration operator $I$ ). Now expand $\widetilde{\mathscr{M}}$ to a structure $\mathscr{M}^{\prime}$ with a new Banach sort $\mathfrak{M}_{\Omega}$ containing all finite (signed, real-valued) measures $\mu$ on $\Omega$ plus the following distinguished functions and constants:

•

Constants: The zero measure $0\in\mathfrak{M}_{\Omega}$ ;

•

Functions:

–

Vector space operations of addition and scalar multiplication on $\mathfrak{M}_{\Omega}$ .

–

Banach norm of total variation on $\mathfrak{M}_{\Omega}$ :

$\|\mu\|=\sup\{|\mu(A)|+|\mu(A^{\complement})|:A\in\mathcal{A}_{\Omega}\}$ ;

–

The inclusion maps $\Omega\hookrightarrow\mathcal{A}_{\Omega}:x\mapsto\{x\}$ and $\Omega\hookrightarrow\mathfrak{M}_{\Omega}:x\mapsto\delta_{x}$ (the unit point mass at $x$ );

–

The evaluation map $\mathfrak{M}_{\Omega}\times\mathcal{A}_{\Omega}\to\mathbb{R}:(\mu,A)\mapsto\mu(A)$ (which is 1-Lipschitz by definition of the norm $\left\|\cdot\right\|$ on $\mathfrak{M}_{\Omega}$ );

–

The total variation map $\mathfrak{M}_{\Omega}\to\mathfrak{M}_{\Omega}$ : $\mu\mapsto|\mu|$ where $|\mu|$ is the (positive) measure of total variation of $\mu$ : $|\mu|(A)=\sup\{|\mu(A\cap B)|+|\mu(A\cap B^{\complement})|:B\in\mathcal{A}_{\Omega}\}$ ;

–

The integration operator $\langle{\cdot},{\cdot}\rangle:\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}\times\mathfrak{M}_{\Omega}\to\mathbb{R}:(f,\mu)\mapsto\langle{f},{\mu}\rangle=\int_{\Omega}f\,d\mu$ .

Given a language $\mathcal{L}$ for pre-integration structures, let $\mathcal{L}^{\prime}$ be obtained from $\mathcal{L}$ by removing the symbols $\boldsymbol{\mu},I$ for the distinguished measure and integral operator, and adding a new sort symbol $\mathfrak{M}_{\Omega}$ as well as new constant and function symbols per the list above.
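For intuition only, the total variation norm and the total variation measure listed above can be computed by brute force when $\Omega$ is a small finite set and $\mathcal{A}_{\Omega}$ is its full power set. The following Python sketch makes exactly those assumptions; the four-point space, the point masses and all function names are hypothetical and do not come from the paper.

```python
from itertools import combinations

# Hypothetical toy data: a signed measure on a 4-point space, given by point masses.
OMEGA = ("a", "b", "c", "d")
MASS = {"a": 0.5, "b": -1.25, "c": 0.0, "d": 2.0}


def subsets(points):
    """Yield every subset of `points` as a frozenset (the full power-set algebra)."""
    for size in range(len(points) + 1):
        for combo in combinations(points, size):
            yield frozenset(combo)


def measure(A):
    """Evaluation map (mu, A) -> mu(A) for the fixed toy measure."""
    return sum(MASS[x] for x in A)


def total_variation_measure(A):
    """|mu|(A) = sup{ |mu(A & B)| + |mu(A - B)| : B in the algebra }."""
    return max(abs(measure(A & B)) + abs(measure(A - B)) for B in subsets(OMEGA))


def tv_norm():
    """||mu|| = sup{ |mu(A)| + |mu(complement of A)| : A in the algebra }."""
    whole = frozenset(OMEGA)
    return max(abs(measure(A)) + abs(measure(whole - A)) for A in subsets(OMEGA))
```

In this toy case `tv_norm()` agrees with `total_variation_measure(frozenset(OMEGA))`, and every value `abs(measure(A))` is bounded by `tv_norm()`, which is the 1-Lipschitz property of the evaluation map in its second argument.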

Definition A.3 (Real integration framework).

A classical (real) pre-integration framework is any $\mathcal{L}^{\prime}$ -structure $\mathscr{M}^{\prime}$ as described above. An (abstract) real pre-integration framework is any model of the Henson theory ${\operatorname{Th}_{{\int:\mathbb{R}}}}$ of classical real pre-integration frameworks. [Footnote 13: It is straightforward to verify that ${\operatorname{Th}_{{\int:\mathbb{R}}}}$ is a uniform theory.]

More generally, any structure $\mathscr{M}$ in a language expanding $\mathcal{L}^{\prime}$ such that the $\mathcal{L}^{\prime}$ -reduct of $\mathscr{M}$ is a pre-integration framework in the above sense will be called a pre-integration framework.

A (real) integration framework is a saturated real pre-integration framework.

A.4. Banach integration frameworks

There is no completely general notion of integration of functions taking values in an arbitrary Banach space $\mathfrak{B}$ —not even for bounded functions $F:\Omega\to\mathfrak{B}$ on a finite measure space $(\Omega,\mathcal{A}_{\Omega})$ . However, it is very natural to require that any such notion of Banach integration should build upon the classical integral of real-valued functions. Our viewpoint is that any reasonable notion of Banach integration must expand a (pre-)integration framework to a Banach (pre-)integration framework per Definition A.4 below.

Consider expansions $\mathscr{M}^{\prime}=(\mathbf{S}^{\prime},\mathbf{F}^{\prime},\mathbf{C}^{\prime})$ of real integration frameworks $\mathscr{M}=(\mathbf{S},\mathbf{F},\mathbf{C})$ where $\mathbf{S}^{\prime}\supset\mathbf{S}$ contains two new sorts $\mathfrak{B}$ and $\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}$ , while $\mathbf{F}^{\prime}\supset\mathbf{F}$ and $\mathbf{C}^{\prime}\supset\mathbf{C}$ contain new functions and constants as follows:

•

Addition, scalar product and norm on $\mathfrak{B}$ making it a real Banach space.

•

Addition, scalar product and norm on $\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}$ making it a Banach space.

•

The zero elements of $\mathfrak{B}$ and $\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}$ .

•

An evaluation map $\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}\times\Omega\to\mathfrak{B}:(F,x)\mapsto F(x)$ such that $\|F\|=\sup\{\|F(x)\|:x\in\Omega\}$ . (Thus, elements of $\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}$ may be identified with functions $\Omega\to\mathfrak{B}$ .)

•

The operation of multiplication $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}\times\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}\to\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}:(f,F)\mapsto fF$ (such that $(fF)(x)=f(x)F(x)$ for all $x\in\Omega$ ) under which $\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}$ is an $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ -module.

•

The inclusion map $\mathfrak{B}\to\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}:T\mapsto T(\blacksquare)$ where $T(\blacksquare)\in\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}$ is identified via evaluation with the constant function $\Omega\to\mathfrak{B}:x\mapsto T$ .

•

The pointwise-norm map $\left|\cdot\right|:\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}\to\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ satisfying $\left|F\right|(x)=\left\|F(x)\right\|$ for all $F\in\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}$ , $x\in\Omega$ .

•

An operation of Banach integration, namely a pairing $\llangle{\cdot},{\cdot}\rrangle:\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}\times\mathfrak{M}_{\Omega}\to\mathfrak{B}$ satisfying the following properties:

(1)

$\llangle\cdot,\cdot\rrangle$ is bilinear;

(2)

$\llangle\cdot,\cdot\rrangle$ is compatible with the integration $\langle\cdot,\cdot\rangle$ of real functions:

(a)

For all $T\in\mathfrak{B}$ and $f\in\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ : $\llangle fT,\mu\rrangle=\langle f,\mu\rangle T$ .

(b)

For all $F\in\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}}$ : $\|\llangle F,\mu\rrangle\|\leq|\langle|F|,|\mu|\rangle|$ .

Definition A.4 (Banach integration framework).

A language $\mathcal{L}^{\prime}$ expanding the language $\mathcal{L}$ of real pre-integration frameworks with the new sort symbols plus symbols for the functions and constants above is called a language for Banach integration frameworks.

Let ${\operatorname{Th}_{{\int:\mathbb{R}}}}$ be the Henson $\mathcal{L}$ -theory of real pre-integration frameworks, and let ${\operatorname{Th}_{{\int:\mathfrak{B}}}}$ extend ${\operatorname{Th}_{{\int:\mathbb{R}}}}$ with further Henson $\mathcal{L}^{\prime}$ -axioms capturing the properties of new sorts, functions and constants stated above (in semantically equivalent terms, let ${\operatorname{Th}_{{\int:\mathfrak{B}}}}$ be the Henson $\mathcal{L}^{\prime}$ -theory of those expansions $\mathscr{M}^{\prime}$ of real pre-integration frameworks having the properties above). [Footnote 14: The verification that ${\operatorname{Th}_{{\int:\mathfrak{B}}}}$ is a uniform theory is routine.]

A Banach pre-integration framework is a model of ${\operatorname{Th}_{{\int:\mathfrak{B}}}}$ . More generally, if $\widetilde{\mathcal{L}}$ is a language extending $\mathcal{L}^{\prime}$ and $\mathscr{M}$ is an $\widetilde{\mathcal{L}}$ -structure whose reduct $\mathscr{M}\mathord{\upharpoonright}\mathcal{L}^{\prime}$ is a model of ${\operatorname{Th}_{{\int:\mathfrak{B}}}}$ , we shall still call $\mathscr{M}$ a Banach pre-integration framework.

A Banach integration framework is a saturated model of ${\operatorname{Th}_{{\int:\mathfrak{B}}}}$ .

*Remark A.5**.*

The question whether a real pre-integration framework $\mathscr{M}$ admits an expansion to a Banach pre-integration framework $\mathscr{M}^{\prime}$ is very delicate. In general, the answer may be negative. However, when $\Omega^{\mathscr{M}}$ is a finite set the answer is affirmative: It suffices to let $(\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}})^{\mathscr{M}^{\prime}}$ be the set of all functions $F:\Omega^{\mathscr{M}}\to\mathfrak{B}^{\mathscr{M}}$ , and also let $\llangle{F},{\mu}\rrangle=\sum_{x\in\Omega^{\mathscr{M}}}F(x)\mu(\{x\})$ . The remaining ingredients of the expansion are defined in the obvious manner. Similarly, an expansion $\mathscr{M}^{\prime}$ also exists if the Banach sort $\mathfrak{B}^{\mathscr{M}}$ has finite dimension (using a basis for $\mathfrak{B}^{\mathscr{M}}$ , real-valued integration extends to $\mathfrak{B}^{\mathscr{M}}$ -valued integration in the straightforward classical fashion).
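The finite case of the remark can be made concrete. The sketch below assumes a three-point $\Omega$ and takes $\mathfrak{B}=\mathbb{R}^{2}$ with the Euclidean norm; the data and the helper names (`banach_integral`, `real_integral`) are hypothetical, chosen only to illustrate the formula $\llangle F,\mu\rrangle=\sum_{x}F(x)\mu(\{x\})$ and the two compatibility properties (a) and (b).

```python
# Hypothetical toy data: Omega has three points, B is R^2 with the Euclidean norm.
OMEGA = (0, 1, 2)
MU = {0: 1.5, 1: -0.5, 2: 2.0}                        # signed point masses mu({x})
F = {0: (3.0, 4.0), 1: (0.0, -2.0), 2: (1.0, 0.0)}    # a function Omega -> R^2


def norm(v):
    """Euclidean norm on R^2."""
    return (v[0] ** 2 + v[1] ** 2) ** 0.5


def banach_integral(G, mu):
    """<<G, mu>> = sum over x of G(x) * mu({x}), computed coordinatewise."""
    return tuple(sum(G[x][k] * mu[x] for x in OMEGA) for k in range(2))


def real_integral(g, mu):
    """<g, mu> = sum over x of g(x) * mu({x})."""
    return sum(g[x] * mu[x] for x in OMEGA)


pointwise_norm = {x: norm(F[x]) for x in OMEGA}   # |F|(x) = ||F(x)||
abs_mu = {x: abs(MU[x]) for x in OMEGA}           # |mu|({x}) = |mu({x})|

# Data for property (a): a real function f and a fixed vector T.
f = {0: 2.0, 1: 1.0, 2: -1.0}
T = (1.0, -3.0)
fT = {x: (f[x] * T[0], f[x] * T[1]) for x in OMEGA}
```

With these numbers, `banach_integral(fT, MU)` coincides with `real_integral(f, MU)` times `T`, and the norm of `banach_integral(F, MU)` is bounded by `real_integral(pointwise_norm, abs_mu)`.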

A.5. A Dominated Convergence Theorem for nets of functions in Banach integration frameworks

In order to formulate a version of the Dominated Convergence Theorem for integration frameworks (Theorem 5 below), we fix a directed set $(\mathbb{D},\preceq)$ so we can eventually discuss convergence of nets on it. [Footnote 15: I.e., $\preceq$ is a nonstrict partial order on $\mathbb{D}$ such that any two $i,j\in\mathbb{D}$ have an upper bound $k$ .] Classical sequences indexed by the directed set $(\mathbb{N},\leq)$ of natural numbers are of particular interest. Only infinite directed sets are useful as tools to define and study notions of convergence in analysis and topology; on the other hand, critical results such as Theorem 5 depend on the countability of the directed set, so we may as well fix an infinite countable directed set $(\mathbb{D},\preceq)$ for the remainder of the manuscript (this hypothesis will be made explicit whenever needed).

Definition A.6.

Fix a directed set $(\mathbb{D},\preceq)$ . For $i\in\mathbb{D}$ , the final segment of $\mathbb{D}$ starting at $i$ is $\mathbb{D}_{\succeq i}=\{j\in\mathbb{D}:j\succeq i\}$ (i.e., the set of elements equal to or greater than $i$ in $\mathbb{D}$ ). A $\mathbb{D}$ -net $a_{\bullet}$ in a metric space $(X,\mathrm{d})$ is any function $\mathbb{D}\to X:i\mapsto a_{i}$ . The spread of $a_{\bullet}$ from $i$ is

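$\operatorname{spr}_{\succeq i}(a_{\bullet})=\sup\{\mathrm{d}(a_{j},a_{k}):j,k\in\mathbb{D}_{\succeq i}\}$ .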

The oscillation of $a_{\bullet}$ is

$\operatorname{osc}(a_{\bullet})=\inf_{i\in\mathbb{D}}\operatorname{spr}_{\succeq i}(a_{\bullet})$ .

The net $a_{\bullet}$ converges if $\operatorname{osc}(a_{\bullet})=0$ .
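As a small numerical illustration, the sketch below reads the spread from $i$ as the supremum of $\mathrm{d}(a_{j},a_{k})$ over $j,k\succeq i$ and the oscillation as the infimum of the spreads, which is how both quantities are used in the proof of Theorem 5. It works over $(\mathbb{N},\leq)$ with a hypothetical real-valued net and truncates each tail at a finite horizon, so in general it computes only a lower bound for the true spread.

```python
from fractions import Fraction


def a(n):
    """Hypothetical example net on (N, <=): a_n = (-1)^n * (1 + 1/n), n >= 1."""
    return (-1) ** n * (1 + Fraction(1, n))


def spread_from(i, horizon):
    """sup of |a_j - a_k| over i <= j, k <= horizon (a finite truncation of the tail).

    On the real line this supremum is simply max(tail) - min(tail).
    """
    tail = [a(n) for n in range(i, horizon + 1)]
    return max(tail) - min(tail)
```

For this particular net the extreme values of each tail occur at its first two indices, so the truncated spread equals $2+1/i+1/(i+1)$ as soon as the horizon is at least $i+1$. The spreads decrease to $2$, so the oscillation is $2$ and the net does not converge.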

Theorem 5 (Dominated Convergence Theorem in Banach integration frameworks).

Fix a countable directed set $\mathbb{D}$ . Let $\mathscr{M}$ be any (saturated) Banach integration framework. Let $\varphi_{\bullet}$ be a bounded $\mathbb{D}$ -net in $(\mathscr{L}^{\infty}_{\Omega,\mathfrak{B}})^{\mathscr{M}}$ . For every $x\in\Omega^{\mathscr{M}}$ and $\mu\in\mathfrak{M}_{\Omega}^{\mathscr{M}}$ , let $\varphi_{\bullet}(x)$ denote the net $(\varphi_{j}(x):j\in\mathbb{D})$ and $\llangle{\varphi_{\bullet}},{\mu}\rrangle$ the net $(\llangle{\varphi_{j}},{\mu}\rrangle:j\in\mathbb{D})$ in $\mathfrak{B}^{\mathscr{M}}$ . Then we have

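$\operatorname{osc}\big(\llangle{\varphi_{\bullet}},{\mu}\rrangle\big)\leq\left\|\mu\right\|\cdot\sup_{x\in\Omega^{\mathscr{M}}}\operatorname{osc}\big(\varphi_{\bullet}(x)\big)$ .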

In particular, if the net $\varphi_{\bullet}(x)$ is convergent for all $x\in\Omega^{\mathscr{M}}$ , then $\llangle{\varphi_{\bullet}},{\mu}\rrangle$ is convergent.

The proof of Theorem 5 below is an adaptation of our earlier one for real-valued notions of integration ([DnI17], Proposition 5.3).

Recall that a collection $\mathcal{F}$ of subsets of a set $U$ is a (proper) filter on $U$ if (i) $\emptyset\notin\mathcal{F}$ , (ii) $\mathcal{F}$ is closed under finite intersections, and (iii) $\mathcal{F}$ is upward closed: if $A\in\mathcal{F}$ and $A\subseteq B\subseteq U$ , then $B\in\mathcal{F}$ . A proper filter $\mathcal{F}$ is an ultrafilter if $A\in\mathcal{F}$ or $U\setminus A\in\mathcal{F}$ for all $A\subseteq U$ .

For an introduction to ultrafilters, ultralimits and ultraproduct constructions in model theory, the reader is referred to Bell and Slomson’s monograph [BS06].

Definition A.7.

If $X$ is any nonempty set, let $\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ be the family of finite nonempty subsets of $X$ . We call a filter $\mathcal{F}$ on $\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ greedy if it contains all the sets $X_{\supseteq S}=\{T\in\mathcal{P}^{*}_{\!\mathrm{fin}}(X):T\supseteq S\}$ for all $S\in\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ .

Note that the collection $\{X_{\supseteq S}:S\in\mathcal{P}^{*}_{\!\mathrm{fin}}(X)\}$ is a filter base on $\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ since $X_{\supseteq S}\cap X_{\supseteq T}=X_{\supseteq S\cup T}$ . (This means that the collection of subsets of $\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ that are supersets of $X_{\supseteq S}$ for some $S\in\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ is a filter on $\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ .) [Footnote 16: Recall that a filter base on a set $U$ is a collection $\mathcal{E}$ of subsets of $U$ such that $\emptyset\notin\mathcal{E}$ and $\mathcal{E}$ is downward directed by inclusion in the sense that if $A,B\in\mathcal{E}$ then $A\cap B\supseteq C$ for some $C\in\mathcal{E}$ . The filter $\mathcal{F}$ with base $\mathcal{E}$ is the collection of all subsets of $U$ that are supersets of some $A\in\mathcal{E}$ .] By a routine application of the axiom of choice, greedy ultrafilters on $\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ exist whenever $X$ is nonempty. Observe that the principal ultrafilter generated by a fixed $S\in\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ is greedy precisely when $S=X$ ; thus, if $X$ is infinite, greedy ultrafilters on $\mathcal{P}^{*}_{\!\mathrm{fin}}(X)$ are nonprincipal.
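The filter-base identity and the remark on principal ultrafilters can be checked exhaustively on a small ground set. The following sketch assumes a hypothetical four-element $X$; the names `up` and `principal_is_greedy` are ours. It uses the fact that the principal ultrafilter generated by $S_{0}$ contains a set exactly when $S_{0}$ belongs to that set.

```python
from itertools import combinations

X = frozenset({1, 2, 3, 4})   # hypothetical small ground set


def finite_nonempty_subsets(ground):
    """P*_fin(ground): all nonempty (finite) subsets, as frozensets."""
    items = sorted(ground)
    return {frozenset(c) for r in range(1, len(items) + 1) for c in combinations(items, r)}


P = finite_nonempty_subsets(X)


def up(S):
    """X_{>=S}: the members T of P*_fin(X) that are supersets of S."""
    return frozenset(T for T in P if T >= S)


def principal_is_greedy(S0):
    """The principal ultrafilter at S0 is greedy iff S0 lies in up(S) for every S."""
    return all(S0 in up(S) for S in P)
```

Here `up(S) & up(T) == up(S | T)` holds for all pairs, no `up(S)` is empty, and `X` itself is the only generator whose principal ultrafilter is greedy. When $X$ is infinite no such generator exists, which is why greedy ultrafilters are then nonprincipal.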

Lemma A.8.

Let $f:\Omega^{\mathscr{M}}\to\mathbb{R}$ be bounded and (externally) $\mu_{\mathrm{L}}$ -measurable. Then there exists $\widetilde{f}\in(\mathscr{L}^{\infty}_{\Omega,\mathbb{R}})^{\mathscr{M}}$ such that $f(x)=\widetilde{f}(x)$ for $\mu_{\mathrm{L}}$ -almost all $x\in\Omega^{\mathscr{M}}$ and $\inf_{x}f(x)\leq\widetilde{f}\leq\sup_{x}f(x)$ .

Proof.

Let $f$ be a $\mu_{\mathrm{L}}$ -measurable bounded external function on $\Omega^{\mathscr{M}}$ , and let $a=\inf_{x}f$ , $b=\sup_{x}f$ . The assertion is trivial if $a=b$ or if $\left\|\mu\right\|=0$ —just take a constant $\widetilde{f}$ in $[a,b]$ . Otherwise, we have $a<b$ and, replacing $\mu$ with $\left|\mu\right|$ , we may assume $\mu$ to be a positive measure without loss of generality. By definition of Loeb measurability, for rational $r\in[a,b]$ and integer $n\geq 1$ there exist $A^{r}_{n},B^{r}_{n}\in\mathcal{A}_{\Omega}^{\mathscr{M}}$ such that $[A_{n}^{r}]\subseteq\{f\leq r\}$ , $[B_{n}^{r}]\subseteq\{f\geq r\}$ , and $\mu_{\mathrm{L}}\{f\leq r\}-\mu(A_{n}^{r})\leq 1/n$ , $\mu_{\mathrm{L}}\{f\geq r\}-\mu(B_{n}^{r})\leq 1/n$ . For fixed $r$ , the sequences $(A^{r}_{n})$ , $(B^{r}_{n})$ may be constructed recursively to ensure $A_{m}^{r}\subseteq A^{r}_{n}$ and $B^{r}_{m}\subseteq B^{r}_{n}$ for $m\leq n$ . We may also assume $A_{m}^{r}=1_{\mathcal{A}_{\Omega}}$ if $r\geq b$ , and $B^{s}_{m}=1_{\mathcal{A}_{\Omega}}$ if $s\leq a$ .

Let $f^{r}_{n}=a\cdot(1-\chi_{B^{r}_{n}})+r\cdot\chi_{B^{r}_{n}}$ and $g^{r}_{n}=b\cdot(1-\chi_{A^{r}_{n}})+r\cdot\chi_{A^{r}_{n}}$ . Let $Q$ be the set of rational numbers in $[a,b]$ . The construction of $(A^{r}_{n})$ and $(B^{r}_{n})$ implies that $f^{r}_{m}\leq f^{r}_{n}\leq f\leq g^{r}_{n}\leq g^{r}_{m}$ for all $r\in Q$ and $m\leq n$ . For $I\in\mathcal{P}^{*}_{\!\mathrm{fin}}(Q)$ of cardinality $n$ , let $f^{I}=\max\{f^{r}_{n}:r\in I\}$ , $g^{I}=\min\{g^{r}_{n}:r\in I\}$ . Observe that $f^{r}_{n}\leq f^{I}\leq f^{J}\leq f\leq g^{J}\leq g^{I}\leq g^{r}_{n}$ if $I\subseteq J$ , $r\in I$ and $\operatorname{card}(I)\geq n$ . Since $a<b$ by assumption, $Q$ is infinite countable. Let $\mathcal{U}$ be a greedy ultrafilter on $\mathcal{P}^{*}_{\!\mathrm{fin}}(Q)$ . By saturation, there are $\widetilde{f},\widetilde{g}\in(\mathscr{L}^{\infty}_{\Omega,\mathbb{R}})^{\mathscr{M}}$ realizing the $\mathcal{U}$ -ultralimit of the types $\operatorname{tp}_{S}(f^{I},g^{I})$ over the set of parameters $S=\{\mu,a,b\}\cup\{A^{r}_{n},B^{r}_{n}\}_{r\in Q,n\in\mathbb{N}^{*}}$ . From the construction of $\mathcal{U}$ as a greedy ultrafilter, the definition of ultralimit, and the meaning of realization of a type, it is easy to verify that

$f^{r}_{n}\leq\widetilde{f}\leq\widetilde{g}\leq g^{r}_{n}$ for all $r\in Q$ and $n\geq 1$ .

It follows that for fixed $r\in Q$ we have $S^{r}:=\bigcup_{n}[A^{r}_{n}]=\bigcup_{n}\{g^{r}_{n}\leq r\}\subseteq\{\widetilde{g}\leq r\}$ . On the other hand, by construction of $A^{r}_{n}$ we have $S^{r}\subseteq\{f\leq r\}$ and $\mu_{\mathrm{L}}(S^{r})=\sup_{n}\mu(A^{r}_{n})=\mu_{\mathrm{L}}\{f\leq r\}$ . Thus, $S^{r}\subseteq\{f\leq r\}\cap\{\widetilde{g}\leq r\}$ and $\mu_{\mathrm{L}}(S^{r})=\mu_{\mathrm{L}}\{f\leq r\}$ ; hence, $\{f\leq r\}$ is $\mu_{\mathrm{L}}$ -almost included in $\{\widetilde{g}\leq r\}$ . By a completely analogous argument, $\{\widetilde{f}\geq r\}$ $\mu_{\mathrm{L}}$ -almost includes $\{f\geq r\}$ . These almost-inclusions for every (rational) $r\in Q$ are easily shown to imply the $\mu_{\mathrm{L}}$ -a.e. inequalities $\widetilde{g}\leq f\leq\widetilde{f}$ . However, $\widetilde{f}\leq\widetilde{g}$ , so in fact $\widetilde{f}=f=\widetilde{g}$ ( $\mu_{\mathrm{L}}$ -a.e.) ∎

Lemma A.9.

Fix $\mu\in\mathfrak{M}_{\Omega}^{\mathscr{M}}$ and let $a_{\bullet}=(a_{i}:i<\omega)$ be a sequence of external $\mu_{\mathrm{L}}$ -measurable functions $\Omega^{\mathscr{M}}\to\mathbb{R}$ . Then there exist $\sigma,\iota\in(\mathscr{L}^{\infty}_{\Omega,\mathbb{R}})^{\mathscr{M}}$ such that $\sigma(x)=\sup_{i<\omega}a_{i}(x)$ and $\iota(x)=\inf_{i<\omega}a_{i}(x)$ for $\mu_{\mathrm{L}}$ -almost all $x\in\Omega^{\mathscr{M}}$ .

If the (external) sequence $a_{\bullet}$ consists of internal functions, i.e., it is a sequence in $(\mathscr{L}^{\infty}_{\Omega,\mathbb{R}})^{\mathscr{M}}$ , then $\sigma,\iota$ may be chosen so $\sigma\geq\sup_{i<\omega}a_{i}$ and $\iota\leq\inf_{i<\omega}a_{i}$ .

Proof.

It is routine to show that $f=\sup_{i<\omega}a_{i}$ and $g=\inf_{i<\omega}a_{i}$ are $\mu_{\mathrm{L}}$ -measurable, so the first assertion follows from Lemma A.8.

When $a_{\bullet}$ is a sequence of internal functions, let $\mathcal{U}$ be any nonprincipal ultrafilter on $\omega$ and let $\sigma$ realize the $\mathcal{U}$ -ultralimit of the types $\operatorname{tp}_{S}(a^{k})$ over the set of parameters $S=\{\mu\}\cup\{a_{i}\}_{i<\omega}$ , where $a^{k}=\max\{a_{i}:i\leq k\}$ . (Recall that $\mathscr{L}^{\infty}_{\Omega,\mathbb{R}}$ is endowed with the binary lattice operation $\max\{a,b\}$ , which trivially defines $n$ -ary maximum operations for all $n\geq 1$ .) The verification that $\sigma$ has the required properties is routine. The construction of $\iota$ is identical upon replacing “max” by “min”. ∎

Proof of Theorem 5.

The asserted inequality evidently holds if $\mu=0$ . Otherwise, using a Jordan decomposition $\mu=\mu_{+}-\mu_{-}$ where $\mu_{+}=(\left|\mu\right|+\mu)/2$ and $\mu_{-}=(\left|\mu\right|-\mu)/2$ are positive, the proof is easily reduced to the case in which $\mu$ is a probability measure, which we assume henceforth.

Choose $C$ such that $\left\|\varphi_{i}\right\|\leq C$ for all $i$ . For $j,k\in\mathbb{D}$ let $\varphi^{j,k}=\left|\varphi_{k}-\varphi_{j}\right|\in(\mathscr{L}^{\infty}_{\Omega,\mathbb{R}})^{\mathscr{M}}$ . Since $\mathbb{D}$ is countable, Lemma A.9 implies that for each $i\in\mathbb{D}$ there is $\sigma^{i}\in(\mathscr{L}^{\infty}_{\Omega,\mathbb{R}})^{\mathscr{M}}$ with $\|\sigma^{i}\|\leq 2C$ such that $\sigma^{i}$ is $\mu_{\mathrm{L}}$ -a.e. equal to $\operatorname{spr}_{\succeq i}\varphi_{\bullet}=\sup_{j,k\succeq i}\varphi^{j,k}$ . Similarly, $\operatorname{osc}\varphi_{\bullet}=\inf_{i}\operatorname{spr}_{\succeq i}\varphi_{\bullet}$ is $\mu_{\mathrm{L}}$ -a.e. equal to $\inf_{i}\sigma^{i}$ , hence to some $\omega\in(\mathscr{L}^{\infty}_{\Omega,\mathbb{R}})^{\mathscr{M}}$ with $\left\|\omega\right\|\leq 2C$ .

Let $s=\sup_{x}\operatorname{osc}(\varphi_{\bullet}(x))$ and fix $t>s$ . Since $\{x\colon\operatorname{osc}\varphi_{\bullet}(x)\geq t\}$ is empty (by choice of $s$ and $t$ ) and $\omega(x)=\operatorname{osc}(\varphi_{\bullet}(x))$ for $\mu_{\mathrm{L}}$ -a.e. $x$ , the set $\{x\colon\omega(x)\geq t\}$ is $\mu_{\mathrm{L}}$ -null. Since $\omega=\inf_{i}\operatorname{spr}_{\succeq i}\varphi_{\bullet}$ ( $\mu_{\mathrm{L}}$ -a.e.), we have $\inf_{i}\mu_{\mathrm{L}}\{x\colon\operatorname{spr}_{\succeq i}\varphi_{\bullet}(x)\geq t\}=0$ . Thus, for arbitrary fixed $\epsilon>0$ we have $\mu_{\mathrm{L}}\{x\colon\operatorname{spr}_{\succeq i}\varphi_{\bullet}(x)\geq t\}\leq\epsilon$ for some $i=i_{\epsilon}\in\mathbb{D}$ . (This depends crucially on the hypothesis that $\mathbb{D}$ is countable.) It follows that for $j,k\succeq i$ :

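$\left\|\llangle{\varphi_{k}},{\mu}\rrangle-\llangle{\varphi_{j}},{\mu}\rrangle\right\|=\left\|\llangle{\varphi_{k}-\varphi_{j}},{\mu}\rrangle\right\|\leq\langle{\varphi^{j,k}},{\mu}\rangle\leq\langle{\sigma^{i}},{\mu}\rangle\leq t+2C\epsilon$ .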

This proves that $\operatorname{spr}_{\succeq i}\llangle{\varphi_{\bullet}},{\mu}\rrangle\leq t+2C\epsilon$ . As $t>s$ and $\epsilon>0$ are arbitrary, $\operatorname{osc}\llangle{\varphi_{\bullet}},{\mu}\rrangle\leq s$ . ∎

A.6. A Uniform Metastability Principle for nets in Henson structures

Proposition A.10 (Uniform Metastability Principle (UMP)).

Fix a directed set $(\mathbb{D},\preceq)$ . Fix a Henson language $\mathcal{L}$ including constants $(a_{j}:j\in\mathbb{D})$ all of a common sort $\mathbb{S}$ . Let $\mathcal{T}$ be a uniform $\mathcal{L}$ -theory such that for every model $\mathscr{M}$ of $\mathcal{T}$ the net $a_{\bullet}^{\mathscr{M}}=(a_{j}^{\mathscr{M}}:j\in\mathbb{D})$ is convergent. Then there exists a metastability rate $E_{\bullet}=E_{\bullet}^{\mathcal{T}}$ depending only on $\mathcal{T}$ that applies uniformly to all sequences $a_{\bullet}^{\mathscr{M}}$ in all models $\mathscr{M}$ of $\mathcal{T}$ .

Proof.

([DnI17], Proposition 2.4.) Assume no such rate of metastability exists. Then there exist $\epsilon>0$ and a sampling $\eta\in\prod_{i\in\mathbb{D}}\mathcal{P}^{*}_{\!\mathrm{fin}}(\mathbb{D}_{\succeq i})$ such that for every $S\in\mathcal{P}^{*}_{\!\mathrm{fin}}(\mathbb{D})$ there is a model $\mathscr{M}=\mathscr{M}^{S}_{\epsilon,\eta}$ of $\mathcal{T}$ such that $a_{\bullet}=a_{\bullet}^{\mathscr{M}}$ satisfies $\epsilon\leq\operatorname{spr}_{\eta_{i}}(a_{\bullet})=\max\{\mathrm{d}(a_{j},a_{k}):j,k\in\eta_{i}\}$ for all $i\in S$ . By the compactness theorem for Henson logic, there is a model $\mathscr{M}$ of $\mathcal{T}$ such that $a_{\bullet}=a_{\bullet}^{\mathscr{M}}$ satisfies $\operatorname{spr}_{\eta_{i}}(a_{\bullet})\geq\epsilon$ for all $i\in\mathbb{D}$ , and hence $\operatorname{osc}(a_{\bullet})\geq\epsilon$ , contradicting the hypothesis that $a_{\bullet}^{\mathscr{M}}$ converges. ∎
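To make the quantities in this proof tangible, the sketch below searches for an index $i$ at which a net is stable on the sampled window $\eta_{i}$, that is, $\operatorname{spr}_{\eta_{i}}(a_{\bullet})<\epsilon$. This reading of metastability is an assumption inferred from the proof above (the precise definition of a rate is in [DnI17]). The nets, the sampling and the function names are hypothetical, and the index set is $(\mathbb{N},\leq)$.

```python
from fractions import Fraction


def spread_on(net, indices):
    """spr_eta(a) = max{ |a_j - a_k| : j, k in eta } for a real-valued net."""
    values = [net(j) for j in indices]
    return max(values) - min(values)


def first_metastable_index(net, eps, sampling, search_limit):
    """Return the least i < search_limit with spr_{sampling(i)}(net) < eps, else None."""
    for i in range(search_limit):
        if spread_on(net, sampling(i)) < eps:
            return i
    return None


def harmonic(n):
    """Hypothetical convergent net on (N, <=): a_n = 1 / (n + 1)."""
    return Fraction(1, n + 1)


def alternating(n):
    """Hypothetical divergent net: a_n = (-1)^n."""
    return (-1) ** n


def window(i):
    """Hypothetical sampling: eta_i = {i, ..., 2i + 1}, a finite subset of the tail from i."""
    return range(i, 2 * i + 2)
```

For `harmonic` the spread on `window(i)` is $1/(2i+2)$, so the search with $\epsilon=1/10$ stops at $i=5$. For `alternating` every window contains two consecutive indices, the spread is always $2$, and the search returns `None` for $\epsilon=1$, mirroring the situation the compactness argument rules out for convergent nets.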

Bibliography

[Aus15a] Tim Austin, Pleasant extensions retaining algebraic structure, I, J. Anal. Math. 125 (2015), 1–36. MR 3317896
[Aus15b] Tim Austin, Pleasant extensions retaining algebraic structure, II, J. Anal. Math. 126 (2015), 1–111. MR 3358029
[Aus16] Tim Austin, Non-conventional ergodic averages for several commuting actions of an amenable group, J. Anal. Math. 130 (2016), 243–274. MR 3574655
[Ber87] V. Bergelson, Weakly mixing PET, Ergodic Theory Dynam. Systems 7 (1987), no. 3, 337–349. MR 912373 (89g:28022)
[Bir31] G. D. Birkhoff, Proof of the ergodic theorem, Proc. Nat. Acad. Sci. U. S. A. 17 (1931), 656–660.
[BL04] V. Bergelson and A. Leibman, Failure of the Roth theorem for solvable groups of exponential growth, Ergodic Theory Dynam. Systems 24 (2004), no. 1, 45–53.
[BS06] John Lane Bell and Alan B. Slomson, Models and ultraproducts: An introduction, Dover Books on Mathematics, Dover Publications, 2006.
[DnI17] Eduardo Dueñez and José Iovino, Model theory and metric convergence I: Metastability and dominated convergence, Beyond first order model theory, CRC Press, Boca Raton, FL, 2017, pp. 131–187. MR 3729326
