Regularity and inverse theorems for uniformity norms on compact abelian groups and nilmanifolds

Pablo Candela; Balázs Szegedy

arXiv:1902.01098·math.CO·March 15, 2022

TL;DR

This paper establishes a general regularity and inverse theorem for uniformity norms on compact abelian groups and nilmanifolds, unifying and extending previous results, with applications to inverse theorems and structure of nilspaces.

Contribution

It introduces a unified framework for regularity and inverse theorems for uniformity norms on a broad class of compact nilspaces, including non-abelian cases, and provides new structural results for nilspaces.

Findings

1. Proves a general regularity theorem for uniformity norms.

2. Establishes an inverse theorem for these norms on a class of compact nilspaces that includes all compact abelian groups and nilmanifolds.

3. Provides new structural and stability results for nilspaces.

Equations

\mathbb{E}(f \mid B_{F_{0}' \cap F_{1}'}) \in L^{\infty}(A_{F_{0}' \cap F_{1}'}),

\mu_{\operatorname{X}}^{\llbracket k+1\rrbracket}\,\big{(}\{\operatorname{c}\in\operatorname{C}^{k+1}(\operatorname{X}):\exists\operatorname{c}^{\prime}\in\operatorname{C}^{k+1}(\operatorname{Y}),\,\forall\,v\in\llbracket k+1\rrbracket,\,\phi\operatorname{\circ}\operatorname{c}(v)\approx_{\delta}\operatorname{c}^{\prime}(v)\}\big{)}\geq 1-\delta,

S=\big{\{}x\in\operatorname{X}:\mu_{\operatorname{C}^{k}_{x}(\operatorname{X})}\big{(}\{\operatorname{c}\in\operatorname{C}^{k}_{x}(\operatorname{X}):d_{\operatorname{Z}}(\rho(\operatorname{c}),0)\leq\epsilon^{1/4}\}\big{)}\geq 1-\epsilon^{1/4}\big{\}},

\mu_{\operatorname{X}}(\operatorname{X}\setminus S)\,\epsilon^{1/2}<\int_{\operatorname{X}}\int_{\operatorname{C}^{k}_{x}(\operatorname{X})}d_{\operatorname{Z}}(\rho(\operatorname{c}),0)\,\mathrm{d}\mu_{\operatorname{C}^{k}_{x}(\operatorname{X})}(\operatorname{c})\,\mathrm{d}\mu_{\operatorname{X}}(x)=d_{1}(\rho,0)\leq\epsilon.

\mu_{\mathcal{T}(\operatorname{c})}\big{(}\big{\{}t\in\mathcal{T}(\operatorname{c}):\forall\,v\in\llbracket k\rrbracket,\,d_{\operatorname{Z}}\big{(}\rho(t\operatorname{\circ}\Psi_{v}),0\big{)}\leq\epsilon^{1/4}\big{\}}\big{)}\geq 1-2^{k}\epsilon^{1/4}.

\mu_{\operatorname{C}^{k}_{x}(\operatorname{X})}\big{(}\big{\{}\operatorname{c}\in\operatorname{C}^{k}_{x}(\operatorname{X}):d_{\operatorname{Z}}\big{(}\rho(\operatorname{c}),g(x)\big{)}\leq 4^{k}\epsilon^{1/4}\big{\}}\big{)}>1-4^{k}\epsilon^{1/2}.

\mu_{\mathcal{T}(\operatorname{c}_{0})}\big{(}\big{\{}t\in\mathcal{T}(\operatorname{c}_{0}):\forall\,v\neq 0^{k},\,t\operatorname{\circ}\Psi_{v}\in S^{\llbracket k\rrbracket}\big{\}}\big{)}>1-(2^{k}-1)^{2}\epsilon^{1/2}>1-4^{k}\epsilon^{1/2}.

\mu_{\mathcal{T}(\operatorname{c}_{0})}\big{(}\big{\{}t\in\mathcal{T}(\operatorname{c}_{0}):g(x)\approx_{4^{k}\epsilon^{1/4}}\rho(t\operatorname{\circ}\Psi_{0^{k}})\big{\}}\big{)}>1-4^{k}\epsilon^{1/2}.

\mu_{\mathcal{T}(\operatorname{c})}\big{(}\big{\{}t\in\mathcal{T}(\operatorname{c}):\forall\,v\in\llbracket k\rrbracket,\,d_{\operatorname{Z}}\big{(}\rho(t\operatorname{\circ}\Psi_{v}),g\operatorname{\circ}\operatorname{c}(v)\big{)}\leq 4^{k}\epsilon^{1/4}\big{\}}\big{)}>1-8^{k}\epsilon^{1/2}.

d_{1}(g,0)=\int_{\operatorname{X}}d_{\operatorname{Z}}(g(x),0)\,\mathrm{d}\mu_{\operatorname{X}}(x)=\int_{\operatorname{X}}\int_{\operatorname{C}^{k}_{x}(\operatorname{X})}d_{\operatorname{Z}}(g(x),0)\,\mathrm{d}\mu_{\operatorname{C}^{k}_{x}(\operatorname{X})}(\operatorname{c})\,\mathrm{d}\mu_{\operatorname{X}}(x)

d_{n,k}(\mu,\nu)=d_{n,k}^{\prime}(\mu,\nu)+d_{n,k-1}\big{(}\mu\operatorname{\circ}(\pi_{k-1}^{\llbracket n\rrbracket})^{-1},\nu\operatorname{\circ}(\pi_{k-1}^{\llbracket n\rrbracket})^{-1}\big{)}.

g^{(k)}:\mathbb{Z}^{k+1}\to\widetilde{G},\;\;\mathbf{n}=(n_{0},n_{1},\dots,n_{k})\mapsto\big{(}g(n_{0}+v\cdot(n_{1},\dots,n_{k}))\big{)}_{v\in\llbracket k\rrbracket}.

Full text

Regularity and inverse theorems for uniformity norms on compact abelian groups and nilmanifolds

Pablo Candela

Universidad Autónoma de Madrid and ICMAT

Ciudad Universitaria de Cantoblanco

Madrid 28049

Spain

and

Balázs Szegedy

MTA Alfréd Rényi Institute of Mathematics

Reáltanoda utca 13-15

Budapest, Hungary, H-1053

Abstract.

We prove a general form of the regularity theorem for uniformity norms, and deduce an inverse theorem for these norms which holds for a class of compact nilspaces including all compact abelian groups, and also nilmanifolds; in particular we thus obtain the first non-abelian versions of such theorems. We derive these results from a general structure theorem for cubic couplings, thereby unifying these results with the Host–Kra Ergodic Structure Theorem. A unification of this kind had been propounded as a conceptual prospect by Host and Kra. Our work also provides new results on nilspaces. In particular, we obtain a new stability result for nilspace morphisms. We also strengthen a result of Gutman, Manners and Varjú, by proving that a $k$-step compact nilspace of finite rank is a toral nilspace (in particular, a connected nilmanifold) if and only if its $k$-dimensional cube set is connected. We also prove that if a morphism from a cyclic group of prime order into a compact finite-rank nilspace is sufficiently balanced (i.e. equidistributed in a certain quantitative and multidimensional sense), then the nilspace is toral. As an application of this, we obtain a new proof of a refinement of the Green–Tao–Ziegler inverse theorem.

2010 Mathematics Subject Classification: 11B30, 43A85, 37A45

1. Introduction

The inverse theorem for the Gowers norms is a major result in arithmetic combinatorics, with remarkable applications (see for instance [16, 17]), and is central to the theory known as higher-order Fourier analysis, initiated by Gowers in his seminal paper [14] (see also the survey [13]). The inverse theorem was proved in the breakthrough paper [19] by Green, Tao and Ziegler in the case of finite cyclic groups (more precisely, finite intervals of integers), and analogous results were obtained for vector spaces over a finite field of fixed characteristic in [1, 40, 41].

The Gowers norms can be defined on any compact abelian group, and these norms are special cases of more general uniformity norms, which can also be defined on nilmanifolds (see Definition 1.4, or [27, Ch. 12, §2]). The uniformity norms also have counterparts in other areas, especially in ergodic theory, where seminorms of a similar kind were introduced by Host and Kra in [26]. The main result regarding these seminorms, known as the Ergodic Structure Theorem (established in [26, Theorem 10]; see also [27]), is an analogue of, and was in fact an inspiration for, the inverse theorem for the Gowers norms, notably in its use of nilmanifolds.

An approach to higher-order Fourier analysis different from that in [19] was initiated by the second named author in [36], inspired on one hand by the work of Host and Kra, especially their introduction of parallelepiped structures [28], and on the other hand by the non-standard analysis viewpoint in graph limit theory [9]. This approach led to the development of the theory of nilspaces by Antolín Camarena and the second named author in [2], and initial applications of this theory to higher-order Fourier analysis were given in [37, 38]. The theory of nilspaces has since been detailed further; see for instance the treatment in [3, 4] detailing in particular the measure-theoretic aspects, and also the development by Gutman, Manners and Varjú in [20, 21, 22] with more emphasis on topological aspects and applications in dynamics. Nilspace-related topics have now grown to generate an active research area, which has found further uses in ergodic theory [5, 24], probability theory [7], and topological dynamics [23].

It became conceivable that more conceptual light could be shed on higher-order Fourier analysis by unifying the nilspace approach from [37, 38] with the ergodic theoretic methods from [26], a prospect raised notably by Host and Kra in [27, end of Ch. 17]. In [7], a framework for such a unification was put forward, based on the concept of a cubic coupling, inspired especially by the cubic measures from [26, §3.1]. A first application of cubic couplings was given in [7] by recovering and extending the Ergodic Structure Theorem of Host and Kra in this framework. Another central application was announced in the same paper [7], namely a result extending the inverse theorem from [19] to compact abelian groups and also to nilmanifolds and more general nilspaces. The main purpose of this paper is to prove this result. Let us emphasize that while the combination of nilspace theory with non-standard analysis in the preprints [37, 38] already yielded inverse theorems for uniformity norms, these were markedly less general than those presented here, and the results in the present paper follow a more conceptual approach using solely the material from the published (or to appear) papers [3, 4, 7]. Crucially, it is the use of the cubic coupling framework here which enables the extension of the inverse theorem beyond abelian groups and its unification with the Ergodic Structure Theorem.

Let us set up some terminology. First we describe the class of nilspaces involved in our main results. This class consists essentially of filtered (possibly disconnected) nilmanifolds. Such a nilmanifold can always be viewed as a nilspace, by equipping it with the cube sets determined by the filtration; see [4, Definition 1.1.2]. Since we shall work in the category of nilspaces, we want to capture precisely these nilmanifolds within this category, which we do with Definition 1.1 below.

Recall that $\operatorname{X}$ is a compact finite-rank nilspace (abbreviated to cfr nilspace) if $\operatorname{X}$ is a compact nilspace and every structure group of $\operatorname{X}$ is a Lie group [4, Definition 2.5.1]. (Following [2] and [4], we assume compact spaces to be second-countable, unless specifically stated otherwise. cfr nilspaces are called Lie-fibred nilspaces in [22].)

Definition 1.1 (cfr coset nilspaces).

We say that a $k$-step cfr nilspace is a coset nilspace if it is isomorphic to a nilmanifold $G/\Gamma$ (thus $G$ is a nilpotent Lie group and $\Gamma$ is a discrete cocompact subgroup of $G$) equipped with cube sets of the form $\operatorname{C}^{n}(G_{\bullet})/\operatorname{C}^{n}(\Gamma_{\bullet})$, $n\geq 0$, where $G_{\bullet}=(G_{i})_{i\geq 0}$ is a filtration of degree at most $k$ of closed subgroups $G_{i}\lhd G$, and $\Gamma_{\bullet}=(\Gamma_{i})_{i\geq 0}$ is a filtration on $\Gamma$ where $\Gamma_{i}=\Gamma\cap G_{i}$ is cocompact in $G_{i}$, $i\geq 0$.
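To illustrate Definition 1.1, the following sketch records a standard example that is not taken from the text above: the Heisenberg nilmanifold with its lower central series. The matrix presentation and the choice of filtration are assumptions made for this illustration.

```latex
% Illustrative example (assumed, not from the source): the Heisenberg nilmanifold.
\[
G=\left\{\begin{pmatrix}1&x&z\\0&1&y\\0&0&1\end{pmatrix}:x,y,z\in\mathbb{R}\right\},
\qquad
\Gamma=\left\{\begin{pmatrix}1&a&c\\0&1&b\\0&0&1\end{pmatrix}:a,b,c\in\mathbb{Z}\right\}.
\]
% The lower central series, viewed as a filtration of degree 2:
\[
G_{0}=G_{1}=G,\qquad
G_{2}=[G,G]=\left\{\begin{pmatrix}1&0&z\\0&1&0\\0&0&1\end{pmatrix}:z\in\mathbb{R}\right\},\qquad
G_{i}=\{\mathrm{id}\}\ \ (i\geq 3).
\]
% Each \Gamma_i = \Gamma \cap G_i is cocompact in G_i:
\[
G_{1}/\Gamma_{1}=G/\Gamma\ \text{is compact},\qquad
G_{2}/\Gamma_{2}\cong\mathbb{R}/\mathbb{Z}.
\]
```

With these choices, $G/\Gamma$ equipped with the cube sets $\operatorname{C}^{n}(G_{\bullet})/\operatorname{C}^{n}(\Gamma_{\bullet})$ should be a $2$-step cfr coset nilspace in the sense of Definition 1.1.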

Our main results concern the class of compact nilspaces that are inverse limits of cfr coset nilspaces (see [4, §2.7] for the inverse limit construction in this category). This includes all compact abelian groups, and more generally all inverse limits of nilmanifolds.

We deduce the inverse theorem from a regularity theorem for functions on nilspaces in the above class, namely Theorem 1.5. Regularity results in arithmetic combinatorics are inspired by the well-known regularity lemmas from graph theory, and have hitherto focused on functions on abelian groups (see for instance [16, Theorem 1.2]). The point of Theorem 1.5 below is that a bounded measurable function on a cfr coset nilspace can always be decomposed into a sum of a structured function plus two errors, one error being very small in a prescribed uniformity norm, and the other being negligible in the $L^{1}$ -norm. The structured function is a nilspace polynomial of bounded complexity, a generalization of nilsequences that was introduced in [37]. To define nilspace polynomials, we first recall a general notion of complexity for cfr nilspaces. Recall that there are countably many cfr nilspaces up to isomorphism; see [2, Theorem 3], [4, Theorem 2.6.1].

Definition 1.2.

By a complexity notion for cfr nilspaces, we mean a bijection from the countable set of isomorphism classes of cfr nilspaces to $\mathbb{N}$ . Having fixed such a bijection, for $m>0$ we say that a cfr nilspace $\operatorname{X}$ has complexity at most $m$ , and write $\textrm{Comp}(\operatorname{X})\leq m$ , if its image under the bijection is at most $m$ .

Similarly to [19], in this paper we do not pursue explicit bounds for our main results, so we do not need to be specific about the complexity notion being used. In fact our results hold for any prescribed complexity notion.

Definition 1.3 (Nilspace polynomials).

Let $\operatorname{X}$ be a compact nilspace. A function $f:\operatorname{X}\to\mathbb{C}$ is a nilspace polynomial of degree $k$ if $f=F\operatorname{\circ}\phi$ where $\phi:\operatorname{X}\to\operatorname{Y}$ is a continuous morphism, $\operatorname{Y}$ is a $k$-step cfr nilspace, and $F$ is continuous; $f$ has complexity $\leq m$, denoted $\textrm{Comp}(f)\leq m$, if $F$ has Lipschitz constant $\leq m$ and $\textrm{Comp}(\operatorname{Y})\leq m$.

The Lipschitz constant here relates to a Riemannian metric that we fix from the start on each cfr nilspace, using the fact that these spaces are finite-dimensional manifolds [4, Lemma 2.5.3]. Our regularity theorem ensures also that the morphism involved in the structured part satisfies a strong quantitative equidistribution property that we call balance (following [38]). This useful property has a technical definition (concerning morphisms and also nilspace polynomials), which we detail later; see Definition 5.1.

Definition 1.4 (Uniformity seminorms on compact nilspaces).

For $d\geq 2$, the $U^{d}$-seminorm of a bounded Borel function $f:\operatorname{X}\to\mathbb{C}$ on a compact nilspace $\operatorname{X}$ is defined by $\|f\|_{U^{d}}=\big(\int_{\operatorname{c}\in\operatorname{C}^{d}(\operatorname{X})}\prod_{v\in\{0,1\}^{d}}\mathcal{C}^{|v|}f(\operatorname{c}(v))\,\mathrm{d}\mu(\operatorname{c})\big)^{1/2^{d}}$, where $\mu$ is the Haar measure on the cube set $\operatorname{C}^{d}(\operatorname{X})$ (this refers to the canonical Borel probability measure on a cube set in nilspace theory; see [4, §2.2.2]), $\mathcal{C}$ denotes the complex conjugation operator, and $|v|=\sum_{i=1}^{d}v(i)$.
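As a sanity check on Definition 1.4, the case $d=2$ can be unpacked when $\operatorname{X}$ is a compact abelian group $Z$. The sketch below assumes the standard (degree-1) cube structure, in which a $2$-cube is $(x,\,x+h_{1},\,x+h_{2},\,x+h_{1}+h_{2})$ with $x,h_{1},h_{2}$ independent and Haar-distributed. The notation $\widehat{f}(\chi)=\int_{Z}f\,\overline{\chi}\,\mathrm{d}\mu_{Z}$ for Fourier coefficients is also an assumption of this illustration.

```latex
% U^2 on a compact abelian group Z with its standard cube structure (assumed setting).
\[
\|f\|_{U^{2}}^{4}
=\int_{Z^{3}} f(x)\,\overline{f(x+h_{1})}\,\overline{f(x+h_{2})}\,f(x+h_{1}+h_{2})\,
\mathrm{d}\mu_{Z}(x)\,\mathrm{d}\mu_{Z}(h_{1})\,\mathrm{d}\mu_{Z}(h_{2})
=\sum_{\chi\in\widehat{Z}}\big|\widehat{f}(\chi)\big|^{4}.
\]
```

The conjugation pattern matches $\mathcal{C}^{|v|}$: the vertices $v=01$ and $v=10$ have $|v|=1$ and are conjugated, while $v=00$ and $v=11$ are not.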

For a proof of the seminorm properties, and a discussion of when these quantities are norms, see Lemma A.4. We can now state our main result.

Theorem 1.5 (Regularity).

Let $k\in\mathbb{N}$ and let $\mathcal{D}:\mathbb{R}_{>0}\times\mathbb{N}\to\mathbb{R}_{>0}$ be an arbitrary function. For every $\epsilon>0$ there exists $N=N(\epsilon,\mathcal{D})>0$ such that the following holds. For every compact nilspace $\operatorname{X}$ that is an inverse limit of cfr coset nilspaces, and every Borel function $f:\operatorname{X}\to\mathbb{C}$ with $|f|\leq 1$ , there is a decomposition $f=f_{s}+f_{e}+f_{r}$ and number $m\leq N$ such that the following properties hold:

(i) $f_{s}$ is a $\mathcal{D}(\epsilon,m)$-balanced nilspace polynomial of degree $k$, $|f_{s}|\leq 1$, $\textup{Comp}(f_{s})\leq m$,

(ii) $\|f_{e}\|_{L^{1}}\leq\epsilon$,

(iii) $\|f_{r}\|_{U^{k+1}}\leq\mathcal{D}(\epsilon,m)$, $|f_{r}|\leq 1$ and $\max\{|\langle f_{r},f_{s}\rangle|,\,|\langle f_{r},f_{e}\rangle|\}\leq\mathcal{D}(\epsilon,m)$.

Here $\langle f,g\rangle$ denotes the inner product $\int_{\operatorname{X}}f\,\overline{g}\,\mathrm{d}\mu_{\operatorname{X}}$ where $\mu_{\operatorname{X}}$ is the Haar measure on $\operatorname{X}$ . We use the term 1-bounded function for a function $f:\operatorname{X}\to\mathbb{C}$ with modulus at most 1 everywhere (denoted $|f|\leq 1$ ). Using Theorem 1.5, we obtain our next main result.

Theorem 1.6 (Inverse theorem).

Let $k\in\mathbb{N}$ and $\delta\in(0,1]$ . Then there is $m>0$ such that for every compact nilspace $\operatorname{X}$ that is an inverse limit of cfr coset nilspaces, and every 1-bounded Borel function $f:\operatorname{X}\to\mathbb{C}$ with $\|f\|_{U^{k+1}}\geq\delta$ , there is a 1-bounded nilspace polynomial $F\operatorname{\circ}\phi$ of degree $k$ and complexity $\leq m$ such that $\langle f,F\operatorname{\circ}\phi\rangle\geq\delta^{2^{k+1}}/2$ .
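For orientation, the classical case $k=1$ on a compact abelian group $Z$ can be settled by Fourier analysis. The sketch below is the well-known $U^{2}$ argument, not the proof given in this paper. It assumes the standard cube structure on $Z$, the Fourier coefficients $\widehat{f}(\chi)=\langle f,\chi\rangle$, and the standard identity $\|f\|_{U^{2}}^{4}=\sum_{\chi}|\widehat{f}(\chi)|^{4}$.

```latex
% Classical U^2 inverse argument for a 1-bounded f with \|f\|_{U^2} \geq \delta (illustration only).
\[
\delta^{4}\leq\|f\|_{U^{2}}^{4}
=\sum_{\chi\in\widehat{Z}}\big|\widehat{f}(\chi)\big|^{4}
\leq\sup_{\chi}\big|\widehat{f}(\chi)\big|^{2}\,\sum_{\chi\in\widehat{Z}}\big|\widehat{f}(\chi)\big|^{2}
=\sup_{\chi}\big|\widehat{f}(\chi)\big|^{2}\,\|f\|_{L^{2}}^{2}
\leq\sup_{\chi}\big|\widehat{f}(\chi)\big|^{2}.
\]
% The coefficients are square-summable, so the supremum is attained by some character \chi:
\[
|\langle f,\chi\rangle|\geq\delta^{2}\geq\delta^{4}/2 .
\]
```

A character can be written as $F\operatorname{\circ}\phi$ with $\phi:Z\to\mathbb{R}/\mathbb{Z}$ a continuous homomorphism and $F(t)=\theta e^{2\pi it}$ for a suitable unimodular constant $\theta$ making the inner product real and non-negative. In this special case the correlating function is thus of the form described in Theorem 1.6 with $k=1$.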

As detailed below, we deduce Theorem 1.5 from results on cubic couplings from [7]. In particular, this yields directly that the nilspace polynomial in this result is arbitrarily well balanced in relation to its complexity (this then holds also in the inverse theorem; see Theorem 5.2). In the case of finite cyclic groups, a property implying the balance property, called irrationality, can be added a posteriori to the regularity theorem, using separate arguments; see [16]. Let us emphasize also that to obtain the extension beyond abelian groups in Theorem 1.6, our proof differs markedly from that in [38]; see Section 3, in particular Remark 3.3, and Remark 3.11 on possible further extensions.

After proving Theorems 1.5 and 1.6, we focus on the important case where $\operatorname{X}$ consists of a cyclic group $\mathbb{Z}_{p}$ of prime order $p$ , in order to show that in this case Theorem 1.6 implies a refinement of the Green–Tao–Ziegler inverse theorem. More precisely, we obtain the following version of [19, Conjecture 4.5]. This uses the notation $\operatorname{poly}(\mathbb{Z},G_{\bullet})$ for the group of polynomial maps $\mathbb{Z}\to G$ relative to a filtration $G_{\bullet}$ (see [30, 18]).

Theorem 1.7.

Let $k\in\mathbb{N}$ and let $\delta\in(0,1]$. There exists a finite set $\mathcal{M}_{k,\delta}$ of connected filtered nilmanifolds $(G/\Gamma,G_{\bullet})$, each equipped with a smooth Riemannian metric $d_{G/\Gamma}$, and a constant $C_{k,\delta}>0$, with the following property. For every prime $p$ and 1-bounded function $f:\mathbb{Z}_{p}\to\mathbb{C}$ with $\|f\|_{U^{k+1}}\geq\delta$, there exists $G/\Gamma\in\mathcal{M}_{k,\delta}$, a polynomial $g\in\operatorname{poly}(\mathbb{Z},G_{\bullet})$ that is $p$-periodic mod $\Gamma$, and a continuous $1$-bounded function $F:G/\Gamma\to\mathbb{C}$ with Lipschitz constant at most $C_{k,\delta}$ relative to $d_{G/\Gamma}$, such that $|\mathbb{E}_{x\in\mathbb{Z}_{p}}f(x)\overline{F(g(x)\Gamma)}|\geq\delta^{2^{k+1}}/2$.

Remark 1.8.

Theorem 1.7 refines [19, Theorem 1.3] in that $g$ is directly ensured to be $p$ -periodic mod $\Gamma$ (i.e. $g(n)^{-1}g(n+p)\in\Gamma$ for all $n\in\mathbb{Z}$ ), thus yielding a well-defined morphism $\mathbb{Z}_{p}\to G/\Gamma$ . This periodicity was first established in the inverse theorem in [37], and is a notable (though not exclusive) feature of the nilspace approach (periodicity is not obtained directly in [19, Theorem 1.3], but it is obtained in the more recent proof in [33]). Periodicity can also be included a posteriori in [19, Theorem 1.3] with additional arguments; see [32]. Another useful refinement that our proof can add directly to Theorem 1.7 is that the nilsequence is arbitrarily well balanced in relation to the complexity of $G/\Gamma$ (for the same reason mentioned above for Theorem 5.2).
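The periodicity condition in Remark 1.8 can be checked by hand in the simplest abelian case. The sketch below is an illustration under assumed choices that do not come from the paper: $G=\mathbb{R}$ written additively, $\Gamma=\mathbb{Z}$, the degree-2 filtration $G_{0}=G_{1}=G_{2}=\mathbb{R}$, $G_{3}=\{0\}$, and the quadratic $g(n)=an^{2}/p$ with $a\in\mathbb{Z}$.

```latex
% p-periodicity mod \Gamma for g(n) = a n^2 / p in G = \mathbb{R}, \Gamma = \mathbb{Z} (assumed example).
% In additive notation, g(n)^{-1} g(n+p) becomes g(n+p) - g(n):
\[
g(n+p)-g(n)=\frac{a\big((n+p)^{2}-n^{2}\big)}{p}=\frac{a\,(2np+p^{2})}{p}=a\,(2n+p)\in\mathbb{Z}
\quad\text{for all } n\in\mathbb{Z}.
\]
% Hence n \mapsto g(n)\Gamma descends to a well-defined map on \mathbb{Z}_p, and with F(t) = e^{2\pi i t}:
\[
F\big(g(n)\Gamma\big)=e^{2\pi i\,a n^{2}/p}.
\]
```

In this toy case the resulting nilsequence is a quadratic phase on $\mathbb{Z}_{p}$, and the nilmanifold $\mathbb{R}/\mathbb{Z}$ is connected, as required in Theorem 1.7.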

Remark 1.9.

Let us elaborate on how Theorem 1.6 relates to previous non-quantitative inverse theorems such as [19, Theorem 1.3] or [38, Theorem 2]. One aspect is that Theorem 1.6 extends these results via its premise, by being applicable to functions $f$ on domains more general than compact abelian groups. Another aspect concerns how the theorem’s conclusion relates to the conclusions of previous such results, and more precisely how the bounded-complexity nilspace polynomials, obtained as correlating harmonics in Theorem 1.6, relate to harmonics such as the nilsequences in [19, Theorem 1.3]. The cfr nilspaces, underlying nilspace polynomials, are generalizations of nilmanifolds which still have strong structural properties akin to several of the most useful properties of nilmanifolds (such properties include an iterated-bundle structure with compact abelian Lie fibers [4, §2.5], [3, §3.2.3]; a nilpotent Lie group action compatible with the cube structure [4, §3.2.4 and Theorem 2.9.10]; and related tools in nilspace theory). Moreover, a key fact detailed in this paper is that when one restricts these nilspaces to the setting of previous results such as [19, Theorem 1.3], one recovers exactly the more explicit structure of nilmanifolds. More precisely, the crux of Theorem 1.7, compared to Theorem 1.6, is that in the specific $\mathbb{Z}_{p}$ setting of the former, the balanced nilspace polynomials obtained from the general setting are shown to be precisely nilsequences generated by $p$ -periodic orbits on connected nilmanifolds (these nilsequences are the same thing as nilspace polynomials from $\mathbb{Z}_{p}$ into connected cfr coset nilspaces). This is established in Theorem 6.1.

Recall that a compact nilspace is toral if its structure groups are tori [4, Definition 2.9.14] (it is then also a connected nilmanifold [4, Theorem 2.9.17]). A key element in our proof of Theorem 6.1 is the following new result about compact nilspaces.

Theorem 1.10.

A $k$-step cfr nilspace is toral if and only if its $k$-cube set is connected.

A result in the direction of Theorem 1.10 was observed in [22]. Namely, [22, Theorem 1.22] was noted to imply that if all the cube sets of a cfr nilspace are connected then the nilspace is toral. Theorem 1.10 strengthens this result: the connectedness of the set of $k$ -cubes suffices. The proof of Theorem 1.10 is given in Appendix A.

Remark 1.11.

Following terminology from [38], we say that a family of finite abelian groups $(\operatorname{Z}_{i})_{i\in\mathbb{N}}$ is of characteristic 0 if for every prime $p$ there are only finitely many indices $i$ such that $p$ divides the order of $\operatorname{Z}_{i}$ . Our proof of Theorem 1.7 can be adapted in a straightforward way to yield an analogue of this theorem where the groups $\mathbb{Z}_{p}$ are replaced by any family of characteristic 0. We omit the details in this paper.

In the quantitative direction, a proof of the inverse theorem in the case of cyclic groups $\mathbb{Z}_{p}$ was given with reasonable bounds in a recent breakthrough by Manners [33], and in the case of vector spaces $\mathbb{F}_{p}^{n}$ , in another recent breakthrough by Gowers and Milićević [15]. As mentioned in [33], currently these quantitative results cannot be made to overlap. On a conceptual level, the present paper shows that the notion of nilspace polynomials (and nilspace theory more generally) offers a framework in which a more general inverse theorem can be obtained, valid in particular for any compact abelian group (namely Theorem 1.6), from which more specific inverse theorems such as the Green–Tao–Ziegler theorem can be fully recovered and extended.

The structure of the paper is as follows. In Section 2 we recall some background on analysis in ultraproducts, and we outline its use in proving Theorem 1.5. In Section 3, we analyze ultraproducts of cfr coset nilspaces to locate certain factors that have a cubic coupling structure. This will enable us to apply our structure theorem from [7], as a crucial step in our proof of Theorem 1.5. In Section 4, we prove a new stability result for morphisms into cfr nilspaces, Theorem 4.2, which is central to our proof of Theorem 1.5 and seems to be also of intrinsic interest. In Section 5 we combine the above elements to prove Theorems 1.5 and 1.6. In Section 6 we prove Theorem 1.7.

Acknowledgements. We thank Terence Tao for useful feedback. The first-named author received funding from Spain’s MICINN project MTM2017-83496-P. The second-named author received funding from the European Research Council under the European Union’s Seventh Framework Programme (FP7/2007-2013)/ERC grant agreement 617747. The research was supported partially by the NKFIH “Élvonal” KKP 133921 grant and partially by the Mathematical Foundations of Artificial Intelligence project of the National Excellence Programme (grant no. 2018-1.2.1-NKP-2018-00008). We also thank the anonymous referee for valuable feedback helping to improve this paper.

2. Ultraproducts of nilspaces, and an outline of the main proof

We begin by recalling some basic notions concerning ultraproducts and the Loeb measure. We do so primarily to gather the required terminology and notation. For more background on these tools we refer to standard texts such as [35], or more recent treatments such as [39, §1.7, §2.10]. More detail on the use of these tools specifically in higher-order Fourier analysis can also be found in [42].

For each $i\in\mathbb{N}$ let $\operatorname{X}_{i}$ be a set equipped with a $\sigma$-algebra $\mathcal{B}_{i}$ and a probability measure $\lambda_{i}$ on $\mathcal{B}_{i}$. We also fix from now on a non-principal ultrafilter $\omega$ on $\mathbb{N}$ (see [39, §1.7.1]). We denote by $\prod_{i\to\omega}\operatorname{X}_{i}$ the ultraproduct of the sets $\operatorname{X}_{i}$, that is, the quotient of the cartesian product $\prod_{i\in\mathbb{N}}\operatorname{X}_{i}$ under the equivalence relation $(x_{i})_{i}\sim(y_{i})_{i}\;\Leftrightarrow\{i\in\mathbb{N}:x_{i}=y_{i}\}\in\omega$. We often denote such ultraproducts using boldface, thus $\mathbf{X}=\prod_{i\to\omega}\operatorname{X}_{i}$. We can equip $\mathbf{X}$ with a $\sigma$-algebra and a probability measure as follows. A set $B\subset\mathbf{X}$ is called an internal set if $B=\prod_{i\to\omega}B_{i}$ for some sequence of sets $B_{i}\subset\operatorname{X}_{i}$, $i\in\mathbb{N}$, and is an internal measurable set if $\{i:B_{i}\in\mathcal{B}_{i}\}\in\omega$. For each internal measurable set $B$, we define the real number $\lambda(B)\in[0,1]$ to be the standard part of the ultralimit (see [39, Definition 1.7.9]) of the numbers $\lambda_{i}(B_{i})$, that is $\lambda(B)=\mathrm{st}\big(\lim_{i\to\omega}\lambda_{i}(B_{i})\big)$. More generally, for any compact Hausdorff space $Y$, for every sequence of functions $f_{i}:\operatorname{X}_{i}\to Y$ we can define a function $\mathbf{X}\to Y$, $x\mapsto\mathrm{st}\big(\lim_{i\to\omega}f_{i}(x_{i})\big)$, where $(x_{i})_{i}$ is any representative of the class $x$, the value of this function being the unique point $y\in Y$ such that for every open set $U\ni y$ we have $\{i:f_{i}(x_{i})\in U\}\in\omega$. (To see the existence of $y$, note that if no such $y$ existed then using compactness we could cover $Y$ with finitely many open sets $U$ with $\{i:f_{i}(x_{i})\in U\}\not\in\omega$, which would contradict that $\omega$ is an ultrafilter. The uniqueness follows from the Hausdorff property and a similar use of the ultrafilter's properties.) As in several texts in this area, we shorten the notation $\mathrm{st}\big(\lim_{i\to\omega}f_{i}\big)$; we denote this by $\lim_{\omega}f_{i}$.
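A small worked example may help fix the definition of $\lambda$ on internal measurable sets. The spaces, measures and sets below are assumptions chosen for illustration and do not come from the paper. The sketch uses the standard fact that for a convergent real sequence, the standard part of the ultralimit along any non-principal ultrafilter equals the ordinary limit.

```latex
% Illustrative example; the spaces X_i, the sets B_i and the moduli N_i are assumptions of this sketch.
\[
\operatorname{X}_{i}=\mathbb{Z}_{N_{i}}\ \ (N_{i}\to\infty),\qquad
\lambda_{i}=\text{uniform probability measure on }\operatorname{X}_{i},\qquad
B_{i}=\{0,1,\dots,\lfloor N_{i}/2\rfloor\}.
\]
\[
\lambda_{i}(B_{i})=\frac{\lfloor N_{i}/2\rfloor+1}{N_{i}}\longrightarrow\frac{1}{2}
\quad\Longrightarrow\quad
\lambda\Big(\prod_{i\to\omega}B_{i}\Big)=\mathrm{st}\big(\lim_{i\to\omega}\lambda_{i}(B_{i})\big)=\frac{1}{2}.
\]
% By contrast, for the non-convergent constant functions f_i = (-1)^i the value depends on \omega:
\[
\lim_{\omega}(-1)^{i}=
\begin{cases}
1 & \text{if } \{i\in\mathbb{N}: i \text{ even}\}\in\omega,\\
-1 & \text{otherwise.}
\end{cases}
\]
```

The second display reflects that exactly one of the sets of even and odd indices belongs to $\omega$, so the choice of ultrafilter matters only when the sequence fails to converge.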

Definition 2.1.

Given probability spaces $(\operatorname{X}_{i},\mathcal{B}_{i},\lambda_{i})$ , $i\in\mathbb{N}$ , and a non-principal ultrafilter $\omega$ on $\mathbb{N}$ , we define the corresponding Loeb measure to be the probability measure $\lambda$ obtained by applying the Hahn–Kolmogorov extension theorem to the premeasure $\prod_{i\to\omega}B_{i}\mapsto\lim_{\omega}\lambda_{i}(B_{i})$ defined on internal measurable subsets of $\mathbf{X}$ (see [35, Theorem 2.1], [39, Theorem 2.10.2]). The corresponding Loeb $\sigma$ -algebra, denoted by $\mathcal{L}_{\mathbf{X}}$ , is the completion of the $\sigma$ -algebra on $\mathbf{X}$ generated by the internal measurable sets.

Recall that for any sequence of functions $(f_{i}:\operatorname{X}_{i}\to Y)_{i\in\mathbb{N}}$ into a compact set $Y\subset\mathbb{C}$ , if $f_{i}$ is $\mathcal{B}_{i}$ -measurable for all $i$ in some set $S\in\omega$ , then $\lim_{\omega}f_{i}:\mathbf{X}\to Y$ is $\mathcal{L}_{\mathbf{X}}$ -measurable (see [35, Theorem 5.1]).

We now focus on ultraproducts of nilspaces. If each set $\operatorname{X}_{i}$ is a nilspace, with cube sets $\operatorname{C}^{n}(\operatorname{X}_{i})$ , $n\geq 0$ (where $\operatorname{C}^{0}(\operatorname{X}_{i})=\operatorname{X}_{i}$ ), then it is easily checked that the ultraproduct $\mathbf{X}$ equipped with cube sets $\operatorname{C}^{n}(\mathbf{X}):=\prod_{i\to\omega}\operatorname{C}^{n}(\operatorname{X}_{i})$ satisfies the nilspace axioms as well.

Let us now outline the proof of Theorem 1.5, and especially our use of ultraproducts. We argue by contradiction, supposing that there is a sequence of 1-bounded Borel functions $f_{i}:\operatorname{X}_{i}\to\mathbb{C}$ that disproves the theorem (thus for some $\epsilon>0$ and real numbers $N_{i}\to\infty$ as $i\to\infty$ , for each $i$ the required decomposition fails for $f_{i}$ , $\epsilon$ and $N_{i}$ ). We then consider the 1-bounded function $f=\lim_{\omega}f_{i}:\mathbf{X}\to\mathbb{C}$ , and analyze this using results on cubic couplings from [7]. To detail this further, we need to recall the notion of a cubic coupling. To this end we first recall the following notation from [7].

We write $\llbracket n\rrbracket$ for the discrete $n$ -cube $\{0,1\}^{n}$ . Two $(n-1)$ -faces $F_{0},F_{1}\subset\llbracket n\rrbracket$ are adjacent if $F_{0}\cap F_{1}\neq\emptyset$ . For finite sets $T\subset S$ and a system of sets $(A_{v})_{v\in S}$ , we write $p_{T}$ for the coordinate projection $\prod_{v\in S}A_{v}\to\prod_{v\in T}A_{v}$ . Given a probability space $\varOmega=(\Omega,\mathcal{A},\lambda)$ , we write $\mathcal{A}^{S}$ for the product $\sigma$ -algebra $\bigotimes_{v\in S}\mathcal{A}=\bigvee_{v\in S}p_{v}^{-1}(\mathcal{A})$ on $\Omega^{S}$ (where, given $\sigma$ -algebras $\mathcal{B}_{v}$ on a set, $\bigvee_{v\in S}\mathcal{B}_{v}$ denotes their join, i.e. the smallest $\sigma$ -algebra on this set that includes $\mathcal{B}_{v}$ for all $v\in S$ ). We write $\mathcal{A}^{S}_{T}$ for the sub- $\sigma$ -algebra of $\mathcal{A}^{S}$ consisting of sets depending only on coordinates indexed in $T$ , i.e. $\mathcal{A}^{S}_{T}=\bigvee_{v\in T}p_{v}^{-1}(\mathcal{A})$ . We write $\mathcal{B}_{0}\wedge_{\lambda}\mathcal{B}_{1}$ for the meet of $\sigma$ -algebras $\mathcal{B}_{0},\mathcal{B}_{1}\subset\mathcal{A}$ (see [7, Definition 2.6]), and $\mathcal{B}_{0}\operatorname{\perp\!\!\!\perp}_{\lambda}\mathcal{B}_{1}$ for the relation of conditional independence, which holds if and only if $\forall f\in L^{\infty}(\mathcal{B}_{0})$ , $\mathbb{E}(f|\mathcal{B}_{1})\in L^{\infty}(\mathcal{B}_{0})$ ; see [7, Proposition 2.10]. (We omit the subscript $\lambda$ from $\wedge_{\lambda},\operatorname{\perp\!\!\!\perp}_{\lambda}$ when the measure $\lambda$ is clear.) Inclusion and equality up to $\lambda$ -null sets are denoted by $\subset_{\lambda}$ and $=_{\lambda}$ respectively [7, §2.1]. We write $\operatorname{\mathsf{Cg}}(\varOmega,S)$ for the space of self-couplings of $\varOmega$ indexed by $S$ [7, Definition 2.20]. 
Finally, given $\mu\in\operatorname{\mathsf{Cg}}(\varOmega,S)$ and an injection $\phi:R\to S$ , we write $\mu_{\phi}$ for the subcoupling of $\mu$ along $\phi$ [7, Definition 2.26]. Let us now recall the notion of a cubic coupling [7, Definition 3.1].

Definition 2.2.

A cubic coupling on a probability space $\varOmega=(\Omega,\mathcal{A},\lambda)$ is a sequence $\big{(}\mu^{\llbracket n\rrbracket}\in\operatorname{\mathsf{Cg}}(\varOmega,\llbracket n\rrbracket)\big{)}_{n\geq 0}$ satisfying the following axioms for all $m,n\geq 0$ :

1. (Consistency) If $\phi:\llbracket m\rrbracket\to\llbracket n\rrbracket$ is an injective cube morphism then $\mu^{\llbracket n\rrbracket}_{\phi}=\mu^{\llbracket m\rrbracket}$ .

2. (Ergodicity) The measure $\mu^{\llbracket 1\rrbracket}$ is the product measure $\lambda\times\lambda$ .

3. (Conditional independence) For every pair of adjacent faces $F_{0},F_{1}$ of codimension 1 in $\llbracket n\rrbracket$ , we have $\mathcal{A}^{\llbracket n\rrbracket}_{F_{0}}\operatorname{\perp\!\!\!\perp}_{\mu^{\llbracket n\rrbracket}}\mathcal{A}^{\llbracket n\rrbracket}_{F_{1}}$ and $\mathcal{A}^{\llbracket n\rrbracket}_{F_{0}}\wedge_{\mu^{\llbracket n\rrbracket}}\mathcal{A}^{\llbracket n\rrbracket}_{F_{1}}=_{\mu^{\llbracket n\rrbracket}}\mathcal{A}^{\llbracket n\rrbracket}_{F_{0}\cap F_{1}}$ .
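To make the three axioms concrete, the following sketch checks them by brute force in a toy case that is not taken from the paper: the finite group $\mathbb{Z}_{N}$ with its standard cubes $v\mapsto x+v_{1}h_{1}+\cdots+v_{n}h_{n}$ and the uniform measure on each cube set playing the role of $\mu^{\llbracket n\rrbracket}$ . The modulus `N = 5` and all function names are assumptions made for illustration. In this finite setting, axiom 3 for $n=2$ amounts to ordinary conditional independence of the coordinates on $F_{0}$ and $F_{1}$ given the coordinate on $F_{0}\cap F_{1}$ , which is what the last check tests.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

N = 5  # toy modulus, chosen only for this sketch


def vertices(n):
    """Vertices of the discrete cube {0,1}^n in a fixed order."""
    return list(product((0, 1), repeat=n))


def cubes(n):
    """Standard n-cubes on Z_N: v -> x + v_1*h_1 + ... + v_n*h_n (mod N)."""
    found = set()
    for x, *h in product(range(N), repeat=n + 1):
        found.add(tuple((x + sum(a * b for a, b in zip(v, h))) % N
                        for v in vertices(n)))
    return sorted(found)


def uniform(n):
    """Uniform probability on the n-cubes, playing the role of mu^[n]."""
    cs = cubes(n)
    return {c: Fraction(1, len(cs)) for c in cs}


def subcoupling(n, m, phi):
    """Image of mu^[n] under c -> c o phi, for phi: {0,1}^m -> {0,1}^n."""
    index = {v: k for k, v in enumerate(vertices(n))}
    cs = cubes(n)
    counts = Counter(tuple(c[index[phi(w)]] for w in vertices(m)) for c in cs)
    return {key: Fraction(val, len(cs)) for key, val in counts.items()}


def check_axioms():
    # Ergodicity: the 1-cubes are all pairs, so mu^[1] is the product measure.
    ergodic = set(cubes(1)) == set(product(range(N), repeat=2))

    # Consistency for several injective cube morphisms {0,1}^1 -> {0,1}^2.
    morphisms = [lambda w: (w[0], 0), lambda w: (w[0], 1),
                 lambda w: (0, w[0]), lambda w: (1, w[0]),
                 lambda w: (w[0], w[0]), lambda w: (w[0], 1 - w[0])]
    consistent = all(subcoupling(2, 1, phi) == uniform(1) for phi in morphisms)

    # Conditional independence for the adjacent faces F0 = {00, 10} and
    # F1 = {00, 01}, which meet in the vertex 00: given c(00), the values
    # c(10) and c(01) should be independent. All counts share the same
    # denominator, so the identity can be tested on raw counts.
    index = {v: k for k, v in enumerate(vertices(2))}
    i00, i10, i01 = index[(0, 0)], index[(1, 0)], index[(0, 1)]
    cs = cubes(2)
    joint = Counter((c[i00], c[i10], c[i01]) for c in cs)
    p0 = Counter(c[i00] for c in cs)
    p_f0 = Counter((c[i00], c[i10]) for c in cs)
    p_f1 = Counter((c[i00], c[i01]) for c in cs)
    independent = all(
        joint[(x, a, b)] * p0[x] == p_f0[(x, a)] * p_f1[(x, b)]
        for x, a, b in product(range(N), repeat=3))
    return ergodic, consistent, independent
```

Calling `check_axioms()` returns one boolean per axiom. The check is only a finite sanity test for this example and says nothing about the measure-theoretic subtleties that arise for ultraproducts.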

Given any cubic coupling, one can define an associated family of uniformity seminorms that generalize the Gowers norms [7, Definition 3.15]. The structure theorem for cubic couplings [7, Theorem 4.2] tells us that the characteristic factor corresponding to the $k$ -th order uniformity seminorm on a cubic coupling is a $k$ -step compact nilspace. Given the functions $f_{i}:\operatorname{X}_{i}\to\mathbb{C}$ that we started with above, which were supposed not to satisfy the decomposition in Theorem 1.5, our goal is to apply the structure theorem to some suitable cubic coupling obtained using $\mathbf{X}$ and $f$ , in order to obtain eventually the contradiction that some function $f_{i}$ does in fact satisfy the required decomposition.
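As an illustration of the seminorms just mentioned, here is a small sketch for the toy case of $\mathbb{Z}_{N}$ with its standard cubes, assuming that the second-order seminorm is given by the usual cube average $\|f\|_{U^{2}}^{4}=\mathbb{E}_{x,h_{1},h_{2}}f(x)\overline{f(x+h_{1})}\,\overline{f(x+h_{2})}f(x+h_{1}+h_{2})$ , i.e. the classical Gowers $U^{2}$ norm. It compares this average with the sum of fourth powers of the Fourier coefficients, a standard identity for $U^{2}$ . The function names and the choice $N=7$ in the usage note are hypothetical.

```python
import cmath
from itertools import product


def u2_fourth_power(f):
    """Average of f(x) conj(f(x+h1)) conj(f(x+h2)) f(x+h1+h2) over Z_N."""
    n = len(f)
    total = 0j
    for x, h1, h2 in product(range(n), repeat=3):
        total += (f[x] * f[(x + h1) % n].conjugate()
                  * f[(x + h2) % n].conjugate() * f[(x + h1 + h2) % n])
    return (total / n ** 3).real


def fourier_l4(f):
    """Sum over r of |f^(r)|^4, where f^(r) = E_x f(x) e(-r x / N)."""
    n = len(f)
    coeffs = [sum(f[x] * cmath.exp(-2j * cmath.pi * r * x / n)
                  for x in range(n)) / n
              for r in range(n)]
    return sum(abs(c) ** 4 for c in coeffs)


def phase(n, degree):
    """The function x -> e(x^degree / N) on Z_N, as a list of complex values."""
    return [cmath.exp(2j * cmath.pi * (x ** degree) / n) for x in range(n)]
```

For instance, with `phase(7, 1)` (a character) the cube average is 1 up to rounding, while for the quadratic phase `phase(7, 2)` it is much smaller; in both cases `u2_fourth_power` and `fourier_l4` agree up to floating-point error.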

To carry out the above argument, our first main task is to obtain such a cubic coupling using $\mathbf{X}$ and $f$ . Now each compact nilspace $\operatorname{X}_{i}$ has an associated cubic-coupling structure, given by the Haar measures $\mu_{\operatorname{C}^{n}(\operatorname{X}_{i})}$ on the cube sets $\operatorname{C}^{n}(\operatorname{X}_{i})$ , $n\geq 0$ (see [4, §2.2] for background on these Haar measures). More precisely, the cubic coupling in question is the sequence $(\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket})_{n\geq 0}$ where $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}$ is defined to be $\mu_{\operatorname{C}^{n}(\operatorname{X}_{i})}$ viewed as a measure on $\operatorname{X}_{i}^{\llbracket n\rrbracket}$ , i.e. for any set $B$ in the product $\sigma$ -algebra $\mathcal{B}(\operatorname{X}_{i})^{\llbracket n\rrbracket}$ (where $\mathcal{B}(\operatorname{X}_{i})$ is the Borel $\sigma$ -algebra on $\operatorname{X}_{i}$ ) we define $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}(B):=\mu_{\operatorname{C}^{n}(\operatorname{X}_{i})}\big{(}B\cap\operatorname{C}^{n}(\operatorname{X}_{i})\big{)}$ . The fact that $(\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket})_{n\geq 0}$ is a cubic coupling is established in [7, Proposition 3.6]. We can then apply the Loeb measure construction to the sequence of probability spaces $(\operatorname{X}_{i}^{\llbracket n\rrbracket},\mathcal{B}(\operatorname{X}_{i})^{\llbracket n\rrbracket},\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket})$ , $i\in\mathbb{N}$ , and thus obtain the Loeb probability space that we shall denote by $(\mathbf{X}^{\llbracket n\rrbracket},\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}},\mu^{\llbracket n\rrbracket})$ . 
Note that the ultraproduct of cube sets $\operatorname{C}^{n}(\mathbf{X}):=\prod_{i\to\omega}\operatorname{C}^{n}(\operatorname{X}_{i})$ is a subset of $\mathbf{X}^{\llbracket n\rrbracket}$ , and that $\mu^{\llbracket n\rrbracket}$ is concentrated on $\operatorname{C}^{n}(\mathbf{X})$ .

As we shall see in the next section, the cubic coupling axioms hold to some extent for these measures $\mu^{\llbracket n\rrbracket}$ . However, two problems prevent this construction from forming a genuine cubic coupling.

The first (and main) problem is that, for a sequence of measures $(\mu^{\llbracket n\rrbracket})_{n\geq 0}$ to form a cubic coupling, the $\sigma$ -algebras involved in satisfying the three axioms (especially the third axiom) must be the product $\sigma$ -algebras $\mathcal{A}^{\llbracket n\rrbracket}$ (where $\mathcal{A}$ is the $\sigma$ -algebra of the original probability space $\varOmega$ ). For $\Omega=\mathbf{X}$ , this requires that the axioms be satisfied, not with the Loeb $\sigma$ -algebras $\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ obtained above, but rather with the product $\sigma$ -algebras $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}=\bigotimes_{v\in\llbracket n\rrbracket}\mathcal{L}_{\mathbf{X}}$ . However, we then face an analogue in the present setting of a well-known fact about Loeb measure spaces, namely, we face the fact that $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}\subset\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ and that this inclusion may be strict (i.e. with $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}\neq\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ ). Indeed, the inclusion $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}\subset\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ can be seen using that each measure $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}$ is a coupling of $\mu_{\operatorname{X}_{i}}^{\llbracket 0\rrbracket}$ , and standard properties of ultralimits (e.g. 
by applying for each $v\in\llbracket n\rrbracket$ Lemma B.6 with $\pi_{i}$ the projection $p_{v}:\operatorname{X}_{i}^{\llbracket n\rrbracket}\to\operatorname{X}_{i}$ , to deduce that the projection $p_{v}:\mathbf{X}^{\llbracket n\rrbracket}\to\mathbf{X}$ satisfies $p_{v}^{-1}(\mathcal{L}_{\mathbf{X}})\subset\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ , and then concluding that $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}=\bigvee_{v\in\llbracket n\rrbracket}p_{v}^{-1}(\mathcal{L}_{\mathbf{X}})\subset\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ ). The possible strictness of this inclusion can be seen already for $n=1$ , where the associated measure $\mu^{\llbracket 1\rrbracket}$ can be seen to be the product measure $\mu^{\llbracket 0\rrbracket}\times\mu^{\llbracket 0\rrbracket}$ , and where we then have examples of this strict inclusion such as [8, Example 3.13] (see also [39, Remark 2.10.4]). Given the above fact, we cannot ensure directly that the third axiom in Definition 2.2 is satisfied with $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}$ as required. This problem occupies us for most of the next section, where we show that if the nilspaces $\operatorname{X}_{i}$ are cfr coset nilspaces then the cubic coupling axioms do hold with the smaller $\sigma$ -algebras $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}$ , as required.

The second problem is that the Loeb measure spaces are typically not separable, thus failing to be Borel probability spaces (i.e. probability spaces $(\Omega,\mathcal{A},\lambda)$ where the measurable space $(\Omega,\mathcal{A})$ is standard Borel; see [7, Definition 2.15]), which is required in [7, Theorem 4.2]. This problem is addressed in the second part of the next section, using the given function $f$ to generate a suitable separable factor of $\mathbf{X}$ which still satisfies the axioms in Definition 2.2.

3. The cubic coupling axioms for ultraproducts of cfr coset nilspaces

Recall that for each compact nilspace $\operatorname{X}$ and $n\geq 0$ , we write $\mu_{\operatorname{X}}^{\llbracket n\rrbracket}$ for the measure $B\mapsto\mu_{\operatorname{C}^{n}(\operatorname{X})}\big{(}B\cap\operatorname{C}^{n}(\operatorname{X})\big{)}$ on $\mathcal{B}(\operatorname{X})^{\llbracket n\rrbracket}$ , where $\mu_{\operatorname{C}^{n}(\operatorname{X})}$ is the Haar probability measure on the cube set $\operatorname{C}^{n}(\operatorname{X})$ . (Note that $\mu_{\operatorname{X}}^{\llbracket 0\rrbracket}$ is just the Haar measure $\mu_{\operatorname{X}}$ on $\operatorname{X}$ .)

Our main aim in this section is to prove the following result.

Proposition 3.1.

For each $i\in\mathbb{N}$ let $\operatorname{X}_{i}$ be a $k$ -step cfr coset nilspace. For $n\geq 0$ let $\mu^{\llbracket n\rrbracket}$ be the Loeb measure on $(\mathbf{X}^{\llbracket n\rrbracket},\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}})$ corresponding to the measures $\mu^{\llbracket n\rrbracket}_{\operatorname{X}_{i}}$ . Then the measures $\mu^{\llbracket n\rrbracket}$ restricted to the $\sigma$ -algebras $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}$ satisfy the axioms in Definition 2.2.

The first two axioms hold in fact for all compact nilspaces.

Lemma 3.2.

For each $i\in\mathbb{N}$ let $\operatorname{X}_{i}$ be a $k$ -step compact nilspace. For $n\geq 0$ let $\mu^{\llbracket n\rrbracket}$ be the Loeb measure on $(\mathbf{X}^{\llbracket n\rrbracket},\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}})$ corresponding to the measures $\mu^{\llbracket n\rrbracket}_{\operatorname{X}_{i}}$ . Then the measures $\mu^{\llbracket n\rrbracket}$ restricted to the $\sigma$ -algebras $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}$ satisfy axioms 1, 2 in Definition 2.2.

Proof.

We first check the ergodicity axiom. The $\sigma$ -algebra $\mathcal{L}_{\mathbf{X}}^{\llbracket 1\rrbracket}=\mathcal{L}_{\mathbf{X}}\otimes\mathcal{L}_{\mathbf{X}}$ is generated by rectangles of the form $\mathbf{E}_{1}\times\mathbf{E}_{2}$ where $\mathbf{E}_{1},\mathbf{E}_{2}\in\mathcal{L}_{\mathbf{X}}$ . By part 4 of [35, Theorem 2.1] applied to $\mu^{\llbracket 0\rrbracket}$ , there are internal measurable sets $\mathbf{F}_{1}=\prod_{i\to\omega}F_{1,i}$ , $\mathbf{F}_{2}=\prod_{i\to\omega}F_{2,i}$ such that $\mu^{\llbracket 0\rrbracket}(\mathbf{E}_{j}\Delta\mathbf{F}_{j})=0$ for $j=1,2$ . Compact nilspaces are known to satisfy the ergodicity axiom, so $\mu^{\llbracket 1\rrbracket}_{\operatorname{X}_{i}}=\mu_{\operatorname{X}_{i}}\times\mu_{\operatorname{X}_{i}}$ for each $i$ , whence $\mu^{\llbracket 1\rrbracket}(\mathbf{F}_{1}\times\mathbf{F}_{2})=\lim_{\omega}\mu_{\operatorname{X}_{i}}(F_{1,i})\mu_{\operatorname{X}_{i}}(F_{2,i})=\mu^{\llbracket 0\rrbracket}(\mathbf{F}_{1})\mu^{\llbracket 0\rrbracket}(\mathbf{F}_{2})$ . Note also that $\mathbf{E}_{1}\times\mathbf{E}_{2}\in\mathcal{L}_{\mathbf{X}^{\llbracket 1\rrbracket}}$ and $\mu^{\llbracket 1\rrbracket}(\mathbf{E}_{1}\times\mathbf{E}_{2})=\mu^{\llbracket 1\rrbracket}(\mathbf{F}_{1}\times\mathbf{F}_{2})$ (these facts are seen similarly to the inclusion $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}\subset\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ in Section 2, using Lemma B.6). The ergodicity axiom follows.

To check the consistency axiom, we need to show that given any injective morphism $\phi:\llbracket m\rrbracket\to\llbracket n\rrbracket$ , we have $\mu^{\llbracket n\rrbracket}_{\phi}=\mu^{\llbracket m\rrbracket}$ . This holds on the larger $\sigma$ -algebra $\mathcal{L}_{\mathbf{X}^{\llbracket m\rrbracket}}$ , because $\mu^{\llbracket n\rrbracket}$ is the Loeb measure associated with the measures $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}$ and the consistency axiom holds for $(\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket})_{n\geq 0}$ (note that the measurability of the map $\mathbf{X}^{\llbracket n\rrbracket}\to\mathbf{X}^{\llbracket m\rrbracket}$ , $\operatorname{c}\mapsto\operatorname{c}\operatorname{\circ}\phi$ with respect to $\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ , $\mathcal{L}_{\mathbf{X}^{\llbracket m\rrbracket}}$ is itself ensured by the fact that the measures $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}$ obey the consistency axiom, and Lemma B.6). But then the equality $\mu^{\llbracket n\rrbracket}_{\phi}=\mu^{\llbracket m\rrbracket}$ holds also in the smaller $\sigma$ -algebra $\mathcal{L}_{\mathbf{X}}^{\llbracket m\rrbracket}$ , since if $B\in\mathcal{L}_{\mathbf{X}}^{\llbracket m\rrbracket}$ and $F:=\phi(\llbracket m\rrbracket)\subset\llbracket n\rrbracket$ , then $p_{F}^{-1}(B)$ is in $\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ and so $\mu^{\llbracket n\rrbracket}\big{(}p_{F}^{-1}(B)\big{)}=\mu^{\llbracket m\rrbracket}(B)$ . ∎

We turn to the main task, i.e. to check that the conditional independence axiom holds not only with the $\sigma$ -algebras $\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ , but also with the smaller ones $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}$ . As recalled in Section 2, for $F\subset\llbracket n\rrbracket$ we denote by $(\mathcal{L}_{\mathbf{X}})^{\llbracket n\rrbracket}_{F}$ the $\sigma$ -algebra $\bigvee_{v\in F}p_{v}^{-1}(\mathcal{L}_{\mathbf{X}})\subset\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}$ .

Remark 3.3.

In the special case of Proposition 3.1 where each $\operatorname{X}_{i}$ is a compact abelian group (equipped with its standard cubes; see [3, Proposition 2.1.2]), the ultraproduct $\mathbf{X}$ is also an abelian group. This can be used to prove the conditional independence axiom with an argument that is markedly simpler than the one we use below for the more general case. Indeed, in the abelian case, the group structure on $\mathbf{X}$ yields a useful expression for the conditional expectation $\mathbb{E}\big{(}f|(\mathcal{L}_{\mathbf{X}})^{\llbracket n\rrbracket}_{F_{i}}\big{)}$ , namely that this is almost-surely equal to the function $\mathbf{x}\mapsto\int_{\mathbf{X}}f(\mathbf{x}+t^{F_{i}})\,\mathrm{d}\lambda(t)$ , where $t^{F_{i}}$ is the element of the group $\mathbf{X}^{\llbracket n\rrbracket}$ with $t^{F_{i}}(v)=t$ if $v\in F_{i}$ and $t^{F_{i}}(v)=0$ otherwise. These integral expressions for these expectation operators make it easy to see that for the two faces $F_{0},F_{1}$ the operators commute. This implies the conditional independence axiom (via [7, Proposition 2.10], say). While this case is much simpler than the argument in the general case, it still has significant content, and looking at its details can be helpful to understand the rest of this section.

Let us introduce a simplified notation for $\sigma$ -algebras for the rest of this section. For $S\subset\llbracket n\rrbracket$ , when the ultraproduct nilspace $\mathbf{X}$ and the dimension $n$ are clear from the context, we write simply $\mathcal{A}$ for $(\mathcal{L}_{\mathbf{X}})^{\llbracket n\rrbracket}$ , and $\mathcal{A}_{S}$ for $(\mathcal{L}_{\mathbf{X}})^{\llbracket n\rrbracket}_{S}$ . Similarly, we write $\mathcal{B}$ for $\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ and $\mathcal{B}_{S}$ for the $\sigma$ -algebra $p_{S}^{-1}(\mathcal{L}_{\mathbf{X}^{S}})$ on $\mathbf{X}^{\llbracket n\rrbracket}$ . By the explanation at the end of Section 2 we see that $\mathcal{A}_{S}\subset\mathcal{B}_{S}$ (and this inclusion may be strict).

Our main task, then, is to prove that for any adjacent faces $F_{0},F_{1}\subset\llbracket n\rrbracket$ of codimension 1, we have $\mathcal{A}_{F_{0}}\operatorname{\perp\!\!\!\perp}_{\mu^{\llbracket n\rrbracket}}\mathcal{A}_{F_{1}}$ and $\mathcal{A}_{F_{0}}\wedge_{\mu^{\llbracket n\rrbracket}}\mathcal{A}_{F_{1}}=_{\mu^{\llbracket n\rrbracket}}\mathcal{A}_{F_{0}\cap F_{1}}$ .

We say that two faces of codimension 1 in $\llbracket n\rrbracket$ are opposite faces if they are not adjacent (i.e. if their intersection is empty). Given a $\sigma$ -algebra $\mathcal{X}$ on a set $X$ , and a finite set $S$ , we say an $\mathcal{X}^{S}$ -measurable function $f:X^{S}\to\mathbb{C}$ is a rank 1 function if $f=\prod_{v\in S}f_{v}\operatorname{\circ}p_{v}$ where each $f_{v}:X\to\mathbb{C}$ is $\mathcal{X}$ -measurable.
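The dichotomy between adjacent and opposite faces can be made explicit by enumeration. The sketch below lists the codimension-1 faces of $\llbracket n\rrbracket$ as the sets $\{v:v(i)=b\}$ and classifies pairs of distinct faces according to whether they intersect; the function names are hypothetical and serve only this illustration.

```python
from itertools import combinations, product


def faces(n):
    """Codimension-1 faces of {0,1}^n: fix coordinate i to the value b."""
    cube = list(product((0, 1), repeat=n))
    return {(i, b): frozenset(v for v in cube if v[i] == b)
            for i in range(n) for b in (0, 1)}


def classify(n):
    """Split unordered pairs of distinct faces into adjacent and opposite."""
    adjacent, opposite = [], []
    for (k0, f0), (k1, f1) in combinations(faces(n).items(), 2):
        # Adjacent means the two faces share at least one vertex.
        (adjacent if f0 & f1 else opposite).append((k0, k1))
    return adjacent, opposite
```

For $n=3$ this yields six faces, with three opposite pairs (the same coordinate fixed to 0 and to 1) and twelve adjacent pairs, each meeting in a face of codimension 2.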

We begin by reducing our main task as follows.

Lemma 3.4.

The conditional independence axiom holds with $\mathcal{A}$ , $\mu^{\llbracket n\rrbracket}$ ( $\forall n\in\mathbb{N}$ ) if the following statement holds: $\forall n\in\mathbb{N}$ , for any opposite faces $F_{0},F_{1}\subset\llbracket n\rrbracket$ of codimension 1, every rank 1 bounded $\mathcal{A}_{F_{0}}$ -measurable function $f$ satisfies $\mathbb{E}(f|\mathcal{B}_{F_{1}})\in L^{\infty}(\mathcal{A}_{F_{1}})$ .

Here and below, in notions involving equality up to null sets, unless otherwise stated these are null sets relative to $\mu^{\llbracket n\rrbracket}$ and are allowed to be from the largest ambient $\sigma$ -algebra on $\mathbf{X}^{\llbracket n\rrbracket}$ , i.e. $\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ . Thus “ $\mathbb{E}(f|\mathcal{B}_{F_{1}})\in L^{\infty}(\mathcal{A}_{F_{1}})$ ” here means that $\mathbb{E}(f|\mathcal{B}_{F_{1}})$ agrees with some $\mathcal{A}_{F_{1}}$ -measurable bounded function outside some $\mu^{\llbracket n\rrbracket}$ -null set (recall that $\mathbb{E}(f|\mathcal{B}_{F_{1}})$ is defined up to $\mu^{\llbracket n\rrbracket}$ -null sets anyway). Similarly, equalities between conditional expectations are meant up to a null set in the ambient measure (if there is danger of confusion, we indicate the measure by a subscript in the equality).

Proof.

To confirm that the conditional independence axiom holds, we have to show that for any adjacent faces $F_{0}^{\prime},F_{1}^{\prime}\subset\llbracket n\rrbracket$ of codimension 1 we have $\mathcal{A}_{F_{0}^{\prime}}\operatorname{\perp\!\!\!\perp}_{\mu^{\llbracket n\rrbracket}}\mathcal{A}_{F_{1}^{\prime}}$ and $\mathcal{A}_{F_{0}^{\prime}}\wedge_{\mu^{\llbracket n\rrbracket}}\mathcal{A}_{F_{1}^{\prime}}=_{\mu^{\llbracket n\rrbracket}}\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}}$ . By [7, Lemma 2.30], it suffices to prove that if $f$ is a rank 1 bounded $\mathcal{A}_{F_{0}^{\prime}}$ -measurable function then $\mathbb{E}(f|\mathcal{A}_{F_{1}^{\prime}})\in L^{\infty}(\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}})$ . We have $\mathbb{E}(f|\mathcal{A}_{F_{1}^{\prime}})=\mathbb{E}(\mathbb{E}(f|\mathcal{B}_{F_{1}^{\prime}})|\mathcal{A}_{F_{1}^{\prime}})$ , since $\mathcal{A}_{F_{1}^{\prime}}\subset\mathcal{B}_{F_{1}^{\prime}}$ . We also have $\mathbb{E}(f|\mathcal{B}_{F_{1}^{\prime}})=\mathbb{E}(f|\mathcal{B}_{F_{0}^{\prime}\cap F_{1}^{\prime}})$ because the conditional independence axiom holds for the measures $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}$ , and this is then seen to imply the same property for $\mu^{\llbracket n\rrbracket}$ on $\mathcal{B}$ using Lemma B.3. Hence $\mathbb{E}(f|\mathcal{A}_{F_{1}^{\prime}})=\mathbb{E}(\mathbb{E}(f|\mathcal{B}_{F_{0}^{\prime}\cap F_{1}^{\prime}})|\mathcal{A}_{F_{1}^{\prime}})$ . Therefore, if we prove

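$$\mathbb{E}(f|\mathcal{B}_{F_{0}^{\prime}\cap F_{1}^{\prime}})\in L^{\infty}(\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}}),\tag{1}$$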

then $\mathbb{E}(f|\mathcal{B}_{F_{0}^{\prime}\cap F_{1}^{\prime}})=\mathbb{E}(f|\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}})$ (since $\mathcal{B}_{F_{0}^{\prime}\cap F_{1}^{\prime}}\supset\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}}$ ), which implies that $\mathbb{E}(f|\mathcal{A}_{F_{1}^{\prime}})$ $=\mathbb{E}(\mathbb{E}(f|\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}})|\mathcal{A}_{F_{1}^{\prime}})=\mathbb{E}(f|\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}})$ , so $\mathbb{E}(f|\mathcal{A}_{F_{1}^{\prime}})\in L^{\infty}(\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}})$ as required.

Since $f$ is a rank 1 function $\prod_{v\in F_{0}^{\prime}}f_{v}\operatorname{\circ}p_{v}$ , and $\prod_{v\in F_{0}^{\prime}\cap F_{1}^{\prime}}f_{v}\operatorname{\circ}p_{v}$ is $\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}}$ -measurable, we have $\mathbb{E}(f|\mathcal{B}_{F_{0}^{\prime}\cap F_{1}^{\prime}})=(\prod_{v\in F_{0}^{\prime}\cap F_{1}^{\prime}}f_{v}\operatorname{\circ}p_{v})\,\mathbb{E}(\prod_{v\in F_{0}^{\prime}\setminus F_{1}^{\prime}}f_{v}\operatorname{\circ}p_{v}|\mathcal{B}_{F_{0}^{\prime}\cap F_{1}^{\prime}})$ . Hence, if it holds that $\mathbb{E}(\prod_{v\in F_{0}^{\prime}\setminus F_{1}^{\prime}}f_{v}\operatorname{\circ}p_{v}|\mathcal{B}_{F_{0}^{\prime}\cap F_{1}^{\prime}})\in L^{\infty}(\mathcal{A}_{F_{0}^{\prime}\cap F_{1}^{\prime}})$ then (1) follows. But this is indeed seen to hold by relabeling $F_{0}^{\prime}$ as $\llbracket n\rrbracket$ , $F_{0}^{\prime}\setminus F_{1}^{\prime}$ as $F_{0}$ , and $F_{0}^{\prime}\cap F_{1}^{\prime}$ as $F_{1}$ , and using the statement in the lemma. ∎

To prove the statement in Lemma 3.4, we work with the $\sigma$ -algebra $\mathcal{I}:=\mathcal{B}_{F_{0}}\wedge_{\mu^{\llbracket n\rrbracket}}\mathcal{B}_{F_{1}}\subset\mathcal{L}_{\mathbf{X}^{\llbracket n\rrbracket}}$ . First we note the following expression for $\mathcal{I}$ in terms of a $\sigma$ -algebra $\mathcal{I}^{\prime}\subset\mathcal{L}_{\mathbf{X}^{\llbracket n-1\rrbracket}}$ .

Lemma 3.5.

Let $F_{0},F_{1}$ be opposite faces of codimension 1 in $\llbracket n\rrbracket$ . Let $\mathbf{\mathcal{I}}^{\prime}$ be the $\sigma$ -algebra of sets $A^{\prime}\in\mathcal{L}_{\mathbf{X}^{\llbracket n-1\rrbracket}}$ such that $p_{F_{0}}^{-1}(A^{\prime})=_{\mu^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(A^{\prime})$ . Then we have $p_{F_{0}}^{-1}(\mathcal{I}^{\prime})=_{\mu^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(\mathcal{I}^{\prime})=_{\mu^{\llbracket n\rrbracket}}\mathcal{I}$ .

Proof.

It is clear from the definitions that $p_{F_{0}}^{-1}(\mathcal{I}^{\prime})=_{\mu^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(\mathcal{I}^{\prime})\subset_{\mu^{\llbracket n\rrbracket}}\mathcal{I}$ , so it suffices to prove that $\mathcal{I}\subset_{\mu^{\llbracket n\rrbracket}}p_{F_{0}}^{-1}(\mathcal{I}^{\prime})$ . The idea is that the analogous inclusion is known to hold for the nilspaces $\operatorname{X}_{i}$ , and the inclusion for $\mathcal{I}$ then follows by straightforward arguments with ultraproducts. More precisely, let $\mathcal{B}_{i}$ denote the Borel $\sigma$ -algebra on $\operatorname{X}_{i}$ for each $i\in\mathbb{N}$ , and recall that the cubic Haar measures $\mu_{\operatorname{X}_{i}}^{\llbracket m\rrbracket}$ , $m\geq 0$ form a cubic coupling [7, Proposition 3.6], so by [7, Lemma 3.4] the measure $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}$ is an idempotent coupling, and so by [7, Lemma 2.62 (iii) and Proposition 2.66] we have $(\mathcal{B}_{i})_{F_{0}}^{\llbracket n\rrbracket}\operatorname{\perp\!\!\!\perp}_{\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}}(\mathcal{B}_{i})_{F_{1}}^{\llbracket n\rrbracket}$ , for each $i\in\mathbb{N}$ . By Lemma B.3, for every $A\in\mathcal{I}$ there are sets $A_{i}\in(\mathcal{B}_{i})_{F_{0}}^{\llbracket n\rrbracket}\wedge_{\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}}(\mathcal{B}_{i})^{\llbracket n\rrbracket}_{F_{1}}$ , $i\in\mathbb{N}$ , such that $A=_{\mu^{\llbracket n\rrbracket}}\prod_{i\to\omega}A_{i}$ . Then by [7, Lemma 2.62 (iii)], for each $i$ there is $A_{i}^{\prime}\in\mathcal{B}_{i}^{\llbracket n-1\rrbracket}$ such that $p_{F_{0}}^{-1}(A_{i}^{\prime})=_{\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}}A_{i}=_{\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(A_{i}^{\prime})$ . Now $A^{\prime}:=\prod_{i\to\omega}A_{i}^{\prime}$ is in $\mathcal{I}^{\prime}$ and $A=_{\mu^{\llbracket n\rrbracket}}p_{F_{0}}^{-1}(A^{\prime})$ . The desired inclusion follows. ∎

Using this expression of $\mathcal{I}$ , we now perform a second reduction, using Lemma 3.4.

Lemma 3.6.

The conditional independence axiom holds with $(\mathcal{A},\mu^{\llbracket n\rrbracket})$ if the following statement holds. For every pair of opposite faces $F_{0},F_{1}$ of codimension 1 in $\llbracket n\rrbracket$ , the $\sigma$ -algebra $\mathcal{I}=\mathcal{B}_{F_{0}}\wedge_{\mu^{\llbracket n\rrbracket}}\mathcal{B}_{F_{1}}$ satisfies $\mathcal{A}_{F_{0}}\operatorname{\perp\!\!\!\perp}_{\mu^{\llbracket n\rrbracket}}\mathcal{I}$ .

Proof.

By Lemma 3.4, it suffices to prove that for every rank 1 bounded $\mathcal{A}_{F_{0}}$ -measurable function $f$ we have $\mathbb{E}(f|\mathcal{B}_{F_{1}})\in L^{\infty}(\mathcal{A}_{F_{1}})$ . We claim that $\mathcal{B}_{F_{0}}\operatorname{\perp\!\!\!\perp}\mathcal{B}_{F_{1}}$ . As in the proof of Lemma 3.5, this follows from a similar property holding for the nilspaces $\operatorname{X}_{i}$ . Indeed, as recalled in that proof, for each $i$ the coupling $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}$ is idempotent. By [7, Lemma 2.62 (iii) and Proposition 2.66] the claimed conditional independence holds for the analogues of $\mathcal{B}_{F_{0}},\mathcal{B}_{F_{1}}$ on $\operatorname{X}_{i}^{\llbracket n\rrbracket}$ . Our claim then follows by Lemma B.3. Now, since $f$ is $\mathcal{B}_{F_{0}}$ -measurable (as $\mathcal{B}_{F_{0}}\supset\mathcal{A}_{F_{0}}$ ), by $\mathcal{B}_{F_{0}}\operatorname{\perp\!\!\!\perp}\mathcal{B}_{F_{1}}$ we have $\mathbb{E}(f|\mathcal{B}_{F_{1}})=\mathbb{E}(f|\mathcal{B}_{F_{0}}\wedge\mathcal{B}_{F_{1}})=\mathbb{E}(f|\mathcal{I})$ . Hence, it suffices to prove that $\mathbb{E}(f|\mathcal{I})\in L^{\infty}(\mathcal{A}_{F_{1}})$ .

We now claim that $\mathcal{I}\wedge\mathcal{A}_{F_{0}}=_{\mu^{\llbracket n\rrbracket}}\mathcal{I}\wedge\mathcal{A}_{F_{1}}$ . Confirming this claim would complete the proof. Indeed, by assumption $\mathcal{A}_{F_{0}}\operatorname{\perp\!\!\!\perp}\mathcal{I}$ , so we would have $\mathbb{E}(f|\mathcal{I})\in L^{\infty}(\mathcal{A}_{F_{0}}\wedge\mathcal{I})=L^{\infty}(\mathcal{A}_{F_{1}}\wedge\mathcal{I})\subset L^{\infty}(\mathcal{A}_{F_{1}})$ , as required. To prove the claim, let $\sigma$ be the reflection map on $\mathbf{X}^{\llbracket n\rrbracket}$ induced by the reflection on $\llbracket n\rrbracket$ that permutes $F_{0}$ and $F_{1}$ . By Lemma 3.5, for every $U\in\mathcal{I}$ we have $\sigma(U)=_{\mu^{\llbracket n\rrbracket}}U$ . Since $\sigma(\mathcal{A}_{F_{0}})=\mathcal{A}_{F_{1}}$ , it follows that for every $U\in\mathcal{I}\wedge\mathcal{A}_{F_{0}}$ we have $U=_{\mu^{\llbracket n\rrbracket}}\sigma(U)\in\sigma(\mathcal{A}_{F_{0}})=\mathcal{A}_{F_{1}}$ , so $\mathcal{I}\wedge\mathcal{A}_{F_{0}}\subset_{\mu^{\llbracket n\rrbracket}}\mathcal{I}\wedge\mathcal{A}_{F_{1}}$ . Similarly $\mathcal{I}\wedge\mathcal{A}_{F_{1}}\subset_{\mu^{\llbracket n\rrbracket}}\mathcal{I}\wedge\mathcal{A}_{F_{0}}$ . ∎

To prove the statement in Lemma 3.6, we now work towards a useful description of $\mathcal{I}$ in terms of an invariance under a certain group action. For this, we start using the coset nilspace structure. Thus, we now suppose that $\mathbf{X}$ is an ultraproduct of cfr coset nilspaces $\operatorname{X}_{i}=(G^{(i)}/\Gamma^{(i)},G^{(i)}_{\bullet})$ , $i\in\mathbb{N}$ . Note that $\mathbf{X}$ is then a coset nilspace $(G/\Gamma,G_{\bullet})$ (in the algebraic sense of [3, Proposition 2.3.1]), where $G$ , $\Gamma$ are the groups $\prod_{i\to\omega}G^{(i)}$ , $\prod_{i\to\omega}\Gamma^{(i)}$ respectively, and $G_{\bullet}=(G_{j})_{j\geq 0}$ is a filtration with $G_{j}=\prod_{i\to\omega}G^{(i)}_{j}$ .

Given a filtration $G_{\bullet}$ and $\ell\in\mathbb{N}$ , we denote by $G_{\bullet}^{+\ell}$ the shifted filtration whose $j$ -th term is $G_{j+\ell}$ (strictly speaking, this is a prefiltration; see [6, Appendix C]). We use the notion of a $1$ -arrow of cubes on a nilspace $\operatorname{X}$ [3, Definition 2.2.18]: for $\operatorname{c}_{0},\operatorname{c}_{1}\in\operatorname{C}^{n}(\operatorname{X})$ , the $1$ -arrow $\langle\operatorname{c}_{0},\operatorname{c}_{1}\rangle_{1}\in\operatorname{X}^{\llbracket n+1\rrbracket}$ is defined by $\langle\operatorname{c}_{0},\operatorname{c}_{1}\rangle_{1}(v,j)=\operatorname{c}_{j}(v)$ , $j=0,1$ .

Given any nilspace $\operatorname{X}$ , we define an equivalence relation $\sim$ on $\operatorname{C}^{n-1}(\operatorname{X})$ by declaring that $\operatorname{c}_{0}\sim\operatorname{c}_{1}$ if $\langle\operatorname{c}_{0},\operatorname{c}_{1}\rangle_{1}\in\operatorname{C}^{n}(\operatorname{X})$ . The following result gives a useful algebraic description of this relation when $\operatorname{X}$ is a coset nilspace $(G/\Gamma,G_{\bullet})$ (the purely algebraic definition of a coset nilspace can be recalled from [3, Proposition 2.3.1]).

Lemma 3.7.

Let $\operatorname{X}=(G/\Gamma,G_{\bullet})$ be a coset nilspace. Then $\operatorname{c}_{0}\sim\operatorname{c}_{1}$ if and only if there exist $\widetilde{\operatorname{c}}_{0},\widetilde{\operatorname{c}}_{1}\in\operatorname{C}^{n-1}(G_{\bullet})$ with $\operatorname{c}_{i}=\pi_{\Gamma}\operatorname{\circ}\widetilde{\operatorname{c}}_{i}$ , $i=0,1$ , and $\widetilde{\operatorname{c}}_{0}^{\,-1}\,\widetilde{\operatorname{c}}_{1}\in\operatorname{C}^{n-1}(G_{\bullet}^{+1})$ . Thus, the equivalence classes of $\sim$ are the orbits of the action of $\operatorname{C}^{n-1}(G_{\bullet}^{+1})$ on $\operatorname{C}^{n-1}(\operatorname{X})$ .

Here $\pi_{\Gamma}$ denotes the canonical quotient map $G\to G/\Gamma$ .

Proof.

Suppose that $\operatorname{c}_{0}\sim\operatorname{c}_{1}$ . Thus $\langle\operatorname{c}_{0},\operatorname{c}_{1}\rangle_{1}\in\operatorname{C}^{n}(\operatorname{X})$ , so there is $\operatorname{c}\in\operatorname{C}^{n}(G_{\bullet})$ such that $\langle\operatorname{c}_{0},\operatorname{c}_{1}\rangle_{1}=\pi_{\Gamma}\operatorname{\circ}\operatorname{c}$ . For $i\in\{0,1\}$ let $\widetilde{\operatorname{c}}_{i}$ be the restriction of $\operatorname{c}$ to the face $\{v\in\llbracket n\rrbracket:v(n)=i\}$ . Then $\pi_{\Gamma}\operatorname{\circ}\widetilde{\operatorname{c}}_{i}=\operatorname{c}_{i}$ . Since $\langle\widetilde{\operatorname{c}}_{0},\widetilde{\operatorname{c}}_{1}\rangle_{1}=\operatorname{c}$ is a cube, we have by [3, Lemma 2.2.19] that $\widetilde{\operatorname{c}}_{0}^{\,-1}\,\widetilde{\operatorname{c}}_{1}\in\operatorname{C}^{n-1}(G_{\bullet}^{+1})$ . The backward implication is also clear, using the backward implication in [3, Lemma 2.2.19]. For the last claim, suppose that $\widetilde{\operatorname{c}}_{0}\Gamma^{\llbracket n-1\rrbracket}\sim\widetilde{\operatorname{c}}_{1}\Gamma^{\llbracket n-1\rrbracket}$ , and note that $\widetilde{\operatorname{c}}_{1}\Gamma^{\llbracket n-1\rrbracket}=\widetilde{\operatorname{c}}_{0}(\widetilde{\operatorname{c}}_{0}^{\,-1}\,\widetilde{\operatorname{c}}_{1})\Gamma^{\llbracket n-1\rrbracket}=g\,\widetilde{\operatorname{c}}_{0}\Gamma^{\llbracket n-1\rrbracket}$ , where $g:=\widetilde{\operatorname{c}}_{0}(\widetilde{\operatorname{c}}_{0}^{\,-1}\widetilde{\operatorname{c}}_{1})\widetilde{\operatorname{c}}_{0}^{\,-1}$ is in $\operatorname{C}^{n-1}(G_{\bullet}^{+1})$ since this is a normal subgroup of $\operatorname{C}^{n-1}(G_{\bullet})$ . ∎
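Lemma 3.7 can be sanity-checked in a toy case that is not part of the paper: $G=\mathbb{Z}_{N}$ with trivial $\Gamma$ and the filtration $G_{0}=G_{1}=\mathbb{Z}_{N}$ , $G_{2}=\{0\}$ , so that the cubes are the standard ones and $\operatorname{C}^{n-1}(G_{\bullet}^{+1})$ consists of the constant maps. Under these assumptions the lemma predicts that $\operatorname{c}_{0}\sim\operatorname{c}_{1}$ exactly when $\operatorname{c}_{1}-\operatorname{c}_{0}$ is constant. The sketch below tests this by enumeration; `N = 3` and the function names are hypothetical.

```python
from itertools import product

N = 3  # toy modulus, chosen only for this sketch


def standard_cubes(n):
    """Set of standard n-cubes on Z_N, as tuples indexed by {0,1}^n."""
    verts = list(product((0, 1), repeat=n))
    return {tuple((x + sum(a * b for a, b in zip(v, h))) % N for v in verts)
            for x, *h in product(range(N), repeat=n + 1)}


def arrow(c0, c1):
    """The 1-arrow <c0, c1>_1: the vertex (v, j) carries the value c_j(v).

    Vertices are ordered with the last coordinate varying fastest, so the
    arrow is obtained by interleaving the entries of c0 and c1.
    """
    return tuple(val for pair in zip(c0, c1) for val in pair)


def check_lemma(n):
    """Compare 'arrow is an n-cube' with 'c1 - c0 is constant' on all pairs."""
    big = standard_cubes(n)
    small = sorted(standard_cubes(n - 1))
    related = 0
    for c0, c1 in product(small, repeat=2):
        in_relation = arrow(c0, c1) in big
        constant_shift = len({(b - a) % N for a, b in zip(c0, c1)}) == 1
        if in_relation != constant_shift:
            return False, related
        related += in_relation
    return True, related
```

`check_lemma(n)` returns whether the two conditions agree on every pair of $(n-1)$ -cubes, together with the number of related pairs, which in this example is $N$ times the number of $(n-1)$ -cubes, as expected for orbits of the group of constant shifts.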

We use this algebraic expression of the relation $\sim$ to prove the following description of the $\sigma$ -algebra $\mathcal{I}^{\prime}$ from Lemma 3.5, as a key step toward the proof of Proposition 3.1.

Lemma 3.8.

For each $i\in\mathbb{N}$ let $\operatorname{X}_{i}$ be a cfr coset nilspace $(G^{(i)}/\Gamma^{(i)},G^{(i)}_{\bullet})$ . Let $\mathbf{H}$ be the ultraproduct group $\prod_{i\to\omega}\operatorname{C}^{n-1}\big{(}(G^{(i)})_{\bullet}^{+1}\big{)}$ . Then a set $A\in\mathcal{L}_{\mathbf{X}^{\llbracket n-1\rrbracket}}$ is in $\mathcal{I}^{\prime}$ if and only if $g\cdot A=_{\mu^{\llbracket n-1\rrbracket}}A$ for every $g\in\mathbf{H}$ .

To prove this we first obtain the following analogous result for cfr coset nilspaces.

Lemma 3.9.

Let $\operatorname{X}$ be a cfr coset nilspace $(G/\Gamma,G_{\bullet})$ , let $H=\operatorname{C}^{n-1}(G_{\bullet}^{+1})$ , and let $\mathcal{J}$ be the $\sigma$ -algebra of Borel sets $A\subset\operatorname{X}^{\llbracket n-1\rrbracket}$ such that $p_{F_{0}}^{-1}(A)=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(A)$ . Then a Borel set $A\subset\operatorname{X}^{\llbracket n-1\rrbracket}$ is in $\mathcal{J}$ if and only if $g\cdot A=_{\mu_{\operatorname{X}}^{\llbracket n-1\rrbracket}}A$ for every $g\in H$ .

Recall that $\mu_{\operatorname{X}}^{\llbracket n\rrbracket}$ denotes the Haar measure on $\operatorname{C}^{n}(\operatorname{X})$ viewed as a measure on $\operatorname{X}^{\llbracket n\rrbracket}$ .

Proof.

Assume that $p_{F_{0}}^{-1}(A)=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(A)$ , and let $A^{\prime}=A\cap\operatorname{C}^{n-1}(\operatorname{X})$ . Note that every element in $p_{F_{0}}^{-1}(A^{\prime})$ that lies in $\operatorname{C}^{n}(\operatorname{X})$ is of the form $\langle\operatorname{c}_{0},\operatorname{c}_{1}\rangle_{1}$ for $\operatorname{c}_{0}\sim\operatorname{c}_{1}$ , with $\operatorname{c}_{0}\in A^{\prime}$ . Since $\mu_{\operatorname{X}}^{\llbracket n\rrbracket}$ is concentrated on $\operatorname{C}^{n}(\operatorname{X})$ , we have $p_{F_{0}}^{-1}(A)=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}p_{F_{0}}^{-1}(A^{\prime})=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}\{\langle\operatorname{c}_{0},g\cdot\operatorname{c}_{0}\rangle_{1}:\operatorname{c}_{0}\in A^{\prime},g\in H\}$ , by Lemma 3.7. Letting $H^{\prime}$ denote the group $\{\langle\mathrm{id}_{H},g\rangle_{1}:g\in H\}$ , it follows that $p_{F_{0}}^{-1}(A)=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}g^{\prime}\cdot p_{F_{0}}^{-1}(A)$ for every $g^{\prime}=\langle\mathrm{id}_{H},g\rangle_{1}\in H^{\prime}$ . By our assumption, this implies $p_{F_{1}}^{-1}(A)=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}g^{\prime}\cdot p_{F_{1}}^{-1}(A)$ . Moreover $g^{\prime}\cdot p_{F_{1}}^{-1}(A)=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}g^{\prime}\cdot\{\langle h\cdot\operatorname{c}_{1},\operatorname{c}_{1}\rangle_{1}:\operatorname{c}_{1}\in A^{\prime},h\in H\}$ and this equals $\{\langle h\cdot\operatorname{c}_{1},\operatorname{c}_{1}\rangle_{1}:\operatorname{c}_{1}\in g\cdot A^{\prime},h\in H\}=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(g\cdot A)$ . Hence $p_{F_{1}}^{-1}(A)=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(g\cdot A)$ , which implies that $A=_{\mu_{\operatorname{X}}^{\llbracket n-1\rrbracket}}g\cdot A$ as required.

Conversely, if $A=_{\mu_{\operatorname{X}}^{\llbracket n-1\rrbracket}}g\cdot A$ for all $g\in H$ , then by [31, Theorem 3] there is $A^{\prime}=_{\mu_{\operatorname{X}}^{\llbracket n-1\rrbracket}}A$ such that $g\cdot A^{\prime}=A^{\prime}$ for every $g\in H$ . Using Lemma 3.7 as above yields $p_{F_{0}}^{-1}(A^{\prime})=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}\{\langle\operatorname{c}_{0},\operatorname{c}_{1}\rangle_{1}:\operatorname{c}_{0},\operatorname{c}_{1}\in A^{\prime},\operatorname{c}_{0}\sim\operatorname{c}_{1}\}=_{\mu_{\operatorname{X}}^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(A^{\prime})$ , whence $A^{\prime}\in\mathcal{J}$ and therefore $A\in\mathcal{J}$ . ∎

Proof of Lemma 3.8.

We first prove the forward implication. If $A\in\mathcal{I}^{\prime}$ , then by definition $\widetilde{A}:=p_{F_{0}}^{-1}(A)=_{\mu^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(A)$ , so in particular $\widetilde{A}\in\mathcal{B}_{F_{0}}\wedge\mathcal{B}_{F_{1}}$ . By Lemma B.3 there are Borel sets $\widetilde{A}_{i}\in\mathcal{B}_{i,F_{0}}\wedge\mathcal{B}_{i,F_{1}}$ , $i\in\mathbb{N}$ , such that $\widetilde{A}=_{\mu^{\llbracket n\rrbracket}}\prod_{i\to\omega}\widetilde{A}_{i}$ (where $\mathcal{B}_{i,F_{0}}$ is the analogue of $\mathcal{B}_{F_{0}}$ for $\operatorname{X}_{i}$ ). For each $i$ , combining the idempotence of $\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}$ with [4, Lemma 2.62] as in previous proofs, we obtain Borel sets $A_{i}\subset\operatorname{X}_{i}^{\llbracket n-1\rrbracket}$ such that $\widetilde{A}_{i}=_{\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}}p_{F_{0}}^{-1}(A_{i})=_{\mu_{\operatorname{X}_{i}}^{\llbracket n\rrbracket}}p_{F_{1}}^{-1}(A_{i})$ . Hence $p_{F_{0}}^{-1}(A)=_{\mu^{\llbracket n\rrbracket}}\prod_{i\to\omega}p_{F_{0}}^{-1}(A_{i})=_{\mu^{\llbracket n\rrbracket}}p_{F_{0}}^{-1}(\prod_{i\to\omega}A_{i})$ . Consequently $A=_{\mu^{\llbracket n-1\rrbracket}}\prod_{i\to\omega}A_{i}$ . By Lemma 3.9 every such set $A_{i}$ is $H_{i}$ -invariant for $H_{i}:=\operatorname{C}^{n-1}\big{(}(G^{(i)})_{\bullet}^{+1}\big{)}$ . It follows that $A$ is $\mathbf{H}$ -invariant as required.

Conversely, if $\mu^{\llbracket n-1\rrbracket}(A\Delta h\cdot A)=0$ for all $h\in\mathbf{H}$ , then by [35, Theorem 2.1] there are Borel sets $A_{i}\subset\operatorname{X}_{i}^{\llbracket n-1\rrbracket}$ such that $A=_{\mu^{\llbracket n-1\rrbracket}}\prod_{i\to\omega}A_{i}$ . For each $i$ let $s_{i}=\sup_{h\in H_{i}}\mu_{\operatorname{X}_{i}}^{\llbracket n-1\rrbracket}\big{(}A_{i}\Delta(h\cdot A_{i})\big{)}$ . We claim that for every $\epsilon>0$ we have $\{i:s_{i}<\epsilon\}\in\omega$ . Otherwise there is $\epsilon>0$ such that $\{i:s_{i}\geq\epsilon\}\in\omega$ , so for every such $i$ there is $h_{i}\in H_{i}$ such that $\mu_{\operatorname{X}_{i}}^{\llbracket n-1\rrbracket}\big{(}A_{i}\Delta(h_{i}\cdot A_{i})\big{)}\geq\epsilon/2$ . Letting $h=\lim_{i\to\omega}h_{i}\in\mathbf{H}$ , we would have $\mu^{\llbracket n-1\rrbracket}\big{(}A\Delta(h\cdot A)\big{)}\geq\epsilon/2>0$ , a contradiction. This proves our claim. Hence, for every $\epsilon>0$ , for every $i$ such that $s_{i}<\epsilon$ , by Lemma B.4 there is an $H_{i}$ -invariant set $A^{\prime}_{i}$ such that $\mu_{\operatorname{X}_{i}}^{\llbracket n-1\rrbracket}\big{(}A_{i}\Delta A_{i}^{\prime}\big{)}\leq 5\epsilon^{1/4}$ . Let $A^{\prime}=\prod_{i\to\omega}A_{i}^{\prime}$ . Then $\mu^{\llbracket n-1\rrbracket}\big{(}A\Delta A^{\prime}\big{)}\leq 5\epsilon^{1/4}$ . Since $A_{i}^{\prime}\in\mathcal{J}_{i}$ by Lemma 3.9 (where $\mathcal{J}_{i}$ is the analogue of $\mathcal{J}$ for $\operatorname{X}_{i}$ ), we have $A^{\prime}\in\mathcal{I}^{\prime}$ by Lemma B.3. Letting $\epsilon\to 0$ , we deduce that $A\in\mathcal{I}^{\prime}$ . ∎

We can now complete the proof of Proposition 3.1, by proving the following result.

Proposition 3.10.

For every pair of opposite faces $F_{0},F_{1}$ of codimension 1 in $\llbracket n\rrbracket$ , the $\sigma$ -algebra $\mathcal{I}=\mathcal{B}_{F_{0}}\wedge\mathcal{B}_{F_{1}}$ satisfies $\mathcal{A}_{F_{0}}\operatorname{\perp\!\!\!\perp}\mathcal{I}$ .

Proof.

As $\mathcal{A}_{F_{0}}=p_{F_{0}}^{-1}(\mathcal{L}_{\mathbf{X}}^{\llbracket n-1\rrbracket})$ and $\mathcal{I}=_{\mu^{\llbracket n\rrbracket}}p_{F_{0}}^{-1}(\mathcal{I}^{\prime})$ , it suffices to show that $\mathcal{L}_{\mathbf{X}}^{\llbracket n-1\rrbracket}\operatorname{\perp\!\!\!\perp}\mathcal{I}^{\prime}$ . For this proof let $\mathcal{A}$ denote $\mathcal{L}_{\mathbf{X}}^{\llbracket n-1\rrbracket}$ . Let $f\in L^{\infty}(\mathcal{I}^{\prime})$ and $h\in\mathbf{H}$ . Then $f^{h}=_{\mu^{\llbracket n-1\rrbracket}}f$ , by Lemma 3.8 (where $f^{h}(x):=f(h\cdot x)$ ), so $\mathbb{E}(f|\mathcal{A})=_{\mu^{\llbracket n-1\rrbracket}}\mathbb{E}(f^{h}|\mathcal{A})$ . Note the global invariance $\mathcal{A}^{h}=_{\mu^{\llbracket n-1\rrbracket}}\mathcal{A}$ , since $g^{h}\in L^{\infty}(\mathcal{A})$ for every $g\in L^{\infty}(\mathcal{A})$ of rank 1. Hence $\mathbb{E}(f^{h}|\mathcal{A})=_{\mu^{\llbracket n-1\rrbracket}}\mathbb{E}(f^{h}|\mathcal{A}^{h})$ . As $h$ is measure preserving, $\mathbb{E}(f^{h}|\mathcal{A}^{h})=_{\mu^{\llbracket n-1\rrbracket}}\mathbb{E}(f|\mathcal{A})^{h}$ , so $\mathbb{E}(f|\mathcal{A})=_{\mu^{\llbracket n-1\rrbracket}}\mathbb{E}(f|\mathcal{A})^{h}$ . This holds for all $h$ , so $\mathbb{E}(f|\mathcal{A})\in L^{\infty}(\mathcal{I}^{\prime})$ . Hence $\mathcal{I}^{\prime}\operatorname{\perp\!\!\!\perp}\mathcal{A}$ . ∎

Remark 3.11.

To prove Proposition 3.1, we have made significant use of the transitive group action present on a cfr coset nilspace. We do not know whether the cubic coupling axioms can be proved for ultraproducts of more general compact nilspaces, where such a group action is not necessarily available. If the axioms still hold in such a setting, then this may yield an extension of Theorem 1.5 valid for all compact nilspaces.

3.1. Locating a separable factor yielding a Borel cubic coupling

Given a probability space $(\Omega,\mathcal{A},\lambda)$ , we say that a $\sigma$ -algebra $\mathcal{X}\subset\mathcal{A}$ is separable if $L^{1}_{\lambda}(\mathcal{X})$ is separable as a metric space. In this subsection we prove the following result.

Proposition 3.12.

Let $(\operatorname{X}_{i})_{i\in\mathbb{N}}$ be a sequence of cfr coset nilspaces. Then for every separable $\sigma$ -algebra $\mathcal{X}_{0}\subset\mathcal{L}_{\mathbf{X}}$ there is a separable $\sigma$ -algebra $\mathcal{X}\subset\mathcal{L}_{\mathbf{X}}$ such that $\mathcal{X}_{0}\subset\mathcal{X}$ and such that the Loeb measures $\mu^{\llbracket n\rrbracket}$ on the $\sigma$ -algebras $\mathcal{X}^{\llbracket n\rrbracket}$ form a cubic coupling.

The proof relies on the following two lemmas.

Lemma 3.13.

Let $(\Omega,\mathcal{A},\lambda)$ be a probability space and let $S$ be a finite set. For each $v\in S$ let $\mathcal{X}_{v}$ be a sub- $\sigma$ -algebra of $\mathcal{A}$ , and let $\mathcal{C}\subset\bigvee_{v\in S}\mathcal{X}_{v}$ be a separable $\sigma$ -algebra. Then there are separable $\sigma$ -algebras $\mathcal{X}_{v}^{\prime}\subset\mathcal{X}_{v}$ for $v\in S$ such that $\mathcal{C}\subset_{\lambda}\bigvee_{v\in S}\mathcal{X}_{v}^{\prime}$ .

Proof.

The separability of $\mathcal{C}$ implies that there is a dense sequence of functions $(f_{\ell})_{\ell\in\mathbb{N}}$ in $L^{1}(\mathcal{C})$ . By [7, Lemma 2.2], for each $\ell$ there is a sequence of functions $(f_{k,\ell})_{k\in\mathbb{N}}$ , where for each $k$ we have $\|f_{k,\ell}-f_{\ell}\|_{L^{1}}\leq 1/k$ and $f_{k,\ell}$ is a finite sum of bounded rank 1 functions, i.e. $f_{k,\ell}=\sum_{j=1}^{m_{k,\ell}}\prod_{v\in S}g_{v,j,k,\ell}$ where $g_{v,j,k,\ell}\in L^{\infty}(\mathcal{X}_{v})$ for every $j$ . Let $\mathcal{X}_{v}^{\prime}$ be the sub- $\sigma$ -algebra of $\mathcal{X}_{v}$ generated by the collection $\{g_{v,j,k,\ell}:\ell,k\in\mathbb{N},j\in[m_{k,\ell}]\}$ . This collection is countable, so $\mathcal{X}_{v}^{\prime}$ is separable. Now given any $f\in L^{1}(\mathcal{C})$ , for any $\epsilon>0$ there is $\ell$ such that $\|f-f_{\ell}\|_{L^{1}}<\epsilon/2$ , and there is $k$ such that $\|f_{\ell}-f_{k,\ell}\|_{L^{1}}<\epsilon/2$ , so $\|f-f_{k,\ell}\|_{L^{1}}<\epsilon$ , and by construction $f_{k,\ell}\in L^{1}(\bigvee_{v\in S}\mathcal{X}_{v}^{\prime})$ . Letting $\epsilon\to 0$ , we deduce that $\mathcal{C}\subset_{\lambda}\bigvee_{v\in S}\mathcal{X}_{v}^{\prime}$ . ∎

Let us single out the adjacent faces $F_{n,0}:=\{0\}\times\llbracket n-1\rrbracket$ , $F_{n,1}:=\llbracket n-1\rrbracket\times\{0\}$ in $\llbracket n\rrbracket$ . For $p\in[1,\infty]$ we denote by $\mathcal{U}^{p}(\mathcal{A})$ the unit ball of $L^{p}(\mathcal{A})$ .

Lemma 3.14.

Let $\mathcal{C}$ be a separable sub- $\sigma$ -algebra of $\mathcal{L}_{\mathbf{X}}$ . There is a separable $\sigma$ -algebra $\mathcal{D}$ with $\mathcal{C}\subset\mathcal{D}\subset\mathcal{L}_{\mathbf{X}}$ , such that for every $n\in\mathbb{N}$ , for every system $(f_{v})_{v\in F_{n,0}}$ of bounded $\mathcal{C}$ -measurable functions $f_{v}$ , we have $\mathbb{E}\big{(}\prod_{v\in F_{n,0}}f_{v}\operatorname{\circ}p_{v}|(\mathcal{L}_{\mathbf{X}})^{\llbracket n\rrbracket}_{F_{n,1}}\big{)}\in L^{\infty}(\mathcal{D}^{\llbracket n\rrbracket}_{F_{n,0}\cap F_{n,1}})$ .

Proof.

By assumption the metric space $L^{1}(\mathcal{C})$ is separable, and therefore so is the subset $\mathcal{U}^{\infty}(\mathcal{C})\subset L^{1}(\mathcal{C})$ , so there is a sequence $\mathcal{S}\subset\mathcal{U}^{\infty}(\mathcal{C})$ that is dense in $\mathcal{U}^{\infty}(\mathcal{C})$ relative to the $L^{1}$ -norm. Recall that $\mathcal{A}$ denotes $\mathcal{L}_{\mathbf{X}}^{\llbracket n\rrbracket}$ . Let $\langle\mathcal{C}\rangle_{n}$ denote the sub- $\sigma$ -algebra of $\mathcal{A}_{F_{n,1}}$ generated by all expectations $\mathbb{E}(\prod_{v\in F_{n,0}}g_{v}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})$ for systems $(g_{v})_{v\in F_{n,0}}$ of functions in $\mathcal{S}$ . Since $\langle\mathcal{C}\rangle_{n}$ is generated by countably many functions, it is separable. By the conditional independence axiom (Proposition 3.1) we have $\mathbb{E}(\prod_{v\in F_{n,0}}g_{v}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})\in L^{\infty}(\mathcal{A}_{F_{n,0}\cap F_{n,1}})$ . Hence $\langle\mathcal{C}\rangle_{n}\subset_{\lambda}\mathcal{A}_{F_{n,0}\cap F_{n,1}}$ . By Lemma 3.13, there is a separable $\sigma$ -algebra $\mathcal{D}_{n}\subset\mathcal{L}_{\mathbf{X}}$ such that $\langle\mathcal{C}\rangle_{n}\subset_{\lambda}(\mathcal{D}_{n})_{F_{n,0}\cap F_{n,1}}^{\llbracket n\rrbracket}$ . Let $\mathcal{D}=\mathcal{C}\vee\big{(}\bigvee_{n\in\mathbb{N}}\mathcal{D}_{n}\big{)}$ . Fix any system $\big{(}f_{v}\in\mathcal{U}^{\infty}(\mathcal{C})\big{)}_{v\in F_{n,0}}$ . For every $\epsilon>0$ , for each $v$ there is $g_{v}\in\mathcal{S}$ such that $\|f_{v}-g_{v}\|_{L^{1}}\leq\epsilon$ . Using telescoping sums we have $\|\mathbb{E}(\prod_{v\in F_{n,0}}f_{v}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})-\mathbb{E}(\prod_{v\in F_{n,0}}g_{v}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})\|_{L^{1}}\leq 2^{n}\,\epsilon$ .
Letting $\epsilon\to 0$ yields $\mathbb{E}(\prod_{v\in F_{n,0}}f_{v}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})\in L^{1}\big{(}(\mathcal{D}_{n})_{F_{n,0}\cap F_{n,1}}^{\llbracket n\rrbracket}\big{)}\subset L^{1}(\mathcal{D}_{F_{n,0}\cap F_{n,1}}^{\llbracket n\rrbracket})$ . The result follows. ∎
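The telescoping estimate used in this proof rests on an elementary pointwise inequality: for numbers $a_{j},b_{j}\in[-1,1]$ one has $|\prod_{j}a_{j}-\prod_{j}b_{j}|\leq\sum_{j}|a_{j}-b_{j}|$ . Integrating it, and using that conditional expectation is an $L^{1}$ -contraction, gives a bound of $|F_{n,0}|\,\epsilon=2^{n-1}\epsilon\leq 2^{n}\epsilon$ , assuming (as we read the setting) that each coordinate projection $p_{v}$ is measure preserving. The following is a minimal numerical sketch of the pointwise inequality only; the function name and the random sampling are illustrative assumptions, not part of the source.

```python
import random
from math import prod

def product_gap_bound_holds(fs, gs, tol=1e-12):
    """Check |prod(fs) - prod(gs)| <= sum_j |fs[j] - gs[j]| for numbers in [-1, 1].

    The tolerance only absorbs floating-point rounding."""
    lhs = abs(prod(fs) - prod(gs))
    rhs = sum(abs(f - g) for f, g in zip(fs, gs))
    return lhs <= rhs + tol

random.seed(0)
m = 2 ** 3  # number of factors, e.g. |F_{n,0}| = 2^(n-1) with n = 4

# Random systems of values in [-1, 1], playing the role of (f_v(x_v)) and (g_v(x_v)).
trials = [
    ([random.uniform(-1, 1) for _ in range(m)],
     [random.uniform(-1, 1) for _ in range(m)])
    for _ in range(1000)
]
all_ok = all(product_gap_bound_holds(fs, gs) for fs, gs in trials)
```

The inequality itself follows by replacing one factor at a time and bounding the remaining factors by 1 in absolute value.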

Proof of Proposition 3.12.

The consistency and ergodicity axioms hold with $\mathcal{L}_{\mathbf{X}}$ (by Lemma 3.2), so they clearly hold also for any sub- $\sigma$ -algebra of $\mathcal{L}_{\mathbf{X}}$ . In particular, for each $n$ we have to check the conditional independence axiom (for the suitable separable $\sigma$ -algebra $\mathcal{X}\subset\mathcal{L}_{\mathbf{X}}$ ) only for $F_{n,0},F_{n,1}$ , rather than for all pairs of adjacent $(n-1)$ -faces in $\llbracket n\rrbracket$ (indeed, the consistency axiom implies conditional independence for every such pair of faces, once we have it just for $F_{n,0},F_{n,1}$ ). So let us prove that there is a separable $\sigma$ -algebra $\mathcal{X}\subset\mathcal{L}_{\mathbf{X}}$ such that for each $n$ , for every system $(f_{v})_{v\in F_{n,0}}$ in $L^{\infty}(\mathcal{X})$ , we have $\mathbb{E}(\prod_{v\in F_{n,0}}f_{v}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})\in L^{\infty}(\mathcal{X}^{\llbracket n\rrbracket}_{F_{n,0}\cap F_{n,1}})$ (this is enough, since by [7, Lemma 2.2] every integrable $\mathcal{X}_{F_{n,0}}^{\llbracket n\rrbracket}$ -measurable function is a limit of finite sums of rank 1 functions $\prod_{v\in F_{n,0}}f_{v}\operatorname{\circ}p_{v}$ ). If we prove this, then we also have $\mathbb{E}(\prod_{v\in F_{n,0}}f_{v}\operatorname{\circ}p_{v}|\mathcal{X}^{\llbracket n\rrbracket}_{F_{n,1}})\in L^{\infty}(\mathcal{X}^{\llbracket n\rrbracket}_{F_{n,0}\cap F_{n,1}})$ , since $\mathcal{X}^{\llbracket n\rrbracket}_{F_{n,0}\cap F_{n,1}}\subset\mathcal{X}^{\llbracket n\rrbracket}_{F_{n,1}}\subset\mathcal{A}_{F_{n,1}}$ . 
To obtain $\mathcal{X}$ , we argue as follows: let $\mathcal{X}_{0}$ be the initial separable $\sigma$ -algebra in the proposition, and let $(\mathcal{X}_{i})_{i\in\mathbb{N}}$ be the increasing sequence of separable sub- $\sigma$ -algebras of $\mathcal{L}_{\mathbf{X}}$ defined inductively by letting $\mathcal{X}_{i}$ be the $\sigma$ -algebra $\mathcal{D}$ obtained by applying Lemma 3.14 with $\mathcal{C}=\mathcal{X}_{i-1}$ . Let $\mathcal{X}=\bigvee_{i\geq 0}\mathcal{X}_{i}$ . To see that this has the required property, fix any $n$ and let $(f_{v})_{v\in F_{n,0}}$ be any system of functions in $L^{\infty}(\mathcal{X})$ . We have to check that $\mathbb{E}(\prod_{v\in F_{n,0}}f_{v}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})\in L^{\infty}(\mathcal{X}^{\llbracket n\rrbracket}_{F_{n,0}\cap F_{n,1}})$ . It clearly suffices to do this assuming that $f_{v}\in\mathcal{U}^{\infty}(\mathcal{X})$ . Fix any $\epsilon>0$ . For each $v$ there is $f_{v}^{\prime}\in\mathcal{U}^{\infty}(\mathcal{X}_{i})$ for some $i=i(v)$ such that $\|f_{v}-f_{v}^{\prime}\|_{L^{1}}<\epsilon$ (indeed we can take $f_{v}^{\prime}$ to be a version of $\mathbb{E}(f_{v}|\mathcal{X}_{i})$ ). Letting $j=\max_{v\in F_{n,0}}i(v)$ , we have $f^{\prime}_{v}\in\mathcal{U}^{\infty}(\mathcal{X}_{j})$ for all $v$ . It then follows by construction and Lemma 3.14 that $\mathbb{E}(\prod_{v\in F_{n,0}}f_{v}^{\prime}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})\in L^{\infty}\big{(}(\mathcal{X}_{j+1})^{\llbracket n\rrbracket}_{F_{n,0}\cap F_{n,1}}\big{)}\subset L^{\infty}\big{(}\mathcal{X}^{\llbracket n\rrbracket}_{F_{n,0}\cap F_{n,1}}\big{)}$ . As in the previous proof, this expectation converges to $\mathbb{E}(\prod_{v\in F_{n,0}}f_{v}\operatorname{\circ}p_{v}|\mathcal{A}_{F_{n,1}})$ as $\epsilon\to 0$ , so the latter expectation is also $\mathcal{X}^{\llbracket n\rrbracket}_{F_{n,0}\cap F_{n,1}}$ -measurable modulo null sets, as required. ∎

4. Stability of morphisms into compact finite-rank nilspaces

By a compatible metric on a topological space $X$ we mean a metric $d$ on $X$ which generates the given topology on $X$ . Given such a metric $d$ on $X$ , for any $x,y\in X$ and $\epsilon>0$ we write $x\approx_{\epsilon}y$ to mean that $d(x,y)\leq\epsilon$ . Recall that if $G$ is a compact group acting continuously on a metric space $X$ with metric $d$ , then we can always define a compatible metric $d^{\prime}$ on $X$ which is also $G$ -invariant, meaning that for all $x,y\in X$ and $g\in G$ we have $d^{\prime}(gx,gy)=d^{\prime}(x,y)$ (see [34, Proposition 1.1.12]).
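For intuition, one standard way to obtain an invariant metric is to maximise the given metric over the group orbit of a pair of points; this may or may not be the construction used in [34]. The following is a minimal sketch on a finite toy example, assuming the group $\mathbb{Z}/N$ acting on itself by translation and an arbitrarily chosen non-invariant metric `d`; all names in it are hypothetical.

```python
from itertools import product

N = 5
weights = [0.0, 1.0, 3.0, 7.0, 15.0]  # injective, so d below is a metric on Z/N

def d(x, y):
    """A metric on Z/N that is not translation-invariant."""
    return abs(weights[x] - weights[y])

def d_inv(x, y):
    """Maximise d over the translates of the pair (x, y)."""
    return max(d((x + g) % N, (y + g) % N) for g in range(N))

points = range(N)

# Invariance: d_inv(g.x, g.y) = d_inv(x, y) for every group element g.
is_invariant = all(
    d_inv((x + g) % N, (y + g) % N) == d_inv(x, y)
    for x, y, g in product(points, repeat=3)
)

# Metric axioms: symmetry, separation of points, triangle inequality.
is_metric = all(
    d_inv(x, y) == d_inv(y, x)
    and (d_inv(x, y) == 0) == (x == y)
    and d_inv(x, z) <= d_inv(x, y) + d_inv(y, z)
    for x, y, z in product(points, repeat=3)
)
```

On a finite space every metric is compatible with the discrete topology; in the general compact setting, compatibility of the resulting metric relies on the compactness of $G$ and the continuity of the action.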

Given compact nilspaces $\operatorname{X},\operatorname{Y}$ , with a compatible metric $d$ on $\operatorname{Y}$ , we define a pseudometric $d_{1}$ on the space of Borel measurable functions $\phi:\operatorname{X}\to\operatorname{Y}$ by the formula $d_{1}(\phi_{1},\phi_{2})=\int_{\operatorname{X}}d\big{(}\phi_{1}(x),\phi_{2}(x)\big{)}\,\mathrm{d}\mu_{\operatorname{X}}(x)$ .

Definition 4.1.

Let $\operatorname{X},\operatorname{Y}$ be $k$ -step compact nilspaces, and let $d$ be a compatible metric on $\operatorname{Y}$ . For $\delta>0$ , a $(\delta,1)$ -quasimorphism from $\operatorname{X}$ to $\operatorname{Y}$ (relative to $d$ ) is a Borel measurable map $\phi:\operatorname{X}\to\operatorname{Y}$ satisfying

[TABLE]

where $\mu_{\operatorname{X}}^{\llbracket k+1\rrbracket}$ denotes the Haar probability measure on $\operatorname{C}^{k+1}(\operatorname{X})$ .

We write “ $(\delta,1)$ -quasimorphism”, rather than just “ $\delta$ -quasimorphism”, to distinguish this notion from the quasimorphisms defined in [4, Definition 2.8.1], which we call here $(\delta,\infty)$ -quasimorphisms; these are defined by replacing property (2) with the uniform (and stronger) property $\forall\operatorname{c}\in\operatorname{C}^{k+1}(\operatorname{X}),\,\exists\operatorname{c}^{\prime}\in\operatorname{C}^{k+1}(\operatorname{Y}),\,\forall\,v\in\llbracket k+1\rrbracket,\,\phi\operatorname{\circ}\operatorname{c}(v)\approx_{\delta}\operatorname{c}^{\prime}(v)$ .

In our proof of Theorem 1.5 in Section 5, a key ingredient is the following stability (or rigidity) result for morphisms.

Theorem 4.2.

Let $\operatorname{Y}$ be a $k$ -step cfr nilspace with compatible metric $d$ . For every $\epsilon>0$ there exists $\delta=\delta(\epsilon,\operatorname{Y})>0$ such that if $\operatorname{X}$ is a compact nilspace and $\phi:\operatorname{X}\to\operatorname{Y}$ is a $(\delta,1)$ -quasimorphism, then there exists a continuous morphism $\phi^{\prime}:\operatorname{X}\to\operatorname{Y}$ such that $d_{1}(\phi,\phi^{\prime})\leq\epsilon$ .

This theorem is an analogue, for $(\delta,1)$ -quasimorphisms, of the uniform stability result for $(\delta,\infty)$ -quasimorphisms given in [2, Theorem 5] (see also [4, Theorem 2.8.2]). Indeed, we obtain the statement of this uniform stability result by replacing in Theorem 4.2 every “1” by “ $\infty$ ” (where $d_{\infty}(\phi_{1},\phi_{2})=\sup_{x\in\operatorname{X}}d(\phi_{1}(x),\phi_{2}(x))$ ).

4.1. Cocycles close to the 0 cocycle are coboundaries

Recall that the group $\operatorname{Aut}(\llbracket k\rrbracket)$ of automorphisms of the cube $\llbracket k\rrbracket$ is generated by permutations of $[k]=\{1,2,\ldots,k\}$ and coordinate reflections. For $\theta\in\operatorname{Aut}(\llbracket k\rrbracket)$ we write $r(\theta)$ for the number of reflections involved in $\theta$ . Equivalently, $r(\theta)$ is the number of coordinates equal to 1 of $\theta(0^{k})$ . Two $n$ -cubes $\operatorname{c}_{1},\operatorname{c}_{2}$ on a nilspace are adjacent if $\operatorname{c}_{1}(v,1)=\operatorname{c}_{2}(v,0)$ for all $v\in\llbracket n-1\rrbracket$ ; we can then form their concatenation, which is the $n$ -cube $\operatorname{c}$ such that $\operatorname{c}(v,0)=\operatorname{c}_{1}(v,0)$ and $\operatorname{c}(v,1)=\operatorname{c}_{2}(v,1)$ for all $v\in\llbracket n-1\rrbracket$ (see [3, Lemma 3.1.7]).

We now recall the definition of a nilspace cocycle, which is fundamental to the structural analysis of nilspaces (see [2, Definition 2.14] or [3, Definition 3.3.14]).

Definition 4.3.

Let $\operatorname{X}$ be a nilspace, $\operatorname{Z}$ an abelian group, and $k\in\mathbb{Z}_{\geq-1}$ . A $\operatorname{Z}$ -valued cocycle of degree $k$ on $\operatorname{X}$ is a function $\rho:\operatorname{C}^{k+1}(\operatorname{X})\to\operatorname{Z}$ with the following properties:

(i) If $\operatorname{c}\in\operatorname{C}^{k+1}(\operatorname{X})$ and $\theta\in\operatorname{Aut}(\llbracket k+1\rrbracket)$ , then $\rho(\operatorname{c}\operatorname{\circ}\theta)=(-1)^{r(\theta)}\rho(\operatorname{c})$ .

(ii) If $\operatorname{c}_{3}$ is the concatenation of cubes $\operatorname{c}_{1},\operatorname{c}_{2}\in\operatorname{C}^{k+1}(\operatorname{X})$ then $\rho(\operatorname{c}_{3})=\rho(\operatorname{c}_{1})+\rho(\operatorname{c}_{2})$ .

We recall also that for any $n\in\mathbb{N}$ and any group $G$ we denote by $\sigma_{n}$ the Gray-code map $G^{\llbracket n\rrbracket}\to G$ from [3, Definition 2.2.22]; in particular if $G$ is abelian we have $\sigma_{n}(g):=\sum_{v\in\llbracket n\rrbracket}(-1)^{|v|}g(v)$ for every $g:\llbracket n\rrbracket\to G$ . Using this notation, we say that a cocycle $\rho$ of degree $k$ on $\operatorname{X}$ is a coboundary (of degree $k$ ) if there is a function $f:\operatorname{X}\to\operatorname{Z}$ such that $\rho(\operatorname{c})=\sigma_{k+1}(f\operatorname{\circ}\operatorname{c})$ for every $\operatorname{c}\in\operatorname{C}^{k+1}(\operatorname{X})$ . We refer to [3, §3.3.3] for more background on cocycles and coboundaries.
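To make the definitions concrete, the following minimal sketch checks on a finite toy example that a coboundary satisfies both cocycle axioms. It assumes $\operatorname{X}=\mathbb{Z}/N$ with its degree-1 cubes $v\mapsto x+v_{1}h_{1}+v_{2}h_{2}$ , the target group $\operatorname{Z}=\mathbb{Z}/M$ , degree $k=1$ , and an arbitrary function `f`; these choices and all helper names are illustrative assumptions, not taken from the source.

```python
from itertools import product

N, M = 6, 4  # X = Z/N (degree-1 cubes), Z = Z/M; both choices are illustrative

def gray_sum(values, n):
    """sigma_n for the abelian group Z/M: sum over v of (-1)^{|v|} values[v]."""
    return sum((-1) ** sum(v) * values[v] for v in product((0, 1), repeat=n)) % M

def cube(x, h1, h2):
    """The 2-cube v -> x + v1*h1 + v2*h2 on Z/N."""
    return {(a, b): (x + a * h1 + b * h2) % N for a, b in product((0, 1), repeat=2)}

f = [0, 1, 3, 2, 0, 1]  # an arbitrary function Z/N -> Z/M

def rho(c):
    """Coboundary of degree 1: rho(c) = sigma_2(f o c)."""
    return gray_sum({v: f[c[v]] for v in c}, 2)

def reflect_last(c):
    """Compose c with the reflection of the last coordinate (r(theta) = 1)."""
    return {(a, b): c[a, 1 - b] for a, b in c}

def swap(c):
    """Compose c with the transposition of the two coordinates (r(theta) = 0)."""
    return {(a, b): c[b, a] for a, b in c}

params = list(product(range(N), repeat=4))

# Axiom (i): one reflection flips the sign, a permutation leaves rho unchanged.
axiom_i = all(
    rho(reflect_last(cube(x, h1, h2))) == (-rho(cube(x, h1, h2))) % M
    and rho(swap(cube(x, h1, h2))) == rho(cube(x, h1, h2))
    for x, h1, h2, _ in params
)

# Axiom (ii): cube(x, h1, h2) and cube(x + h2, h1, k2) are adjacent,
# and their concatenation is cube(x, h1, h2 + k2).
axiom_ii = all(
    rho(cube(x, h1, h2 + k2))
    == (rho(cube(x, h1, h2)) + rho(cube(x + h2, h1, k2))) % M
    for x, h1, h2, k2 in params
)
```

In axiom (ii) the values of `f` on the shared face appear with opposite signs in the two alternating sums and cancel, which is why coboundaries are automatically additive under concatenation.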

The proof of Theorem 4.2, given in Subsection 4.2, relies on the following stability result for cocycles, which is the main result in this subsection.

Proposition 4.4.

Let $\operatorname{Z}$ be a compact abelian group, and let $d_{\operatorname{Z}}$ be a compatible $\operatorname{Z}$ -invariant metric on $\operatorname{Z}$ . There exists $\epsilon>0$ such that the following holds. If $\operatorname{X}$ is a compact nilspace and $\rho:\operatorname{C}^{k}(\operatorname{X})\to\operatorname{Z}$ is a Borel cocycle such that $d_{1}(0,\rho):=\int_{\operatorname{C}^{k}(\operatorname{X})}d_{\operatorname{Z}}\big{(}\rho(\operatorname{c}),0_{\operatorname{Z}}\big{)}\,\mathrm{d}\mu_{\operatorname{C}^{k}(\operatorname{X})}(\operatorname{c})\leq\epsilon$ , then $\rho$ is a coboundary.

A key element in the proof of Proposition 4.4 is the following result.

Lemma 4.5.

Let $\operatorname{X}$ be a compact nilspace, let $\operatorname{Z}$ be a compact abelian group with compatible $\operatorname{Z}$ -invariant metric $d_{\operatorname{Z}}$ , let $\rho:\operatorname{C}^{k}(\operatorname{X})\to\operatorname{Z}$ be a Borel measurable cocycle, let $0<\epsilon<2^{-4k}$ , and suppose that $d_{1}(\rho,0)\leq\epsilon$ . Then there is a Borel set $S\subset\operatorname{X}$ such that $\mu_{\operatorname{X}}(S)>1-\epsilon^{1/2}$ and $d_{\operatorname{Z}}(\rho(\operatorname{c}),0)\leq 2^{k}\epsilon^{1/4}$ for every $\operatorname{c}\in\operatorname{C}^{k}(\operatorname{X})\cap S^{\llbracket k\rrbracket}$ .

The proof employs tricubes, which are very useful tools in nilspace theory ([3, §3.1.3]), especially because they enable an operation akin to convolution (called tricube composition) to be performed with cubes (see [3, Lemma 3.1.16]). A crucial property of cocycles, which is used repeatedly in this section, is that they commute with this operation in the sense captured in [2, Lemma 2.18] (see also [3, Lemma 3.3.31]).

Proof of Lemma 4.5.

Let

[TABLE]

where $\operatorname{C}^{k}_{x}(\operatorname{X}):=\{\operatorname{c}\in\operatorname{C}^{k}(\operatorname{X}):\operatorname{c}(0^{k})=x\}$ , and $\mu_{\operatorname{C}^{k}_{x}(\operatorname{X})}$ denotes the Haar probability measure on $\operatorname{C}^{k}_{x}(\operatorname{X})$ (see [4, Lemma 2.2.17]). By Markov’s inequality, we have

[TABLE]

Hence $\mu_{\operatorname{X}}(S)>1-\epsilon^{1/2}$ .

Now if $\operatorname{c}\in\operatorname{C}^{k}(\operatorname{X})\cap S^{\llbracket k\rrbracket}$ , then for each $v\in\llbracket k\rrbracket$ , by definition of $S$ there is a measure at least $1-\epsilon^{1/4}$ of cubes $\operatorname{c}^{\prime}\in\operatorname{C}^{k}_{\operatorname{c}(v)}(\operatorname{X})$ such that $d_{\operatorname{Z}}(\rho(\operatorname{c}^{\prime}),0)\leq\epsilon^{1/4}$ . Recall that the restricted tricube space $\mathcal{T}(\operatorname{c}):=\hom_{\operatorname{c}\operatorname{\circ}\omega_{k}^{-1}}(T_{k},\operatorname{X})$ , being an iterated compact abelian bundle, has a Haar measure (see [4, Lemma 2.2.12], and see [3, Definition 3.1.15] for the notion of the outer-point map $\omega_{k}$ ). Let us denote this Haar measure by $\mu_{\mathcal{T}(\operatorname{c})}$ . For each $v\in\llbracket k\rrbracket$ the map $\mathcal{T}(\operatorname{c})\to\operatorname{C}^{k}_{\operatorname{c}(v)}(\operatorname{X})$ , $t\mapsto t\operatorname{\circ}\Psi_{v}$ takes this measure $\mu_{\mathcal{T}(\operatorname{c})}$ to the Haar measure on $\operatorname{C}^{k}_{\operatorname{c}(v)}(\operatorname{X})$ (see [4, Corollary 2.2.22], and see [3, Definition 3.1.13] for the maps $\Psi_{v}$ ). It follows from this and the union bound that

[TABLE]

Our assumption for $\epsilon$ implies that this measure is positive, so there exists $t\in\mathcal{T}(\operatorname{c})$ with this property, namely such that $d_{\operatorname{Z}}\big{(}\rho(t\operatorname{\circ}\Psi_{v}),0\big{)}\leq\epsilon^{1/4}$ for every $v\in\llbracket k\rrbracket$ . For this tricube $t$ , we apply the formula $\rho(\operatorname{c})=\sum_{v\in\llbracket k\rrbracket}(-1)^{|v|}\rho(t\operatorname{\circ}\Psi_{v})$ , which holds for every tricube in $\mathcal{T}(\operatorname{c})$ by [3, Lemma 3.3.31]. By the triangle inequality and $\operatorname{Z}$ -invariance of $d_{\operatorname{Z}}$ , we obtain $d_{\operatorname{Z}}(\rho(\operatorname{c}),0)\leq\sum_{v\in\llbracket k\rrbracket}d_{\operatorname{Z}}(\rho(t\operatorname{\circ}\Psi_{v}),0)\leq 2^{k}\epsilon^{1/4}$ , as claimed. ∎
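The role of the hypothesis $\epsilon<2^{-4k}$ in this proof is purely arithmetic. The following minimal sketch assumes our reading of the argument, namely that the union bound runs over the $2^{k}$ vertices of $\llbracket k\rrbracket$ with an exceptional set of measure at most $\epsilon^{1/4}$ for each vertex, which gives the lower bound $1-2^{k}\epsilon^{1/4}$ ; it is not a quotation of the displayed estimate. The function name is hypothetical.

```python
def good_tricube_measure_lower_bound(k, eps):
    """Union bound over the 2^k vertices of the k-cube.

    Each vertex v contributes an exceptional set of tricubes of measure
    at most eps^(1/4), so the good set has measure >= 1 - 2^k * eps^(1/4)."""
    return 1 - 2 ** k * eps ** 0.25

# Below the threshold 2^(-4k) the lower bound is positive, so a good tricube exists.
below = {k: good_tricube_measure_lower_bound(k, 0.9 * 2.0 ** (-4 * k))
         for k in range(1, 6)}

# Above the threshold the union bound gives no information.
above = {k: good_tricube_measure_lower_bound(k, 2.0 * 2.0 ** (-4 * k))
         for k in range(1, 6)}
```

For a good tricube, the triangle inequality over the same $2^{k}$ terms gives the bound $2^{k}\epsilon^{1/4}$ stated in the lemma.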

Using the set $S$ provided by Lemma 4.5, we can define a function $g:\operatorname{X}\to\operatorname{Z}$ such that, subtracting the coboundary $\operatorname{c}\mapsto\sigma_{k}(g\operatorname{\circ}\operatorname{c})$ from $\rho$ , we obtain a new cocycle $\rho^{\prime}$ whose values are uniformly close to $0$ (not just close in $d_{1}$ ), as follows.

Lemma 4.6.

Let $\operatorname{X}$ be a compact nilspace, let $\operatorname{Z}$ be a compact abelian group with compatible $\operatorname{Z}$ -invariant metric $d_{\operatorname{Z}}$ , let $C$ denote the diameter of $\operatorname{Z}$ relative to $d_{\operatorname{Z}}$ , let $\rho:\operatorname{C}^{k}(\operatorname{X})\to\operatorname{Z}$ be a Borel cocycle, let $\epsilon\in(0,2^{-4k})$ , and suppose that $d_{1}(\rho,0)\leq\epsilon$ . Then there is a Borel function $g:\operatorname{X}\to\operatorname{Z}$ with $d_{1}(g,0)\leq(2+C)4^{k}\epsilon^{1/4}$ such that $\rho^{\prime}:\operatorname{c}\mapsto\rho(\operatorname{c})-\sigma_{k}(g\operatorname{\circ}\operatorname{c})$ satisfies $d_{\operatorname{Z}}(\rho^{\prime}(\operatorname{c}),0)\leq 8^{k}\epsilon^{1/4}$ , $\forall\operatorname{c}\in\operatorname{C}^{k}(\operatorname{X})$ .

Proof.

Let $S$ be the subset of $\operatorname{X}$ given by Lemma 4.5.

We claim that for every $x\in\operatorname{X}$ there exists an element $g(x)\in\operatorname{Z}$ such that

[TABLE]

To see this, fix any $x\in\operatorname{X}$ , and note that for each $v\neq 0^{k}$ , the map $\operatorname{C}^{k}_{x}(\operatorname{X})\to\operatorname{X}$ , $\operatorname{c}\mapsto\operatorname{c}(v)$ preserves the Haar measures (by [4, Lemma 2.2.14] with $n=k$ , $P=\llbracket k\rrbracket$ , $P_{1}=\{0^{k}\}$ , $P_{2}=\{v\}$ ). Since $\mu(S)>1-\epsilon^{1/2}$ , by the union bound we therefore have $\mu_{\operatorname{C}^{k}_{x}(\operatorname{X})}\big{(}\big{\{}\operatorname{c}\in\operatorname{C}^{k}_{x}(\operatorname{X}):\forall\,v\neq 0^{k},\,\operatorname{c}(v)\in S\big{\}}\big{)}>1-(2^{k}-1)\epsilon^{1/2}$ . Fix any cube $\operatorname{c}_{0}\in\operatorname{C}^{k}_{x}(\operatorname{X})$ with $\operatorname{c}_{0}(v)\in S$ for every $v\neq 0^{k}$ . Combining the last inequality with the fact (used in the previous proof) that the map $\mathcal{T}(\operatorname{c}_{0})\to\operatorname{C}^{k}_{\operatorname{c}_{0}(v)}(\operatorname{X})$ , $t\mapsto t\operatorname{\circ}\Psi_{v}$ preserves the Haar measures, we deduce by the union bound that

[TABLE]

Let $g(x):=\rho(\operatorname{c}_{0})$ , and note that $\operatorname{c}_{0}$ can be chosen to make the function $g:\operatorname{X}\to\operatorname{Z}$ Borel, by [29, Theorem (12.16), (12.18)] and the continuity of the map $\operatorname{c}\mapsto\operatorname{c}(0^{k})$ .

For every tricube $t$ in the above set, we have $\rho(\operatorname{c}_{0})=\sum_{v\in\llbracket k\rrbracket}(-1)^{|v|}\rho(t\operatorname{\circ}\Psi_{v})$ and, for every $v\neq 0^{k}$ , since $t\operatorname{\circ}\Psi_{v}\in S^{\llbracket k\rrbracket}$ , we have $d_{\operatorname{Z}}(\rho(t\operatorname{\circ}\Psi_{v}),0)\leq 2^{k}\epsilon^{1/4}$ by Lemma 4.5. We deduce that $d_{\operatorname{Z}}\big{(}g(x),\rho(t\operatorname{\circ}\Psi_{0^{k}})\big{)}\leq 4^{k}\epsilon^{1/4}$ . Hence

[TABLE]

Since the map $\mathcal{T}(\operatorname{c}_{0})\to\operatorname{C}^{k}_{x}(\operatorname{X})$ , $t\mapsto t\operatorname{\circ}\Psi_{0^{k}}$ preserves the Haar measures, we have that (4) is equivalent to (3), which proves our claim.

Define the coboundary $f:\operatorname{C}^{k}(\operatorname{X})\to\operatorname{Z}$ by $f(\operatorname{c})=\sigma_{k}(g\operatorname{\circ}\operatorname{c})$ . Fix any cube $\operatorname{c}\in\operatorname{C}^{k}(\operatorname{X})$ . By the measure-preserving properties used earlier, the union bound, and (3), we have

[TABLE]

By our assumption on $\epsilon$ we have $8^{k}\epsilon^{1/2}<1$ , so there exists $t\in\mathcal{T}(\operatorname{c})$ with the above property. Applying the formula $\rho(\operatorname{c})=\sum_{v\in\llbracket k\rrbracket}(-1)^{|v|}\rho(t\operatorname{\circ}\Psi_{v})$ for this $t$ , and the triangle inequality (and shift invariance of $d_{\operatorname{Z}}$ ), we deduce that $d_{\operatorname{Z}}\big{(}\rho(\operatorname{c}),f(\operatorname{c})\big{)}\leq 8^{k}\epsilon^{1/4}$ , as required. Finally, we have

[TABLE]

The latter integral is $d_{1}(\rho,0)$ , and by (3) the former integral is at most $(1+C)4^{k}\epsilon^{1/4}$ . Hence $d_{1}(g,0)\leq d_{1}(\rho,0)+(1+C)4^{k}\epsilon^{1/4}\leq(2+C)4^{k}\epsilon^{1/4}$ , as required. ∎

We can now complete the proof of the stability result for cocycles.

Proof of Proposition 4.4.

We know by [4, Lemma 2.5.7] that there exists $\epsilon_{0}>0$ depending only on $\operatorname{Z}$ and $k$ such that if a cocycle $\rho^{\prime}:\operatorname{C}^{k}(\operatorname{X})\to\operatorname{Z}$ takes all its values within distance $\epsilon_{0}$ of $0_{\operatorname{Z}}$ , then $\rho^{\prime}$ is a coboundary. Applying Lemma 4.6 with $\epsilon$ sufficiently small in terms of $\epsilon_{0}$ and $k$ , we conclude that $\rho-f$ is a coboundary, where $f(\operatorname{c})=\sigma_{k}(g\operatorname{\circ}\operatorname{c})$ . Since $f$ is also a coboundary, it follows that $\rho$ is a coboundary. ∎
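The smallness condition on $\epsilon$ in this proof can be made explicit. The following is a sketch only: it assumes that $\rho$ satisfies the hypotheses of Lemma 4.6 with parameter $\epsilon$ , and it uses the pointwise bound $d_{\operatorname{Z}}\big{(}\rho(\operatorname{c}),f(\operatorname{c})\big{)}\leq 8^{k}\epsilon^{1/4}$ obtained in the proof of that lemma. The explicit threshold below is an illustration, not a constant stated in the source, and any further smallness assumption built into Lemma 4.6 must be imposed in addition.

```latex
% Shift invariance of d_Z and the pointwise bound from the proof of Lemma 4.6,
% valid for every cube c in C^k(X):
\[
  d_{\mathrm{Z}}\big(\rho(\mathrm{c}) - f(\mathrm{c}),\, 0_{\mathrm{Z}}\big)
  = d_{\mathrm{Z}}\big(\rho(\mathrm{c}),\, f(\mathrm{c})\big)
  \le 8^{k}\,\epsilon^{1/4}.
\]
% The cocycle rho - f therefore takes all its values within distance epsilon_0
% of 0_Z (the hypothesis of [4, Lemma 2.5.7]) as soon as
\[
  8^{k}\,\epsilon^{1/4} \le \epsilon_{0}
  \quad\Longleftrightarrow\quad
  \epsilon \le \epsilon_{0}^{4}\, 8^{-4k}.
\]
```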

4.2. Proof of the stability result for morphisms

Given a $k$ -step nilspace $\operatorname{X}$ , for $j\in[k]$ we denote by $\operatorname{X}_{j}$ the $j$ -th factor of $\operatorname{X}$ (also denoted by $\mathcal{F}_{j}(\operatorname{X})$ , with $\mathcal{F}_{k}(\operatorname{X})=\operatorname{X}$ ), and by $\pi_{j}$ the factor map $\operatorname{X}\to\operatorname{X}_{j}$ (see [3, Lemma 3.2.10]). If $\operatorname{X}$ is compact, with a compatible $\operatorname{Z}_{k}$ -invariant metric $d$ , we can always metrize $\operatorname{X}_{k-1}$ with the quotient metric corresponding to $d$ in the standard way (see [4, (2.2)]).

We shall use the following rectification result for cubes (see [4, Lemma 2.8.3]).

Lemma 4.7.

Let $\operatorname{X}$ be a $k$ -step compact nilspace with compatible $\operatorname{Z}_{k}$ -invariant metric $d$ , and let $d^{\prime}$ be the quotient metric on $\operatorname{X}_{k-1}$ . For every $\epsilon>0$ there exists $\delta>0$ such that the following holds. If $\operatorname{c}\in\operatorname{C}^{k+1}(\operatorname{X})$ satisfies $d^{\prime}\big{(}\pi_{k-1}\operatorname{\circ}\operatorname{c}(\cdot,0),\pi_{k-1}\operatorname{\circ}\operatorname{c}(\cdot,1)\big{)}\leq\delta$ on $\llbracket k\rrbracket$ , then there is $\operatorname{c}^{\prime}\in\operatorname{C}^{k+1}(\operatorname{X})$ with $\operatorname{c}\approx_{\epsilon}\operatorname{c}^{\prime}$ and $\pi_{k-1}\operatorname{\circ}\operatorname{c}^{\prime}(\cdot,0)=\pi_{k-1}\operatorname{\circ}\operatorname{c}^{\prime}(\cdot,1)$ on $\llbracket k\rrbracket$ .

Recall from [3, Definition 2.2.30] the notation $\mathcal{D}_{k}(\operatorname{Z})$ for the degree- $k$ nilspace structure on an abelian group $\operatorname{Z}$ . In our proof of Theorem 4.2, we argue by induction on $k$ . Each step of the induction uses the following special case of the theorem.

Lemma 4.8.

Let $\operatorname{Z}$ be a compact abelian Lie group equipped with a compatible $\operatorname{Z}$ -invariant metric $d_{\operatorname{Z}}$ , and let $k\in\mathbb{Z}_{\geq 0}$ . For every $\epsilon>0$ there exists $\delta=\delta(\epsilon,k,\operatorname{Z})>0$ such that if $\phi$ is a $(\delta,1)$ -quasimorphism from a compact $k$ -step nilspace $\operatorname{X}$ to $\mathcal{D}_{k}(\operatorname{Z})$ , then there is a morphism $\phi^{\prime}:\operatorname{X}\to\mathcal{D}_{k}(\operatorname{Z})$ such that $d_{1}(\phi,\phi^{\prime})\leq\epsilon$ .

Proof.

Let $C$ be the diameter of $\operatorname{Z}$ relative to $d_{\operatorname{Z}}$ . Let $\delta^{\prime}\in\big{(}0,\epsilon/(2+C)\big{)}$ be sufficiently small for the conclusion of [4, Theorem 2.8.2] to hold with initial parameter $\epsilon/2$ , for every $(\delta^{\prime},\infty)$ -quasimorphism $\operatorname{X}\to\mathcal{D}_{k}(\operatorname{Z})$ . Let $0<\delta<\delta^{\prime 4}/\big{(}8^{4(k+1)}(2^{k+1}+C)\big{)}$ .

Let $\rho$ be the coboundary $\operatorname{c}\mapsto\sigma_{k+1}(\phi\operatorname{\circ}\operatorname{c})$ . From our assumption, inequality (2), and the definition of the cube structure on $\mathcal{D}_{k}(\operatorname{Z})$ (see [3, formula (2.9)]) it follows that $d_{1}(\rho,0)\leq(2^{k+1}+C)\delta$ . By Lemma 4.6 applied with $\epsilon_{0}=(2^{k+1}+C)\delta$ , there exists a Borel function $g:\operatorname{X}\to\operatorname{Z}$ such that $d_{\operatorname{Z}}\big{(}\rho(\operatorname{c})-\sigma_{k+1}(g\operatorname{\circ}\operatorname{c}),0\big{)}\leq 8^{k+1}\epsilon_{0}^{1/4}<\delta^{\prime}$ for every cube $\operatorname{c}\in\operatorname{C}^{k+1}(\operatorname{X})$ . Equivalently, the map $\phi_{1}:\operatorname{X}\to\operatorname{Z}$ , $x\mapsto\phi(x)-g(x)$ satisfies $d_{\operatorname{Z}}\big{(}\sigma_{k+1}(\phi_{1}\operatorname{\circ}\operatorname{c}),0\big{)}\leq\delta^{\prime}$ . Let $\operatorname{c}^{\prime}\in\operatorname{C}^{k+1}\big{(}\mathcal{D}_{k}(\operatorname{Z})\big{)}$ be the cube such that $\operatorname{c}^{\prime}(v)=\phi_{1}\operatorname{\circ}\operatorname{c}(v)$ for $v\neq 0^{k+1}$ and $\operatorname{c}^{\prime}(0^{k+1})=\phi_{1}\operatorname{\circ}\operatorname{c}(0^{k+1})-\sigma_{k+1}(\phi_{1}\operatorname{\circ}\operatorname{c})$ (note that $\operatorname{c}^{\prime}$ is indeed in $\operatorname{C}^{k+1}\big{(}\mathcal{D}_{k}(\operatorname{Z})\big{)}$ since $\sigma_{k+1}(\operatorname{c}^{\prime})=0$ ). We clearly have $d_{\operatorname{Z}}\big{(}\operatorname{c}^{\prime}(v),\phi_{1}\operatorname{\circ}\operatorname{c}(v)\big{)}\leq\delta^{\prime}$ for every $v\in\llbracket k+1\rrbracket$ . We have thus shown that $\phi_{1}$ is a $(\delta^{\prime},\infty)$ -quasimorphism.

We can thus apply [4, Theorem 2.8.2] to conclude that there is a continuous morphism $\phi^{\prime}:\operatorname{X}\to\mathcal{D}_{k}(\operatorname{Z})$ such that $d_{\operatorname{Z}}\big{(}\phi_{1}(x),\phi^{\prime}(x)\big{)}\leq\epsilon/2$ for all $x\in\operatorname{X}$ . Hence $d_{1}(\phi,\phi^{\prime})\leq d_{1}(\phi,\phi_{1})+d_{1}(\phi_{1},\phi^{\prime})\leq d_{1}(g,0)+\epsilon/2$ . By Lemma 4.6 and the choice of $\delta$ we have $d_{1}(g,0)\leq(2+C)4^{k+1}\epsilon_{0}^{1/4}<\frac{(2+C)\delta^{\prime}}{2^{k+1}}\leq\epsilon/2$ , whence $d_{1}(\phi,\phi^{\prime})\leq\epsilon$ . ∎
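For the reader's convenience, the chain of parameter choices in this proof can be written out as follows. This is only an unpacking of the constants $\delta^{\prime}$ , $\delta$ and $\epsilon_{0}$ fixed above, using $4^{k+1}/8^{k+1}=2^{-(k+1)}$ and $k\geq 0$ ; no new quantity is introduced.

```latex
% Requires amsmath. Here epsilon_0 = (2^{k+1}+C) delta, delta' < epsilon/(2+C),
% and delta < delta'^4 / (8^{4(k+1)} (2^{k+1}+C)).
\begin{align*}
  \epsilon_{0} = (2^{k+1}+C)\,\delta
    &< \frac{\delta'^{\,4}}{8^{4(k+1)}}
    && \text{(choice of } \delta \text{)}, \\
  8^{k+1}\,\epsilon_{0}^{1/4}
    &< \delta'
    && \text{(fourth root of the previous line)}, \\
  (2+C)\,4^{k+1}\,\epsilon_{0}^{1/4}
    &< \frac{(2+C)\,\delta'}{2^{k+1}}
     < \frac{\epsilon}{2^{k+1}}
     \le \frac{\epsilon}{2}
    && \text{(since } \delta' < \epsilon/(2+C) \text{)}.
\end{align*}
```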

We need one more lemma before the proof of Theorem 4.2. This lemma enables us to lift certain Borel maps, and is useful for the inductive step in the proof of the theorem.

Lemma 4.9.

Let $\operatorname{Y}$ be a $k$ -step cfr nilspace, with $k$ -th structure group $\operatorname{Z}_{k}$ , let $d$ be a $\operatorname{Z}_{k}$ -invariant compatible metric on $\operatorname{Y}$ , with corresponding quotient metric $d^{\prime}$ on $\operatorname{Y}_{k-1}$ . For every $\epsilon>0$ there exists $\delta>0$ such that the following holds. Let $\operatorname{X}$ be a $k$ -step compact nilspace, let $\phi:\operatorname{X}\to\operatorname{Y}$ be a Borel map, let $\phi_{1}=\pi_{k-1,\operatorname{Y}}\operatorname{\circ}\phi:\operatorname{X}\to\operatorname{Y}_{k-1}$ , and let $\phi_{2}:\operatorname{X}\to\operatorname{Y}_{k-1}$ be a continuous map such that for some Borel set $A\subset\operatorname{X}$ we have $d^{\prime}(\phi_{1}(x),\phi_{2}(x))<\delta$ for every $x\in A$ . Then there is a Borel map $\phi_{3}:\operatorname{X}\to\operatorname{Y}$ such that for every $x\in\operatorname{X}$ , $\pi_{k-1,\operatorname{Y}}\operatorname{\circ}\phi_{3}(x)=\phi_{2}(x)$ , and for every $x\in A$ , $d(\phi(x),\phi_{3}(x))<\epsilon$ .

Proof.

By Gleason’s slice theorem $\operatorname{Y}$ is a locally trivial $\operatorname{Z}_{k}$ -bundle over $\operatorname{Y}_{k-1}$ (see [4, Proposition 2.5.2]). Hence, for each $y\in\operatorname{Y}_{k-1}$ there is $\delta_{y}>0$ such that the $\operatorname{Z}_{k}$ -bundle $\operatorname{Y}$ trivializes over the closed ball $\overline{B_{\delta_{y}}(y)}\subset\operatorname{Y}_{k-1}$ . Thus we have a $\operatorname{Z}_{k}$ -bundle isomorphism $\theta_{y}:\pi_{k-1}^{-1}\big{(}\overline{B_{\delta_{y}}(y)}\big{)}\to\overline{B_{\delta_{y}}(y)}\times\operatorname{Z}_{k}$ , $w\mapsto(\pi_{k-1}(w),z)$ , i.e., $\theta_{y}$ is a $\operatorname{Z}_{k}$ -equivariant homeomorphism (where the action of $\operatorname{Z}_{k}$ on $B_{\delta_{y}}(y)\times\operatorname{Z}_{k}$ is defined by $z^{\prime}\cdot(\pi_{k-1}(w),z)=(\pi_{k-1}(w),z+z^{\prime})$ ). By uniform continuity of $\theta_{y}^{-1}$ on the compact set $\overline{B_{\delta_{y}}(y)}\times\operatorname{Z}_{k}$ , there is $\delta_{y}^{\prime}>0$ such that, letting $d^{\prime\prime}$ denote the metric $d^{\prime}+d_{\operatorname{Z}_{k}}$ on $B_{\delta_{y}}(y)\times\operatorname{Z}_{k}$ (with $d_{\operatorname{Z}_{k}}$ the metric on $\operatorname{Z}_{k}$ ), we have $d^{\prime\prime}\big{(}\theta_{y}(w),\theta_{y}(w^{\prime})\big{)}\leq\delta_{y}^{\prime}$ $\Rightarrow$ $d(w,w^{\prime})\leq\epsilon$ .

Since the balls $B_{\delta_{y}/2}(y)$ cover $\operatorname{Y}_{k-1}$ , by compactness there is a finite subcover by balls $B_{\delta_{i}/2}(y_{i})$ , $i\in[M]$ , where $\delta_{i}=\delta_{y_{i}}$ . Thus $\operatorname{Y}$ trivializes over each ball $\overline{B_{\delta_{i}}(y_{i})}$ . Let $\delta<\frac{1}{2}\min\{\delta_{i},\delta^{\prime}_{y_{i}}:i\in[M]\}$ . Then, for each $x\in\operatorname{X}$ , there is $i\in[M]$ such that $d^{\prime}(\phi_{2}(x),y_{i})<\delta_{i}/2$ , whence if $x\in A$ then $d^{\prime}\big{(}\phi_{1}(x),y_{i}\big{)}\leq d^{\prime}\big{(}\phi_{1}(x),\phi_{2}(x)\big{)}+d^{\prime}\big{(}\phi_{2}(x),y_{i}\big{)}<\delta+\delta_{i}/2<\delta_{i}$ . In particular, for every $x\in A$ there is $i\in[M]$ such that $\phi_{1}(x),\phi_{2}(x)\in B_{\delta_{i}}(y_{i})$ .

Now we claim that for each $i\in[M]$ there is a Borel function $f_{i}:\phi_{2}^{-1}\big{(}B_{\delta_{i}/2}(y_{i})\big{)}\to\operatorname{Y}$ such that $\pi_{k-1}\operatorname{\circ}f_{i}=\phi_{2}$ and $d\big{(}f_{i}(x),\phi(x)\big{)}\leq\epsilon$ for all $x\in A\cap\phi_{2}^{-1}\big{(}B_{\delta_{i}/2}(y_{i})\big{)}$ . To see this, let $\theta_{i}=\theta_{y_{i}}:\pi_{k-1}^{-1}\big{(}B_{\delta_{i}}(y_{i})\big{)}\to B_{\delta_{i}}(y_{i})\times\operatorname{Z}_{k}$ , $y\mapsto(\pi_{k-1}(y),z)$ be the trivializing bundle isomorphism. Fix any $x\in\operatorname{X}$ , and let $i$ be such that $\phi_{2}(x)\in B_{\delta_{i}/2}(y_{i})$ . If $x\in A$ then, since $\phi_{1}(x)\in B_{\delta_{i}}(y_{i})$ , there is $z_{x}\in\operatorname{Z}_{k}$ such that $\theta_{i}\operatorname{\circ}\phi(x)=(\phi_{1}(x),z_{x})$ . In this case let $f_{i}(x):=\theta_{i}^{-1}(\phi_{2}(x),z_{x})$ . If $x\in\phi_{2}^{-1}\big{(}B_{\delta_{i}/2}(y_{i})\big{)}\setminus A$ , then we just let $f_{i}(x)=\operatorname{s}\operatorname{\circ}\phi_{2}(x)$ , where $\operatorname{s}:\operatorname{Y}_{k-1}\to\operatorname{Y}$ is a fixed Borel cross section for $\operatorname{Y}$ (which always exists for such bundles, see [4, Lemma 2.4.5]). Thus clearly $\pi_{k-1}\operatorname{\circ}f_{i}=\phi_{2}$ . We can see that $f_{i}$ is Borel as follows. Let $p_{2}$ denote the projection to the $\operatorname{Z}_{k}$ component on $B_{\delta_{i}}(y_{i})\times\operatorname{Z}_{k}$ . Let $g$ denote the function which “corrects” the $\operatorname{Z}_{k}$ component of $\operatorname{s}\operatorname{\circ}\phi_{2}(x)$ , namely $g:x\mapsto\theta_{i}\operatorname{\circ}\operatorname{s}\operatorname{\circ}\phi_{2}(x)+\big{(}p_{2}\operatorname{\circ}\theta_{i}\operatorname{\circ}\phi(x)-p_{2}\operatorname{\circ}\theta_{i}\operatorname{\circ}\operatorname{s}\operatorname{\circ}\phi_{2}(x)\big{)}=(\phi_{2}(x),z_{x})$ . Then $g$ is Borel, and $f_{i}(x)=\theta_{i}^{-1}\operatorname{\circ}g(x)$ for $x\in A$ , so $f_{i}$ is also Borel. 
Let us now confirm that $d\big{(}f_{i}(x),\phi(x)\big{)}\leq\epsilon$ for all $x\in A\cap\phi_{2}^{-1}\big{(}B_{\delta_{i}/2}(y_{i})\big{)}$ . Since $\theta_{i}\operatorname{\circ}f_{i}(x)$ and $\theta_{i}\operatorname{\circ}\phi(x)$ have the same $\operatorname{Z}_{k}$ -component $z_{x}$ (by construction of $f_{i}$ ), we have $d^{\prime\prime}(\theta_{i}\operatorname{\circ}f_{i}(x),\theta_{i}\operatorname{\circ}\phi(x))=d^{\prime}\big{(}\phi_{2}(x),\phi_{1}(x)\big{)}\leq\delta$ . Hence, since $\delta<\delta_{i}^{\prime}$ , we have $d(f_{i}(x),\phi(x))\leq\epsilon$ by the choice of $\delta_{i}^{\prime}$ above. This proves our claim.

We can greedily form a Borel partition of the domain of $\phi_{2}$ out of the sets $\phi_{2}^{-1}(B_{\delta_{i}/2}(y_{i}))$ . Thus with each $x$ in this domain we associate a unique $i\in[M]$ such that $\phi_{2}(x)\in B_{\delta_{i}/2}(y_{i})$ . We set $\phi_{3}(x):=f_{i}(x)$ , which makes $\phi_{3}$ a Borel function. ∎

Proof of Theorem 4.2.

We argue by induction on $k$ . The case $k=0$ is trivial (a non-empty 0-step nilspace is a one-point nilspace). For $k>0$ , let $\phi:\operatorname{X}\to\operatorname{Y}$ be a $(\delta,1)$ -quasimorphism relative to the given compatible metric $d$ . Note that letting $\tilde{d}$ be the corresponding $\operatorname{Z}_{k}$ -invariant metric on $\operatorname{Y}$ (see [4, Lemma 2.1.11]), the identity map on $\operatorname{Y}$ is uniformly continuous $(\operatorname{Y},d)\to(\operatorname{Y},\tilde{d})$ , so $\phi$ is a $(\tilde{\delta},1)$ -quasimorphism relative to $\tilde{d}$ for some $\tilde{\delta}(\delta)>0$ with $\tilde{\delta}=o(1)_{\delta\to 0}$ . We may therefore relabel $\tilde{d},\tilde{\delta}$ as $d,\delta$ and assume without loss of generality that $d$ was already $\operatorname{Z}_{k}$ -invariant. Now let $\phi_{1}^{\prime}=\pi_{k-1}\operatorname{\circ}\phi$ , and note that $\phi_{1}^{\prime}$ is also a $(\delta,1)$ -quasimorphism relative to the quotient metric $d^{\prime}$ on $\operatorname{Y}_{k-1}$ . By induction, for some positive $\delta_{1}=\delta_{1}(\delta)=o(1)_{\delta\to 0}$ , there exists a continuous morphism $\phi_{2}:\operatorname{X}\to\operatorname{Y}_{k-1}$ such that $d_{1}(\phi_{2},\phi_{1}^{\prime})\leq\delta_{1}$ . This implies by Markov’s inequality that for some Borel set $A\subset\operatorname{X}$ with $\mu_{\operatorname{X}}(A)\geq 1-\delta_{1}^{1/2}$ we have $d^{\prime}(\phi_{2}(x),\phi_{1}^{\prime}(x))\leq\delta_{1}^{1/2}$ for all $x\in A$ . Applying Lemma 4.9 with initial parameter $\delta_{2}>0$ , we obtain a Borel map $\phi_{3}:\operatorname{X}\to\operatorname{Y}$ such that $\phi_{2}=\pi_{k-1}\operatorname{\circ}\phi_{3}$ and $d\big{(}\phi(x),\phi_{3}(x)\big{)}\leq\delta_{2}=o(1)_{\delta\to 0}$ for every $x\in A$ , which implies that $d_{1}(\phi,\phi_{3})<\delta_{2}+\delta_{1}^{1/2}C$ , where $C$ is the diameter of $(\operatorname{Y},d)$ . 
Note that this implies that $\phi_{3}$ is also a $(\delta^{\prime},1)$ -quasimorphism for some positive $\delta^{\prime}=o(1)_{\delta\to 0}$ , and what we have gained compared to $\phi$ is that $\phi_{3}$ is a lift of the morphism $\phi_{2}$ (i.e. $\pi_{k-1}\operatorname{\circ}\phi_{3}=\phi_{2}$ ). We shall now use this to show that $\phi_{2}$ can in fact be lifted to a continuous morphism $\psi:\operatorname{X}\to\operatorname{Y}$ (not just to a quasimorphism like $\phi_{3}$ ).
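The two measure-theoretic estimates used in the preceding paragraph can be sketched as follows. The sketch assumes that $d_{1}$ denotes the $L^{1}$ -type distance $d_{1}(\phi,\psi)=\int_{\operatorname{X}}d\big{(}\phi(x),\psi(x)\big{)}\,\mathrm{d}\mu_{\operatorname{X}}(x)$ (this reading is an assumption, consistent with the way $d_{1}$ is used in Section 4.1), and it yields the bound on $d_{1}(\phi,\phi_{3})$ with a non-strict inequality.

```latex
% Markov's inequality applied to the function x -> d'(phi_2(x), phi_1'(x)):
\[
  \mu_{\mathrm{X}}\big(\{x \in \mathrm{X} : d'(\phi_{2}(x),\phi_{1}'(x)) > \delta_{1}^{1/2}\}\big)
  \le \frac{d_{1}(\phi_{2},\phi_{1}')}{\delta_{1}^{1/2}}
  \le \delta_{1}^{1/2},
\]
% so the complementary set A has measure at least 1 - delta_1^{1/2}.
% Splitting the integral over A and its complement, with C = diam(Y, d):
\[
  d_{1}(\phi,\phi_{3})
  = \int_{A} d\big(\phi(x),\phi_{3}(x)\big)\,\mathrm{d}\mu_{\mathrm{X}}(x)
  + \int_{\mathrm{X}\setminus A} d\big(\phi(x),\phi_{3}(x)\big)\,\mathrm{d}\mu_{\mathrm{X}}(x)
  \le \delta_{2} + \delta_{1}^{1/2}\, C.
\]
```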

Let $W$ be the fiber product $\{(x,y)\in\operatorname{X}\times\operatorname{Y}:\phi_{2}(x)=\pi_{k-1,\operatorname{Y}}(y)\}$ . This is a compact sub-nilspace of the product nilspace $\operatorname{X}\times\operatorname{Y}$ , i.e. $W$ is a $k$ -step compact nilspace if we equip it with the cubes $\operatorname{c}$ on the product nilspace $\operatorname{X}\times\operatorname{Y}$ such that $\operatorname{c}$ takes values in $W$ (see the proof of [5, Lemma 4.2], applied taking $\psi_{1}$ in that proof to be $\pi_{k-1,\operatorname{Y}}$ here). Note that this $k$ -step nilspace $W$ is an extension of degree $k$ of $\operatorname{X}$ by the abelian group $\operatorname{Z}_{k}(\operatorname{Y})$ , because the action of $\operatorname{Z}_{k}(\operatorname{Y})$ on the $\operatorname{Y}$ -component of $W$ is transitive on each fiber of the projection $\pi:W\to\operatorname{X}$ , $(x,y)\mapsto x$ (recall [3, Definition 3.3.13]).

The map $\phi_{3}$ induces a Borel cross section $\operatorname{s}:\operatorname{X}\to W$ , $x\mapsto(x,\phi_{3}(x))$ . With this cross section we can associate a cocycle following [3, Lemma 3.3.21], namely the cocycle $\rho_{\operatorname{s}}:\operatorname{C}^{k+1}(\operatorname{X})\to\operatorname{Z}_{k}(\operatorname{Y})$ defined by $\operatorname{c}\mapsto\sigma_{k+1}(\operatorname{s}\operatorname{\circ}\operatorname{c}-\operatorname{c}^{\prime})$ for any cube $\operatorname{c}^{\prime}\in\operatorname{C}^{k+1}(W)$ such that $\pi\operatorname{\circ}\operatorname{c}^{\prime}=\operatorname{c}$ . It then follows from the definitions that $\rho_{\operatorname{s}}(\operatorname{c})=\sigma_{k+1}(\phi_{3}\operatorname{\circ}\operatorname{c}-\operatorname{c}^{\prime\prime})$ for any $\operatorname{c}^{\prime\prime}\in\operatorname{C}^{k+1}(\operatorname{Y})$ such that $\pi_{k-1,\operatorname{Y}}\operatorname{\circ}\operatorname{c}^{\prime\prime}=\phi_{2}\operatorname{\circ}\operatorname{c}$ . Since $d_{1}(\phi,\phi_{3})<\delta_{2}+\delta_{1}^{1/2}C$ , and $\phi$ is a $(\delta,1)$ -quasimorphism, we deduce using Lemma 4.7 that $d_{1}(\rho_{\operatorname{s}},0)<\delta_{3}$ , where $\delta_{3}>0$ tends to $0$ as $\delta\to 0$ (recall that $\delta_{1},\delta_{2}$ are both $o(1)_{\delta\to 0}$ ). By Proposition 4.4 (applicable once $\delta$ , and hence $\delta_{3}$ , is sufficiently small), $\rho_{\operatorname{s}}$ is a coboundary, so $W$ is a split extension of $\operatorname{X}$ , whence there is a Borel morphism $\psi:\operatorname{X}\to\operatorname{Y}$ such that $\pi_{k-1}\operatorname{\circ}\psi=\phi_{2}$ , and $\psi$ is then continuous by [4, Theorem 2.4.6].

Let $\phi_{4}:\operatorname{X}\to\mathcal{D}_{k}(\operatorname{Z}_{k}(\operatorname{Y}))$ , $x\mapsto\phi_{3}(x)-\psi(x)$ , where the subtraction here is enabled by the fact that $\phi_{3}(x),\psi(x)$ lie in the same fiber of $\pi_{k-1}$ in $\operatorname{Y}$ (every such fiber is an affine copy of the group $\operatorname{Z}_{k}(\operatorname{Y})$ ; see [3, Corollary 3.2.16]). Note that $\phi_{4}$ is a $(\delta_{4},1)$ -quasimorphism for some positive $\delta_{4}=\delta_{4}(\delta)=o(1)_{\delta\to 0}$ . By Lemma 4.8 there is a continuous morphism $\phi_{5}:\operatorname{X}\to\mathcal{D}_{k}(\operatorname{Z}_{k})$ such that $d_{1}(\phi_{4}-\phi_{5},0)<\delta_{5}$ for some positive $\delta_{5}=\delta_{5}(\delta)=o(1)_{\delta\to 0}$ . Now let $\phi^{\prime}:\operatorname{X}\to\operatorname{Y}$ , $x\mapsto\psi(x)+\phi_{5}(x)$ . Then $\phi^{\prime}$ is a continuous morphism and $d_{1}(\phi,\phi^{\prime})\leq d_{1}(\phi,\psi+\phi_{4})+d_{1}(\psi+\phi_{4},\phi^{\prime})=d_{1}(\phi,\phi_{3})+d_{1}(\phi_{4}-\phi_{5},0)<\delta_{2}+\delta_{1}^{1/2}C+\delta_{5}$ , which is less than $\epsilon$ for $\delta$ sufficiently small. ∎

5. Proof of the regularity and inverse theorems

Recall that given a Polish space $\operatorname{Y}$ , the space $\mathcal{P}(\operatorname{Y})$ of Borel probability measures on $\operatorname{Y}$ equipped with the weak topology is metrizable, and is in fact a Polish space (see [29, Theorems (17.23) and (17.19)]). Given a nilspace morphism $\phi:\operatorname{X}\to\operatorname{Y}$ and $n\in\mathbb{N}$ , we denote by $\phi^{\llbracket n\rrbracket}$ the map $\operatorname{C}^{n}(\operatorname{X})\to\operatorname{C}^{n}(\operatorname{Y})$ , $\operatorname{c}\mapsto\phi\operatorname{\circ}\operatorname{c}$ .

In the decomposition given by Theorem 1.5, the structured part is guaranteed to have the following useful property.

Definition 5.1 (Balance).

Let $\operatorname{Y}$ be a $k$ -step compact nilspace. For each $n\in\mathbb{N}$ fix a metric $d_{n}$ on the space $\mathcal{P}(\operatorname{C}^{n}(\operatorname{Y}))$ . Let $\operatorname{X}$ be a compact nilspace, and let $\phi:\operatorname{X}\to\operatorname{Y}$ be a continuous morphism. Then for $b>0$ we say that $\phi$ is $b$ -balanced if for every $n\leq 1/b$ we have $d_{n}\big{(}\mu_{\operatorname{C}^{n}(\operatorname{X})}\operatorname{\circ}(\phi^{\llbracket n\rrbracket})^{-1},\mu_{\operatorname{C}^{n}(\operatorname{Y})}\big{)}\leq b$ . A nilspace polynomial $F\operatorname{\circ}\phi$ is $b$ -balanced if the morphism $\phi$ is $b$ -balanced.

The balance property is an approximate form of multidimensional equidistribution: the image of $\phi^{\llbracket n\rrbracket}$ , $n\in[1/b]$ , tends toward being equidistributed in $\operatorname{C}^{n}(\operatorname{Y})$ as $b$ decreases. This property is useful in problems involving averages of functions over certain configurations. It appeared in [38], and is related to a property of approximate irrationality from [16]. In fact, from results in the latter paper it follows that, for nilsequences, high irrationality implies $b$ -balance for small $b$ (see [16, Theorem 3.6], or [6, Theorem 4.1]).

Proof of Theorem 1.5.

We begin by noting that it suffices to prove the result for cfr coset nilspaces. Indeed, if $\operatorname{X}$ is an inverse limit of such nilspaces, then the preimages of the Borel $\sigma$ -algebras on these spaces under the limit maps form an increasing sequence of $\sigma$ -algebras $\mathcal{B}_{i}$ on $\operatorname{X}$ such that $\bigvee_{i\in\mathbb{N}}\mathcal{B}_{i}=_{\mu_{\operatorname{X}}}\mathcal{B}_{\operatorname{X}}$ , the Borel $\sigma$ -algebra on $\operatorname{X}$ . By standard results $\mathbb{E}(f|\mathcal{B}_{i})\to f$ in $L^{1}$ as $i\to\infty$ . This implies (using [7, Lemma 2.17]) that given any $\epsilon>0$ , there is a limit map $\psi:\operatorname{X}\to\operatorname{X}^{\prime}$ , i.e. a continuous fibration onto a cfr coset nilspace $\operatorname{X}^{\prime}$ , and a 1-bounded Borel function $f^{\prime}:\operatorname{X}^{\prime}\to\mathbb{C}$ , such that $h:=f-f^{\prime}\operatorname{\circ}\psi$ satisfies $\|h\|_{L^{1}}\leq\epsilon/2$ . Let $f^{\prime}=f^{\prime}_{s}+f^{\prime}_{e}+f^{\prime}_{r}$ be the decomposition for $f^{\prime}$ applied with initial parameter $\epsilon/2$ and with $\mathcal{D}^{\prime}(\epsilon,m):=\mathcal{D}(2\epsilon,m)$ , and let $f_{s}=f^{\prime}_{s}\operatorname{\circ}\psi$ , $f_{e}=h+f_{e}^{\prime}\operatorname{\circ}\psi$ , $f_{r}=f^{\prime}_{r}\operatorname{\circ}\psi$ . We have (using that $\psi$ is a Haar-measure-preserving morphism [4, Corollary 2.2.7]) that $f=f_{s}+f_{e}+f_{r}$ is a valid decomposition for $\epsilon$ , $\mathcal{D}$ .
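Part of the verification that the lifted decomposition is valid for $\epsilon$ , $\mathcal{D}$ can be sketched as follows. The sketch assumes that the error term in Theorem 1.5 is controlled in $L^{1}$ by the initial parameter (as used later in the proof of Theorem 5.2), and it only treats the identity $f=f_{s}+f_{e}+f_{r}$ , the $L^{1}$ bound on $f_{e}$ , and the rescaling of $\mathcal{D}$ ; the remaining properties are not checked here.

```latex
% Requires amsmath. Uses that psi preserves the Haar measures.
\begin{align*}
  f_{s} + f_{e} + f_{r}
    &= (f'_{s} + f'_{e} + f'_{r}) \circ \psi + h
     = f' \circ \psi + h = f, \\
  \|f_{e}\|_{L^{1}(\mathrm{X})}
    &\le \|h\|_{L^{1}(\mathrm{X})} + \|f'_{e} \circ \psi\|_{L^{1}(\mathrm{X})}
     = \|h\|_{L^{1}(\mathrm{X})} + \|f'_{e}\|_{L^{1}(\mathrm{X}')}
     \le \tfrac{\epsilon}{2} + \tfrac{\epsilon}{2} = \epsilon, \\
  \mathcal{D}'(\epsilon/2,\, m)
    &= \mathcal{D}(\epsilon,\, m).
\end{align*}
```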

To prove the theorem for cfr coset nilspaces, we argue by contradiction. Suppose that the theorem fails for some $\epsilon>0$ . This means that there is a sequence of functions $(f_{i})_{i\in\mathbb{N}}$ where $f_{i}:\operatorname{X}_{i}\to\mathbb{C}$ is Borel measurable on a compact coset nilspace $\operatorname{X}_{i}$ with $|f_{i}|\leq 1$ , such that $f_{i}$ does not satisfy the statement with $\epsilon$ and $N=i$ . Let $\omega$ be a non-principal ultrafilter on $\mathbb{N}$ and let $\mathbf{X}$ be the ultraproduct $\prod_{i\to\omega}\operatorname{X}_{i}$ equipped with the Loeb probability measure $\lambda^{\prime}$ on $\mathcal{L}_{\mathbf{X}}$ . Let $f:\mathbf{X}\to\mathbb{C}$ be the Loeb measurable function $\lim_{\omega}f_{i}$ , and let $\mathcal{B}_{0}$ be the separable sub- $\sigma$ -algebra of $\mathcal{L}_{\mathbf{X}}$ generated by $f$ .

By Proposition 3.12 there is a $\sigma$ -algebra $\mathcal{B}^{\prime}\subset\mathcal{L}_{\mathbf{X}}$ including $\mathcal{B}_{0}$ such that the probability space $\varOmega^{\prime}=(\mathbf{X},\mathcal{B}^{\prime},\lambda^{\prime})$ is separable, and such that the sequence of measures $\mu^{\llbracket n\rrbracket}$ on $(\mathbf{X}^{\llbracket n\rrbracket},{\mathcal{B}^{\prime}}^{\llbracket n\rrbracket})$ form a cubic coupling. By [29, (17.44), iv)], the measure algebra of $\varOmega^{\prime}$ is isomorphic to the measure algebra of a Borel probability space $\varOmega=(\Omega,\mathcal{B},\lambda)$ . By [11, 343B(vi)] (using [10, 211L(a)-(c)] and [11, 324K(b)]) there is a mod 0 isomorphism $\theta:\Omega^{\prime}\to\Omega$ realizing this measure-algebra isomorphism. Moreover, by [7, Proposition A.11] the images of the measures $\mu^{\llbracket n\rrbracket}$ under the maps $\theta^{\llbracket n\rrbracket}$ form a cubic coupling on $\varOmega$ . From now on we identify $f$ and $f\operatorname{\circ}\theta^{-1}$ , so we view $f$ as a function on $\Omega$ .

Let $\mathcal{F}_{k}$ be the $k$ -th Fourier $\sigma$ -algebra on $\Omega$ (see [7, Definition 3.18]). Then we have $f=f_{s}+f_{r}$ , where $f_{s}=\mathbb{E}(f|\mathcal{F}_{k})$ , and $f_{r}=f-\mathbb{E}(f|\mathcal{F}_{k})$ satisfies $\|f_{r}\|_{U^{k+1}}=0$ . We now apply the structure theorem for cubic couplings [7, Theorem 4.2]. More precisely, applying this theorem to the above cubic coupling $\big{(}\varOmega,(\mu^{\llbracket n\rrbracket})_{n\geq 0}\big{)}$ , we obtain a $k$ -step compact nilspace $\operatorname{Y}$ , and a measurable map $\gamma_{k}:\Omega\to\operatorname{Y}$ such that $\gamma_{k}^{\llbracket n\rrbracket}$ takes $\mu^{\llbracket n\rrbracket}$ to the Haar measure $\mu_{\operatorname{C}^{n}(\operatorname{Y})}$ for each $n\geq 0$ . Moreover, this nilspace $\operatorname{Y}$ is related to $\mathcal{F}_{k}$ in the sense that, letting $\mathcal{B}_{\operatorname{Y}}$ denote the Borel $\sigma$ -algebra on $\operatorname{Y}$ , we have that the $\sigma$ -algebra $\gamma_{k}^{-1}(\mathcal{B}_{\operatorname{Y}})$ equals $\mathcal{F}_{k}$ modulo null sets (see [7, Lemma 3.42]). Then by [7, Lemma 2.17] there is a Borel function $g:\operatorname{Y}\to\mathbb{C}$ such that $f_{s}=_{\lambda}g\operatorname{\circ}\gamma_{k}$ .

By [4, Theorem 2.7.3], the nilspace $\operatorname{Y}$ is an inverse limit of $k$ -step cfr nilspaces $\operatorname{Y}_{j}$ , $j\in\mathbb{N}$ , where the limit maps $\psi_{j}:\operatorname{Y}\to\operatorname{Y}_{j}$ are continuous fibrations. Let $\mathcal{Y}_{j}$ denote the $\sigma$ -algebra on $\operatorname{Y}$ generated by $\psi_{j}$ . Arguing as in the first paragraph of the proof, there is $j\in\mathbb{N}$ such that $g_{j}:=\mathbb{E}(g|\mathcal{Y}_{j})$ satisfies $\|g-g_{j}\|_{1}\leq\epsilon/3$ . For this $j$ let $\gamma=\psi_{j}\operatorname{\circ}\gamma_{k}:\Omega\to\operatorname{Y}_{j}$ . As fibrations take cube sets onto cube sets in a measure-preserving way, the map $\gamma$ has the same measure-preserving properties as $\gamma_{k}$ . Furthermore, by Lusin’s theorem combined with [12, Theorem 1], there is a continuous function $h:\operatorname{Y}_{j}\to\mathbb{C}$ with $|h|\leq 1$ and with finite Lipschitz constant $C$ such that $\|g_{j}-h\|_{L^{1}(\operatorname{Y})}\leq\epsilon/3$ . Let $q=h\operatorname{\circ}\gamma:\Omega\to\mathbb{C}$ . The measure-preserving properties of $\gamma_{k}$ and $\psi_{j}$ imply that $\|f_{s}-q\|_{L^{1}(\Omega)}=\|g-h\operatorname{\circ}\psi_{j}\|_{L^{1}(\operatorname{Y})}\leq 2\epsilon/3$ . Let $f_{e}=f_{s}-q=f-q-f_{r}$ .
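The bound $\|f_{s}-q\|_{L^{1}(\Omega)}\leq 2\epsilon/3$ can be unpacked as follows. In this sketch $g_{j}$ is regarded as a function on $\operatorname{Y}$ , and the approximation of $g_{j}$ by $h$ is read as $\|g_{j}-h\operatorname{\circ}\psi_{j}\|_{L^{1}(\operatorname{Y})}\leq\epsilon/3$ ; this identification of $h$ with $h\operatorname{\circ}\psi_{j}$ is an assumption about the intended meaning.

```latex
% Requires amsmath. Uses f_s = g o gamma_k (a.e.), q = h o psi_j o gamma_k,
% and that gamma_k maps the measure on Omega to the Haar measure on Y.
\begin{align*}
  \|f_{s} - q\|_{L^{1}(\Omega)}
    &= \|(g - h \circ \psi_{j}) \circ \gamma_{k}\|_{L^{1}(\Omega)}
     = \|g - h \circ \psi_{j}\|_{L^{1}(\mathrm{Y})} \\
    &\le \|g - g_{j}\|_{L^{1}(\mathrm{Y})}
       + \|g_{j} - h \circ \psi_{j}\|_{L^{1}(\mathrm{Y})}
     \le \tfrac{\epsilon}{3} + \tfrac{\epsilon}{3}
     = \tfrac{2\epsilon}{3}.
\end{align*}
```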

Next, we show that there are continuous morphisms $\phi_{i}:\operatorname{X}_{i}\to\operatorname{Y}_{j}$ , $i\in\mathbb{N}$ , such that $\gamma=_{\lambda}\lim_{\omega}\phi_{i}$ . Note that since $\gamma$ is $\mathcal{L}_{\mathbf{X}}$ -measurable, by [35, Corollary 5.1] it has a lifting, i.e. there are Borel maps $g_{i}:\operatorname{X}_{i}\to\operatorname{Y}_{j}$ , $i\in\mathbb{N}$ such that $\gamma=_{\lambda}\lim_{\omega}g_{i}$ . This together with the measure-preserving property of $\gamma^{\llbracket k+1\rrbracket}$ implies that the preimage of $\operatorname{C}^{k+1}(\operatorname{Y}_{j})$ under $(\lim_{\omega}g_{i})^{\llbracket k+1\rrbracket}$ has $\mu^{\llbracket k+1\rrbracket}$ -probability 1. For each $i$ let $\delta_{i}=\inf\{t:g_{i}\textrm{ is a }(t,1)\textrm{-quasimorphism}\}\in[0,1]$ . Then $\lim_{\omega}\delta_{i}=0$ . Indeed, otherwise for some $\delta>0$ the set $S_{1}=\{i\in\mathbb{N}:g_{i}\textrm{ is not a }(\delta,1)\textrm{-quasimorphism}\}$ is in $\omega$ . Then for each $i\in S_{1}$ there is a Borel set $B_{i}\subset\operatorname{C}^{k+1}(\operatorname{X}_{i})$ of measure at least $\delta$ such that for every $\operatorname{c}\in B_{i}$ the image $g_{i}\operatorname{\circ}\operatorname{c}$ is $\delta$ -separated from cubes, that is for every $\operatorname{c}^{\prime}\in\operatorname{C}^{k+1}(\operatorname{Y}_{j})$ we have $\max_{v\in\llbracket k+1\rrbracket}d_{\operatorname{Y}_{j}}(g_{i}\operatorname{\circ}\operatorname{c}(v),\operatorname{c}^{\prime}(v))\geq\delta$ . Since $S_{1}\in\omega$ , we can take $B=\prod_{i\to\omega}B_{i}\subset\Omega$ , and we have $\mu^{\llbracket k+1\rrbracket}(B)\geq\delta$ . Then, for every $\operatorname{c}\in B$ the composition $(\lim_{\omega}g_{i})\operatorname{\circ}\operatorname{c}$ is also $\delta$ -separated from cubes, so it cannot be in $\operatorname{C}^{k+1}(\operatorname{Y}_{j})$ . 
This contradicts the above fact that $(\lim_{\omega}g_{i})^{\llbracket k+1\rrbracket}$ maps almost every $\operatorname{c}\in\operatorname{C}^{k+1}(\Omega)$ into $\operatorname{C}^{k+1}(\operatorname{Y}_{j})$ , so we indeed have $\lim_{\omega}\delta_{i}=0$ . Hence there is a sequence $(\delta_{i}^{\prime}>0)_{i\in\mathbb{N}}$ with $\lim_{\omega}\delta_{i}^{\prime}=0$ such that $g_{i}$ is a $(\delta_{i}^{\prime},1)$ -quasimorphism for each $i$ . Theorem 4.2 implies that for each $i$ there is a continuous morphism $\phi_{i}:\operatorname{X}_{i}\to\operatorname{Y}_{j}$ such that $\mu_{\operatorname{X}_{i}}(\{x\in\operatorname{X}_{i}:\phi_{i}(x)\approx_{\epsilon_{i}}g_{i}(x)\})\geq 1-\epsilon_{i}$ , where $\lim_{\omega}\epsilon_{i}=0$ . Hence $\lim_{\omega}g_{i}=_{\lambda}\lim_{\omega}\phi_{i}$ , as required. Indeed, otherwise we have $\lambda(\lim_{\omega}g_{i}\neq\lim_{\omega}\phi_{i})>0$ , which implies (using monotonicity of $\lambda$ ) that $\lambda(\lim_{\omega}g_{i}\approx_{\eta}\lim_{\omega}\phi_{i})<1-\eta$ for some $\eta>0$ . But this event $\lim_{\omega}g_{i}\approx_{\eta}\lim_{\omega}\phi_{i}$ is $\big{\{}(x_{i})\in\Omega:\{i:g_{i}(x_{i})\approx_{\eta}\phi_{i}(x_{i})\}\in\omega\big{\}}$ , and this includes the set $\prod_{i\to\omega}\big{\{}x_{i}\in\operatorname{X}_{i}:g_{i}(x_{i})\approx_{\epsilon_{i}}\phi_{i}(x_{i})\big{\}}$ (using that, since $\lim_{\omega}\epsilon_{i}=0$ , we have $\epsilon_{i}<\eta$ for a set of integers $i$ belonging to $\omega$ ); but the latter set has $\lambda$ -measure 1, since $\mu_{\operatorname{X}_{i}}(\{x\in\operatorname{X}_{i}:\phi_{i}(x)\approx_{\epsilon_{i}}g_{i}(x)\})\geq 1-\epsilon_{i}$ , and this contradicts the previous inequality, as $\eta>0$ .
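The measure computation in the last step of the preceding paragraph can be sketched as follows. The name $E_{i}$ is introduced here only for illustration, and the sketch uses the standard fact that the Loeb measure of an ultraproduct of measurable sets is the ultralimit of their measures.

```latex
% E_i is a hypothetical name for the set on which phi_i and g_i are epsilon_i-close.
\[
  E_{i} := \{x \in \mathrm{X}_{i} : \phi_{i}(x) \approx_{\epsilon_{i}} g_{i}(x)\},
  \qquad
  \lambda\Big(\prod_{i \to \omega} E_{i}\Big)
  = \lim_{\omega} \mu_{\mathrm{X}_{i}}(E_{i})
  \ge \lim_{\omega}\,(1 - \epsilon_{i})
  = 1.
\]
```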

There is a sequence $(b_{i}>0)_{i\in\mathbb{N}}$ such that $\phi_{i}$ is $b_{i}$ -balanced for all $i$ and $\lim_{\omega}b_{i}=0$ . Indeed, otherwise there exist $b>0$ and $S^{\prime}_{2}\in\omega$ such that $\phi_{i}$ is not $b$ -balanced for every $i\in S^{\prime}_{2}$ . Since $[1/b]$ is finite, there is then $S_{2}\subset S^{\prime}_{2}$ with $S_{2}\in\omega$ , and $n\in[1/b]$ , with $d_{n}\big{(}\mu_{\operatorname{C}^{n}(\operatorname{X}_{i})}\operatorname{\circ}(\phi_{i}^{\llbracket n\rrbracket})^{-1},\mu_{\operatorname{C}^{n}(\operatorname{Y}_{j})}\big{)}\geq b$ for all $i\in S_{2}$ . As $\gamma^{\llbracket n\rrbracket}$ is measure-preserving, we have $\lim_{\omega}d_{n}\big{(}\mu_{\operatorname{C}^{n}(\operatorname{X}_{i})}\operatorname{\circ}(\phi_{i}^{\llbracket n\rrbracket})^{-1},\mu_{\operatorname{C}^{n}(\operatorname{Y}_{j})}\big{)}=\lim_{\omega}d_{n}\big{(}\mu_{\operatorname{C}^{n}(\operatorname{X}_{i})}\operatorname{\circ}(\phi_{i}^{\llbracket n\rrbracket})^{-1},\mu^{\llbracket n\rrbracket}\operatorname{\circ}(\gamma^{\llbracket n\rrbracket})^{-1}\big{)}=0$ (using Lemma B.5), a contradiction.

For each $i$ let $f_{s,i}=h\operatorname{\circ}\phi_{i}$ , and apply [35, Corollary 5.1] again to obtain a sequence of Borel functions $(f_{r,i}:\operatorname{X}_{i}\to\mathbb{C})_{i\in\mathbb{N}}$ such that $\lim_{\omega}f_{r,i}=_{\lambda}f_{r}$ . Let $f_{e,i}=f_{i}-f_{s,i}-f_{r,i}$ . Since $\lim_{\omega}g_{i}=_{\lambda}\lim_{\omega}\phi_{i}$ , we have $\lim_{\omega}f_{s,i}=_{\lambda}q$ , whence $\lim_{\omega}f_{e,i}=_{\lambda}f_{e}$ . We also have $\lim_{\omega}\|f_{r,i}\|_{U^{k+1}}=\|f_{r}\|_{U^{k+1}}=0$ . Since $q$ and $f_{e}$ are both $\mathcal{F}_{k}$ -measurable, we have $\langle f_{r},q\rangle$ and $\langle f_{r},f_{e}\rangle$ both 0, and therefore $\lim_{\omega}\langle f_{r,i},f_{s,i}\rangle=\langle f_{r},q\rangle=0$ and $\lim_{\omega}\langle f_{r,i},f_{e,i}\rangle=\langle f_{r},f_{e}\rangle=0$ . Let $m$ be the maximum of $C$ and the complexity of $\operatorname{Y}_{j}$ . Combining the properties in this paragraph and the previous one, we deduce that there is a set $S\in\omega$ such that for every $i\in S$ the decomposition $f_{i}=f_{s,i}+f_{r,i}+f_{e,i}$ satisfies the properties in the theorem with this value of $m$ , the initial $\epsilon$ , and the corresponding value $\mathcal{D}(\epsilon,m)$ . This gives a contradiction for $i\in S$ with $i\geq m$ . ∎

We deduce the following inverse theorem, which clearly implies Theorem 1.6.

Theorem 5.2.

Let $k\in\mathbb{N}$ , and let $b:\mathbb{R}_{>0}\to\mathbb{R}_{>0}$ be an arbitrary function. For every $\delta\in(0,1]$ there is $M>0$ such that for every compact nilspace $\operatorname{X}$ that is an inverse limit of cfr coset nilspaces, and every 1-bounded Borel function $f:\operatorname{X}\to\mathbb{C}$ such that $\|f\|_{U^{k+1}}\geq\delta$ , for some $m\leq M$ there is a $b(m)$ -balanced 1-bounded nilspace-polynomial $F\operatorname{\circ}\phi$ of degree $k$ and complexity at most $m$ such that $\langle f,F\operatorname{\circ}\phi\rangle\geq\delta^{2^{k+1}}/2$ .

Proof.

We apply Theorem 1.5 with $\epsilon=\epsilon(\delta)>0$ and $\mathcal{D}$ to be fixed later. By property $(ii)$ in the theorem and the fact that $|f_{s}|\leq 1$ , we have $|\langle f_{e},f_{s}\rangle|\leq\epsilon$ , and by property $(iii)$ we have $|\langle f_{r},f_{s}\rangle|\leq\mathcal{D}(\epsilon,m)$ . Therefore, taking the inner product of $f_{s}$ with each side of the decomposition $f=f_{s}+f_{e}+f_{r}$ , we obtain $\langle f,f_{s}\rangle\geq\langle f_{s},f_{s}\rangle-\epsilon-\mathcal{D}(\epsilon,m)$ .

We also have $\|f_{e}\|_{L^{1}}\leq\epsilon$ and $|f_{e}|\leq 3$ , whence $\|f_{e}\|_{U^{k+1}}\leq(3^{2^{k+1}-2}\epsilon^{2})^{1/2^{k+1}}\leq 3\epsilon^{1/2^{k}}$ . Combining this with the above decomposition of $f$ and the bound $\|f_{r}\|_{U^{k+1}}\leq\mathcal{D}(\epsilon,m)$ , we deduce that $\|f_{s}\|_{U^{k+1}}\geq\delta-3\epsilon^{1/2^{k}}-\mathcal{D}(\epsilon,m)$ . This together with $|f_{s}|\leq 1$ implies that $\langle f_{s},f_{s}\rangle=\|f_{s}\|_{L^{2}}^{2}\geq\|f_{s}\|_{U^{k+1}}^{2^{k+1}}\geq(\delta-3\epsilon^{1/2^{k}}-\mathcal{D}(\epsilon,m))^{2^{k+1}}$ .
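As a sanity check of the inequality $\|f\|_{L^{2}}^{2}\geq\|f\|_{U^{k+1}}^{2^{k+1}}$ for 1-bounded functions used above, the following Python sketch evaluates both sides in the simplest case $k=1$ on a cyclic group $\mathbb{Z}_{N}$, using the standard averaging formula for the Gowers $U^{2}$ norm. The names `u2_fourth_power` and `l2_squared` and the two test functions are illustrative choices, not objects from the paper, and the brute-force average is only practical for small $N$.

```python
import cmath

def u2_fourth_power(f):
    """Return ||f||_{U^2}^4 for f given as a list of complex values on Z_N."""
    N = len(f)
    total = 0j
    for x in range(N):
        for h1 in range(N):
            for h2 in range(N):
                # Average of f over the 2-cube (x, x+h1, x+h2, x+h1+h2),
                # with conjugates on the vertices of odd weight.
                total += (f[x]
                          * f[(x + h1) % N].conjugate()
                          * f[(x + h2) % N].conjugate()
                          * f[(x + h1 + h2) % N])
    return (total / N ** 3).real

def l2_squared(f):
    """Return ||f||_{L^2}^2 with respect to the uniform probability measure."""
    return sum(abs(z) ** 2 for z in f) / len(f)

N = 7
# A quadratic phase: modulus 1 everywhere, but small U^2 norm.
quadratic_phase = [cmath.exp(2j * cmath.pi * x * x / N) for x in range(N)]
# An arbitrary 1-bounded real-valued function.
bump = [complex(1.0) if x < 3 else complex(-0.5) for x in range(N)]

for name, f in (("quadratic phase", quadratic_phase), ("bump", bump)):
    print(name, u2_fourth_power(f), l2_squared(f))
```

For the quadratic phase the summand reduces to $e(2h_{1}h_{2}/N)$, so the fourth power of the $U^{2}$ norm is $1/N$, well below the squared $L^{2}$ norm, which equals 1.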

We now fix $\epsilon=\big{(}\frac{\delta}{3}(1-(\frac{5}{6})^{1/2^{k+1}})\big{)}^{2^{k}}$ , and choose $\mathcal{D}$ so that the following hold: firstly, so that $\mathcal{D}(\epsilon,m)\leq b(m)$ ; secondly, so that by the last inequality in the previous paragraph we have $\langle f_{s},f_{s}\rangle\geq 2\delta^{2^{k+1}}/3$ ; finally, so that $\epsilon+\mathcal{D}(\epsilon,m)\leq\delta^{2^{k+1}}/6$ , which implies, by the last inequality in the first paragraph, that $\langle f,f_{s}\rangle\geq\delta^{2^{k+1}}/2$ . We can then let $M$ be the number $N$ given by Theorem 1.5 for this choice of $\epsilon$ and $\mathcal{D}$ . ∎
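The role of the specific value of $\epsilon$ can be checked numerically. The sketch below (function names are ours, chosen for illustration) confirms that with this $\epsilon$ one has $3\epsilon^{1/2^{k}}=\delta\big(1-(5/6)^{1/2^{k+1}}\big)$, so that the lower bound $(\delta-3\epsilon^{1/2^{k}}-\mathcal{D})^{2^{k+1}}$ equals $\tfrac{5}{6}\delta^{2^{k+1}}$ when $\mathcal{D}=0$. This leaves room to take $\mathcal{D}(\epsilon,m)>0$ small enough for the bound to stay above $2\delta^{2^{k+1}}/3$. The sketch only addresses this second condition on $\mathcal{D}$.

```python
def epsilon(delta, k):
    """The value of epsilon fixed in the proof of Theorem 5.2."""
    root = (5 / 6) ** (1 / 2 ** (k + 1))
    return ((delta / 3) * (1 - root)) ** (2 ** k)

def structured_energy_lower_bound(delta, k, d):
    """Lower bound (delta - 3*eps^(1/2^k) - d)^(2^(k+1)) for <f_s, f_s>."""
    eps = epsilon(delta, k)
    return (delta - 3 * eps ** (1 / 2 ** k) - d) ** (2 ** (k + 1))

for delta in (0.1, 0.5, 1.0):
    for k in (1, 2, 3):
        ratio = structured_energy_lower_bound(delta, k, 0.0) / delta ** (2 ** (k + 1))
        print(delta, k, ratio)  # ratio of the bound to delta^(2^(k+1))
```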

6. The case of simple abelian groups

In this final section we use Theorem 1.5 to prove Theorem 1.7.

Recall that Definition 5.1 presupposes that for each $n$ a metric has been fixed on the space $\mathcal{P}(\operatorname{C}^{n}(\operatorname{X}))$ of Borel probabilities on $\operatorname{C}^{n}(\operatorname{X})$ (equipped with the weak topology). For the proof of Theorem 1.7 it is convenient to fix the metrics in a process by induction on the step $k$ of $\operatorname{X}$ as follows: having already defined a metric $d_{n,k-1}$ on $\mathcal{P}(\operatorname{C}^{n}(\operatorname{X}_{k-1}))$ , we first let $d_{n,k}^{\prime}$ be a metric on $\mathcal{P}(\operatorname{C}^{n}(\operatorname{X}))$ defined the standard way (see [29, Theorem (17.19)]), and then we define $d_{n,k}$ for $\mu,\nu\in\mathcal{P}(\operatorname{C}^{n}(\operatorname{X}))$ by

[TABLE]

This construction is convenient for the proof because if $\phi$ is $b$ -balanced relative to the metrics $d_{n,k}$ , then $\pi_{k-1}\operatorname{\circ}\phi$ is automatically $b$ -balanced relative to the metrics $d_{n,k-1}$ . For the remainder of this section, we suppose that we have fixed what we call a factor-consistent metrization for cubic measures on cfr nilspaces, by which we mean the result of the following process: first we fix a sequence of metrics $d_{n,1}$ on $\mathcal{P}(\operatorname{C}^{n}(\operatorname{X}))$ ( $n\geq 0$ ) for each $1$ -step cfr nilspace $\operatorname{X}$ , then we fix metrics $d_{n,2}$ on $\mathcal{P}(\operatorname{C}^{n}(\operatorname{X}))$ for each $2$ -step cfr nilspace $\operatorname{X}$ using (5) as above, and so on for increasing $k$ .

In the proof of Theorem 1.7, a key ingredient is the following result, which ensures that the morphism that we obtain from Theorem 5.2 takes values in a toral nilspace.

Theorem 6.1.

Fix any complexity notion and any factor-consistent metrization for cubic measures on cfr nilspaces. Then for every $M>0$ there exist $b>0$ and $p_{0}>0$ with the following property. Let $\operatorname{Y}$ be a $k$ -step cfr nilspace of complexity at most $M$ , and let $\phi:\mathbb{Z}_{p}\to\operatorname{Y}$ be a $b$ -balanced morphism for a prime $p>p_{0}$ . Then $\operatorname{Y}$ is toral.

This section is mostly devoted to the proof of this result. The proof of Theorem 1.7 is a simple combination of Theorems 6.1 and 5.2, and is given at the end of this section.

Recall that a nilspace $\operatorname{X}$ can be equipped with a filtration of translation groups $\operatorname{\Theta}_{i}(\operatorname{X})$ , $i\geq 0$ (see [3, Definition 3.2.27]), and that for cfr nilspaces these translation groups are Lie groups (see [4, Theorem 2.9.10]).

In the proof of Theorem 6.1, we shall argue by induction on $k$ . This will enable us to assume that $\operatorname{Y}_{k-1}$ is toral, and we shall then use the following characterization of such nilspaces, which will be very convenient for the rest of the argument.

Theorem 6.2.

Let $\operatorname{X}$ be a $k$ -step cfr nilspace such that the factor $\operatorname{X}_{k-1}$ is toral. Let $G$ denote the Lie group $\operatorname{\Theta}(\operatorname{X})$ , let $G_{\bullet}$ denote the degree- $k$ filtration $(\operatorname{\Theta}_{i}(\operatorname{X}))_{i\geq 0}$ , and for an arbitrary fixed $x\in\operatorname{X}$ let $\Gamma=\operatorname{Stab}_{G}(x)$ . Then $\operatorname{X}$ is isomorphic as a compact nilspace to the coset nilspace $(G/\Gamma,G_{\bullet})$ .

This theorem tells us essentially that such a nilspace $\operatorname{X}$ must be a cfr coset nilspace, but it also gives us groups $G,\Gamma$ and a filtration $G_{\bullet}$ with which we can represent $\operatorname{X}$ . The proof is an adaptation of [4, Theorem 2.9.17]; see Theorem A.1 in Appendix A.

Given Theorem 6.2, for the proof of Theorem 6.1 we can focus on coset nilspaces. This is useful thanks to the following description of morphisms from $\mathbb{Z}_{p}$ into such nilspaces.

Proposition 6.3.

Let $\operatorname{X}=(G/\Gamma,G_{\bullet})$ be a coset nilspace. For a positive integer $N$ let $\phi:\mathbb{Z}_{N}\to G/\Gamma$ be a morphism (relative to the standard degree-1 cube structure on $\mathbb{Z}_{N}$ ). Then for every homomorphism $\beta:\mathbb{Z}\to\mathbb{Z}_{N}$ there is a polynomial map $g\in\operatorname{poly}(\mathbb{Z},G_{\bullet})$ such that $\phi\operatorname{\circ}\beta=\pi_{\Gamma}\operatorname{\circ}g$ .

The proof, adapting an argument from [38], is given at the end of Appendix A.

In the proof of Theorem 6.1, we use the following lemma in the inductive step.

Lemma 6.4.

Let $\operatorname{X}$ be a cfr coset nilspace $(G/\Gamma,G_{\bullet})$ , and let $\operatorname{Y}$ be the coset nilspace $(G/(G^{0}\,\Gamma),G_{\bullet})$ where $G^{0}$ is the identity component of $G$ . Then the quotient map $q:G/\Gamma\to G/(G^{0}\,\Gamma)$ is a morphism of compact nilspaces, and $\operatorname{Y}$ is in bijection with the set of connected components of $\operatorname{X}$ . In particular $\operatorname{Y}$ is a finite (discrete) nilspace.

Proof.

It is clear that $q$ is a (continuous) morphism, because any cube $\operatorname{c}\in\operatorname{C}^{n}(\operatorname{X})$ lifts to a cube $\tilde{\operatorname{c}}\in\operatorname{C}^{n}(G_{\bullet})$ , i.e. we have $\operatorname{c}=\tilde{\operatorname{c}}\Gamma^{\llbracket n\rrbracket}$ (by definition of the coset nilspace structure), so $q\operatorname{\circ}\operatorname{c}=\tilde{\operatorname{c}}(G^{0}\,\Gamma)^{\llbracket n\rrbracket}$ is indeed a cube on $\operatorname{Y}$ .

We claim that the quotient map $\pi_{\Gamma}:G\to G/\Gamma$ induces a bijection from the set of cosets of $G^{0}\Gamma$ (i.e. the set $\operatorname{Y}$ ) to the set of connected components of $G/\Gamma$ . First note that the image under $\pi_{\Gamma}$ of any coset of $G^{0}\Gamma$ is open, because $G^{0}$ is open (as $G$ is a Lie group) and $\pi_{\Gamma}$ is an open map. Since these images cover the compact set $G/\Gamma$ , and clearly two distinct cosets of $G^{0}\Gamma$ are mapped to disjoint such images by $\pi_{\Gamma}$ , these images form a finite partition of $G/\Gamma$ . Moreover, the image of every coset $gG^{0}\Gamma$ is connected in $G/\Gamma$ (indeed for any points $gg_{1}\gamma_{1},gg_{2}\gamma_{2}$ in this coset there are paths from $gg_{i}\gamma_{i}$ to $g\gamma_{i}$ via $G^{0}$ for $i=1,2$ , and then $g\gamma_{1}$ , $g\gamma_{2}$ are identified in the quotient), so each such image is included in one of the components of $G/\Gamma$ , and therefore must be the whole component (otherwise this component would be a disjoint union of at least two such images, which are open sets, contradicting the connectedness of the component). This shows that each component of $G/\Gamma$ is an image under $\pi_{\Gamma}$ of a unique coset of $G^{0}\Gamma$ , which proves our claim. ∎

We need two more lemmas before we can prove Theorem 6.1.

Lemma 6.5.

Let $\operatorname{Y}$ be a coset nilspace, let $N\in\mathbb{N}$ and let $\phi:\mathbb{Z}_{N}\to\operatorname{Y}$ be a morphism. Then for each $k\in\mathbb{N}$ the map $\phi^{\llbracket k\rrbracket}:\operatorname{c}\mapsto\phi\operatorname{\circ}\operatorname{c}$ is a nilspace morphism $\operatorname{C}^{k}(\mathbb{Z}_{N})\to\operatorname{C}^{k}(\operatorname{Y})$ .

Proof.

We are assuming that $\operatorname{Y}$ is the coset space $G/\Gamma$ , for some filtered group $(G,G_{\bullet})$ and $\Gamma\leq G$ , and that $\operatorname{C}^{k}(\operatorname{Y})=\{\operatorname{c}\Gamma^{\llbracket k\rrbracket}:\operatorname{c}\in\operatorname{C}^{k}(G_{\bullet})\}$ . We view the abelian group $\operatorname{C}^{k}(\mathbb{Z}_{N})$ as a nilspace by equipping it with the standard cubes, and we view $\operatorname{C}^{k}(\operatorname{Y})$ as the coset nilspace $\widetilde{G}/\widetilde{\Gamma}$ where $\widetilde{G}$ , $\widetilde{\Gamma}$ denote the group $\operatorname{C}^{k}(G_{\bullet})$ and subgroup $\operatorname{C}^{k}(\Gamma_{\bullet})$ respectively (with $\Gamma_{i}:=\Gamma\cap G_{i}$ ), and where $\widetilde{G}$ is equipped with the filtration $\widetilde{G}_{\bullet}=\big{(}G_{i}^{\llbracket k\rrbracket}\cap\operatorname{C}^{k}(G_{\bullet})\big{)}_{i\geq 0}$ . By Proposition 6.3 there is a polynomial map $g\in\operatorname{poly}(\mathbb{Z},G_{\bullet})$ such that, identifying $\mathbb{Z}_{N}$ with the set of integers $[0,N-1]$ with addition mod $N$ , we have $\phi(n)=g(n)\Gamma$ for all $n$ (in particular $g$ is $N$ -periodic mod $\Gamma$ ). Define

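$$g^{(k)}:\mathbb{Z}^{k+1}\to G^{\llbracket k\rrbracket},\qquad\mathbf{n}=(n_{0},n_{1},\dots,n_{k})\mapsto\big(g(n_{0}+v\cdot(n_{1},\dots,n_{k}))\big)_{v\in\llbracket k\rrbracket}.\tag{6}$$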

The group isomorphism $\theta:\mathbb{Z}_{N}^{k+1}\to\operatorname{C}^{k}(\mathbb{Z}_{N})$ , $\mathbf{n}\mapsto\big{(}n_{0}+v\cdot(n_{1},\dots,n_{k})\mod N\big{)}_{v\in\llbracket k\rrbracket}$ is a nilspace isomorphism. Hence $\phi^{\llbracket k\rrbracket}$ is a morphism if and only if the map $\mathbf{n}\mapsto g^{(k)}(\mathbf{n})\Gamma^{\llbracket k\rrbracket}$ is a morphism $\mathbb{Z}_{N}^{k+1}\to\operatorname{C}^{k}(\operatorname{Y})$ (since the latter map is $\phi^{\llbracket k\rrbracket}\operatorname{\circ}\theta$ ). Recall that the morphisms between two group nilspaces are the polynomial maps between the filtered groups [3, Theorem 2.2.14]. Hence it suffices to prove that $g^{(k)}\in\operatorname{poly}(\mathbb{Z}^{k+1},\widetilde{G}_{\bullet})$ , for then $g^{(k)}$ is a morphism into $\widetilde{G}$ , and hence $\mathbf{n}\mapsto g^{(k)}(\mathbf{n})\Gamma^{\llbracket k\rrbracket}$ is a morphism, as required.

By Lemma A.5, there is a unique expression $g(n)=g_{0}g_{1}^{n}\cdots g_{k}^{\binom{n}{k}}$ , where $g_{i}\in G_{i}$ . Substituting this expression into (6) and expanding, we see that $g^{(k)}(\mathbf{n})$ is a pointwise product of maps $h_{j}:\mathbb{Z}^{k+1}\to\widetilde{G}$ , $j\in[0,k]$ , of the form $h_{j}(\mathbf{n})=\Big{(}g_{j}^{\binom{n_{0}+v\cdot(n_{1},\ldots,n_{k})}{j}}\Big{)}_{v\in\llbracket k\rrbracket}$ . By Leibman’s theorem [30], polynomial maps form a group under pointwise multiplication, so it suffices to show that for every $j\in[0,k]$ we have $h_{j}\in\operatorname{poly}(\mathbb{Z}^{k+1},\widetilde{G}_{\bullet})$ . We have $\binom{n_{0}+v\cdot(n_{1},\ldots,n_{k})}{j}=\sum_{\mathbf{i}=(i_{0},\ldots,i_{k})\in\mathbb{Z}_{\geq 0}^{k+1},|\mathbf{i}|=j}\binom{n_{0}}{i_{0}}\binom{v_{1}n_{1}}{i_{1}}\cdots\binom{v_{k}n_{k}}{i_{k}}$ , by the identity of Chu–Vandermonde. Letting $\mathbf{i}^{\prime}=(i_{1},\ldots,i_{k})$ be the restriction of $\mathbf{i}$ to its last $k$ coordinates, we note that $\binom{n_{0}}{i_{0}}\binom{v_{1}n_{1}}{i_{1}}\cdots\binom{v_{k}n_{k}}{i_{k}}$ gives a non-zero contribution to the last sum above only if $\operatorname{supp}(\mathbf{i}^{\prime})\subset\operatorname{supp}(v)$ . We deduce that $h_{j}(\mathbf{n})=\prod_{\mathbf{i},\,|\mathbf{i}|=j}g_{\mathbf{i}}^{\tbinom{n_{0}}{i_{0}}\cdots\tbinom{n_{k}}{i_{k}}}$ , where $g_{\mathbf{i}}$ is the element of $G^{\llbracket k\rrbracket}$ with $g_{\mathbf{i}}(v)=g_{j}$ if $\operatorname{supp}(v)\supset\operatorname{supp}(\mathbf{i}^{\prime})$ , and $g_{\mathbf{i}}(v)=\mathrm{id}_{G}$ otherwise. Now observe that, since $|\operatorname{supp}(\mathbf{i}^{\prime})|\leq j$ , the set $\{v:\operatorname{supp}(v)\supset\operatorname{supp}(\mathbf{i}^{\prime})\}$ is a face of codimension at most $j$ in $\llbracket k\rrbracket$ . Since $g_{j}\in G_{j}$ , it follows that $g_{\mathbf{i}}\in\widetilde{G}_{j}$ .

We have shown that $h_{j}$ is a pointwise product of maps of the form $\mathbf{n}\mapsto g_{\mathbf{i}}^{\binom{\mathbf{n}}{\mathbf{i}}}$ , where $\binom{\mathbf{n}}{\mathbf{i}}=\binom{n_{0}}{i_{0}}\binom{n_{1}}{i_{1}}\cdots\binom{n_{k}}{i_{k}}$ . It is known that these maps are polynomial (see the proof of [18, Lemma 6.7]). This proves that $g^{(k)}\in\operatorname{poly}(\mathbb{Z}^{k+1},\widetilde{G}_{\bullet})$ , and the result follows. ∎
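The binomial expansion at the heart of this proof can be tested numerically. The following Python sketch (with illustrative function names, not taken from the paper) compares the exponent $\binom{n_{0}+v\cdot(n_{1},\ldots,n_{k})}{j}$ with the restricted sum of products $\binom{n_{0}}{i_{0}}\cdots\binom{n_{k}}{i_{k}}$ over multi-indices $\mathbf{i}$ with $|\mathbf{i}|=j$ and $\operatorname{supp}(\mathbf{i}^{\prime})\subset\operatorname{supp}(v)$, for small non-negative integer inputs.

```python
from itertools import product
from math import comb, prod

def exponent_direct(n, v, j):
    """binom(n_0 + v.(n_1,...,n_k), j) for n = (n_0,...,n_k), v in {0,1}^k."""
    m = n[0] + sum(vl * nl for vl, nl in zip(v, n[1:]))
    return comb(m, j)

def exponent_expanded(n, v, j):
    """Sum over |i| = j with supp(i') inside supp(v) of prod_l binom(n_l, i_l)."""
    k = len(v)
    support_v = {l for l in range(k) if v[l] == 1}
    total = 0
    for i in product(range(j + 1), repeat=k + 1):
        if sum(i) != j:
            continue
        support_i = {l for l in range(k) if i[l + 1] > 0}
        if not support_i <= support_v:
            continue  # such terms vanish, since binom(0, i_l) = 0 for i_l > 0
        total += prod(comb(n[l], i[l]) for l in range(k + 1))
    return total

k = 2
agree = all(
    exponent_direct(n, v, j) == exponent_expanded(n, v, j)
    for n in product(range(4), repeat=k + 1)
    for v in product((0, 1), repeat=k)
    for j in range(4)
)
print(agree)
```

The check is restricted to non-negative integers because `math.comb` does not accept negative arguments; the identity itself is a polynomial identity in $\mathbf{n}$.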

Remark 6.6.

In Lemma 6.5 we equipped the cube set $\operatorname{C}^{k}(\operatorname{Y})$ itself with a natural nilspace structure, but note that this was enabled by the specific coset-nilspace nature of $\operatorname{Y}$ . There is in fact a cubespace structure that one can define on $\operatorname{C}^{k}(\operatorname{X})$ for a general nilspace $\operatorname{X}$ : given a map $\operatorname{c}:\llbracket m\rrbracket\to\operatorname{C}^{k}(\operatorname{X})$ , $v\mapsto\operatorname{c}(v)$ (where $\operatorname{c}(v)$ is itself a cube $w\mapsto\operatorname{c}(v)(w)$ in $\operatorname{C}^{k}(\operatorname{X})$ ), we declare $\operatorname{c}$ to be an $m$ -cube on $\operatorname{C}^{k}(\operatorname{X})$ if for every $w\in\llbracket k\rrbracket$ , the map $\llbracket m\rrbracket\to\operatorname{X}$ , $v\mapsto\operatorname{c}(v)(w)$ is in $\operatorname{C}^{m}(\operatorname{X})$ . It seems to be an interesting question whether this cubespace structure satisfies the completion axiom and thus defines a nilspace structure. The answer is affirmative when $\operatorname{X}$ is a coset nilspace, because it can be checked that in this case this structure is equivalent to the one used on $\operatorname{C}^{k}(\operatorname{Y})$ above. This fact can be used to give an alternative proof of Lemma 6.5.

Lemma 6.7.

Let $\operatorname{Z}_{1}$ , $\operatorname{Z}_{2}$ be finite abelian groups with coprime orders, and let $\ell\in\mathbb{N}$ . Then every morphism $\mathcal{D}_{1}(\operatorname{Z}_{1})\to\mathcal{D}_{\ell}(\operatorname{Z}_{2})$ is constant.

Proof.

We argue by induction on $\ell$ . For $\ell=1$ , note that a morphism $\phi:\mathcal{D}_{1}(\operatorname{Z}_{1})\to\mathcal{D}_{1}(\operatorname{Z}_{2})$ satisfies $\Delta_{s}\Delta_{t}\phi(x)=0$ for every $s,t,x\in\operatorname{Z}_{1}$ (see [3, formula (2.9)]), which means that $\phi$ is an affine homomorphism $\operatorname{Z}_{1}\to\operatorname{Z}_{2}$ , so the map $\psi:x\mapsto\phi(x)-\phi(0)$ is a homomorphism. Since $\psi(\operatorname{Z}_{1})$ is both a quotient of $\operatorname{Z}_{1}$ and a subgroup of $\operatorname{Z}_{2}$ , the order $|\psi(\operatorname{Z}_{1})|$ divides both $|\operatorname{Z}_{1}|$ and $|\operatorname{Z}_{2}|$ , so we must have $|\psi(\operatorname{Z}_{1})|=1$ , so $\phi$ is constant. For $\ell>1$ , note that for every morphism $\phi:\mathcal{D}_{1}(\operatorname{Z}_{1})\to\mathcal{D}_{\ell}(\operatorname{Z}_{2})$ , for every $t\in\operatorname{Z}_{1}$ the map $\Delta_{t}\phi:x\mapsto\phi(x+t)-\phi(x)$ is a morphism $\mathcal{D}_{1}(\operatorname{Z}_{1})\to\mathcal{D}_{\ell-1}(\operatorname{Z}_{2})$ , so by induction $\Delta_{t}\phi$ is a constant function of $x$ , for each $t$ . Hence $\Delta_{s}\Delta_{t}\phi(x)=0$ for all $s,t,x\in\operatorname{Z}_{1}$ . Arguing as for $\ell=1$ , we deduce that $\phi$ is constant. ∎
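Lemma 6.7 can be illustrated by brute force for small cyclic groups. The sketch below assumes the standard description of morphisms $\mathcal{D}_{1}(\mathbb{Z}_{n_{1}})\to\mathcal{D}_{\ell}(\mathbb{Z}_{n_{2}})$ as the maps all of whose $(\ell+1)$-fold iterated differences vanish; the source states this only for $\ell=1$, so for larger $\ell$ it is an assumption of the example. The function names are illustrative.

```python
from itertools import product

def is_polynomial_map(phi, n1, n2, degree):
    """True if every (degree+1)-fold iterated difference of phi: Z_n1 -> Z_n2 is 0."""
    def difference(f, t):
        # (Delta_t f)(x) = f(x + t) - f(x), computed in Z_n2
        return tuple((f[(x + t) % n1] - f[x]) % n2 for x in range(n1))
    for steps in product(range(n1), repeat=degree + 1):
        f = phi
        for t in steps:
            f = difference(f, t)
        if any(f):
            return False
    return True

def nonconstant_polynomial_maps(n1, n2, degree):
    """All non-constant maps Z_n1 -> Z_n2 of degree at most `degree`."""
    return [phi for phi in product(range(n2), repeat=n1)
            if len(set(phi)) > 1 and is_polynomial_map(phi, n1, n2, degree)]

# Coprime orders 3 and 4: no non-constant examples for degree 1 or 2.
print(nonconstant_polynomial_maps(3, 4, 1), nonconstant_polynomial_maps(3, 4, 2))
# Orders 2 and 4 are not coprime: x -> 2x is a non-constant homomorphism.
print(nonconstant_polynomial_maps(2, 4, 1))
```

The second call shows that the coprimality hypothesis cannot simply be dropped: the tuple `(0, 2)` encodes the homomorphism $x\mapsto 2x$ from $\mathbb{Z}_{2}$ to $\mathbb{Z}_{4}$.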

We can now prove the characterization of balanced morphisms on $\mathbb{Z}_{p}$ .

Proof of Theorem 6.1.

By Theorem 1.10 it suffices to show that $\operatorname{C}^{k}(\operatorname{Y})$ is connected. We prove this by induction on $k$ . The base case $k=0$ is trivial.

Let $k\geq 1$ , and suppose for a contradiction that $\operatorname{C}^{k}(\operatorname{Y})$ is disconnected.

We have that $\pi_{k-1}\operatorname{\circ}\phi$ is also $b$ -balanced (by our choice of a factor-consistent metrization), so we can assume by induction that $\operatorname{Y}_{k-1}$ is toral. Hence $\operatorname{Y}$ is isomorphic to a compact coset nilspace $(G/\Gamma,G_{\bullet})$ , by Theorem 6.2. Letting $\widetilde{G}=\operatorname{C}^{k}(G_{\bullet})$ with the filtration $\widetilde{G}_{\bullet}=\big{(}G_{j}^{\llbracket k\rrbracket}\cap\operatorname{C}^{k}(G_{\bullet})\big{)}_{j\geq 0}$ , and $\widetilde{\Gamma}=\operatorname{C}^{k}(\Gamma_{\bullet})$ , we have that $\operatorname{C}^{k}(\operatorname{Y})$ is homeomorphic to the compact coset space $\widetilde{G}/\widetilde{\Gamma}$ , which we equip with the coset nilspace structure determined by $\widetilde{G}_{\bullet}$ . By Lemma 6.5, the map $\phi^{\llbracket k\rrbracket}:\operatorname{C}^{k}(\mathbb{Z}_{p})\to\operatorname{C}^{k}(\operatorname{Y})$ , $\operatorname{c}\mapsto\phi\operatorname{\circ}\operatorname{c}$ is a morphism. We apply Lemma 6.4 to $\operatorname{C}^{k}(\operatorname{Y})$ , and let $q:\widetilde{G}/\widetilde{\Gamma}\to\widetilde{G}/(\widetilde{G}^{0}\widetilde{\Gamma})$ be the resulting quotient morphism. Then $q\operatorname{\circ}\phi^{\llbracket k\rrbracket}$ is a morphism from $\operatorname{C}^{k}(\mathbb{Z}_{p})$ to a discrete nilspace $\widetilde{\operatorname{Y}}$ of finite cardinality equal to the number of connected components of $\operatorname{C}^{k}(\operatorname{Y})$ .

We claim that for $b$ sufficiently small (depending only on $M$ ), for every such component $C$ we have $\phi^{\llbracket k\rrbracket}\big{(}\operatorname{C}^{k}(\mathbb{Z}_{p})\big{)}\cap C\neq\emptyset$ . Indeed, by Lemma A.3 the finitely many connected components of $\operatorname{C}^{k}(\operatorname{Y})$ all have equal Haar measure $\nu>0$ . Hence, for any such component $C$ , it follows from the Portmanteau Theorem [29, (17.20)] (using that $C$ is open) that the measure $\mu_{\operatorname{C}^{k}(\mathbb{Z}_{p})}\operatorname{\circ}(\phi^{\llbracket k\rrbracket})^{-1}(C)$ is at least $\nu-o(1)_{b\to 0}$ (where $\mu_{\operatorname{C}^{k}(\mathbb{Z}_{p})}$ is the Haar measure on $\operatorname{C}^{k}(\mathbb{Z}_{p})$ ), so for $b$ sufficiently small this measure is positive, which proves our claim. This claim implies that $q\operatorname{\circ}\phi^{\llbracket k\rrbracket}$ is surjective.

Now let $\widetilde{\operatorname{Y}}_{i}$ be the nilspace factor of $\widetilde{\operatorname{Y}}$ for the minimal $i\in[k]$ such that $\widetilde{\operatorname{Y}}_{i}$ is not the 1-point nilspace. In particular, it follows from minimality of $i$ that $\widetilde{\operatorname{Y}}_{i}$ is a finite abelian group $\operatorname{Z}$ with the degree- $i$ nilspace structure $\mathcal{D}_{i}(\operatorname{Z})$ . Since the factor map $\pi_{i}:\widetilde{\operatorname{Y}}\to\widetilde{\operatorname{Y}}_{i}$ is a surjective morphism, it follows that the map $\psi:=\pi_{i}\operatorname{\circ}q\operatorname{\circ}\phi^{\llbracket k\rrbracket}$ is a surjective morphism $\operatorname{C}^{k}(\mathbb{Z}_{p})\to\widetilde{\operatorname{Y}}_{i}$ . For $p$ sufficiently large in terms of $M$ , the orders $|\operatorname{C}^{k}(\mathbb{Z}_{p})|=p^{k+1}$ and $|\widetilde{\operatorname{Y}}_{i}|$ are coprime, so by Lemma 6.7 the morphism $\psi$ must be constant, and therefore cannot be surjective, so we have a contradiction. ∎
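The count $|\operatorname{C}^{k}(\mathbb{Z}_{p})|=p^{k+1}$ used in the last step comes from the parametrization $\theta$ of cubes introduced in the proof of Lemma 6.5. The following Python sketch enumerates the image of that parametrization for small parameters and checks the coprimality with a small hypothetical order for $\widetilde{\operatorname{Y}}_{i}$; the function name `cubes` and the sample values are illustrative only.

```python
from itertools import product
from math import gcd

def cubes(N, k):
    """Set of standard k-cubes on Z_N, each written as a tuple indexed by v in {0,1}^k."""
    vertices = list(product((0, 1), repeat=k))
    return {
        tuple((n[0] + sum(vl * nl for vl, nl in zip(v, n[1:]))) % N for v in vertices)
        for n in product(range(N), repeat=k + 1)
    }

p, k = 5, 2
cube_count = len(cubes(p, k))
print(cube_count)            # number of 2-cubes on Z_5
print(gcd(cube_count, 4))    # coprime to any order smaller than p, e.g. 4
```

Distinct parameters $\mathbf{n}$ give distinct cubes because the values at $v=0$ and at the standard basis vectors recover $n_{0},n_{1},\ldots,n_{k}$.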

Finally, having proved Theorem 6.1, we can prove the inverse theorem for $\mathbb{Z}_{p}$ .

Proof of Theorem 1.7.

We first note that, having fixed an arbitrary complexity notion for cfr nilspaces $\operatorname{Y}$ , there is a function $h:\mathbb{N}\to\mathbb{N}$ (which can be assumed to be increasing) such that if $\textrm{Comp}(\operatorname{Y})\leq m$ then $\operatorname{Y}$ has at most $h(m)$ connected components. Now suppose that $\|f\|_{U^{k+1}(\mathbb{Z}_{p})}\geq\delta$ . We apply Theorem 5.2 with $\delta$ , with a function $b$ to be specified later and with $\operatorname{X}=\mathbb{Z}_{p}$ . Let $M=M(k,\delta,b)>0$ be the resulting number and let $F\operatorname{\circ}\phi$ be the resulting nilspace polynomial, for an underlying cfr nilspace $\operatorname{Y}$ with $\textrm{Comp}(\operatorname{Y})\leq m\leq M$ , and with the morphism $\phi:\mathbb{Z}_{p}\to\operatorname{Y}$ being $b(m)$ -balanced. If $p>h(m)$ and $b(m)$ is sufficiently small, then it follows by Theorem 6.1 that $\operatorname{Y}$ is toral. In particular, it is a connected nilmanifold, and by Proposition 6.3 the nilspace polynomial is a $p$ -periodic nilsequence as required. Thus, for $p>h(m)$ we obtain the conclusion of Theorem 1.7 with $C_{k,\delta}=M$ . For $p\leq h(m)$ we also obtain the conclusion, but for a simpler reason: letting $\phi$ be the homomorphism embedding $\mathbb{Z}_{p}$ as a discrete subgroup of the circle group $\mathbb{R}/\mathbb{Z}$ , and letting $F:\mathbb{R}/\mathbb{Z}\to\mathbb{C}$ be some function with Lipschitz constant $O_{p}(1)$ that extends the function $f\operatorname{\circ}\phi^{-1}$ from $\phi(\mathbb{Z}_{p})$ to all of $\mathbb{R}/\mathbb{Z}$ , we then have $\langle f,F\operatorname{\circ}\phi\rangle=\|f\|_{L^{2}(\mathbb{Z}_{p})}^{2}\geq\|f\|_{U^{k+1}(\mathbb{Z}_{p})}^{2^{k+1}}\geq\delta^{2^{k+1}}$ , and the conclusion of Theorem 1.7 follows with constant $C_{k,\delta}$ still depending only on $k$ and $\delta$ . ∎

Appendix A Results from nilspace theory

In this appendix our first and main aim is to prove Theorem 1.10. We also gather some results from nilspace theory which are adaptations of results from previous works.

We begin with the following useful description of cfr $k$ -step nilspaces whose factor $\operatorname{X}_{k-1}$ is toral, which was stated as Theorem 6.2.

Theorem A.1.

Let $\operatorname{X}$ be a $k$ -step cfr nilspace such that the factor $\operatorname{X}_{k-1}$ is toral. Let $G$ denote the Lie group $\operatorname{\Theta}(\operatorname{X})$ , let $G_{\bullet}$ denote the degree- $k$ filtration $(\operatorname{\Theta}_{i}(\operatorname{X}))_{i\geq 0}$ , and for an arbitrary fixed $x\in\operatorname{X}$ let $\Gamma=\operatorname{Stab}_{G}(x)$ . Then $\operatorname{X}$ is isomorphic as a compact nilspace to the coset space $G/\Gamma$ with cube sets $\operatorname{C}^{n}(\operatorname{X})=(\operatorname{C}^{n}(G_{\bullet})\cdot\Gamma^{\llbracket n\rrbracket})/\Gamma^{\llbracket n\rrbracket}$ , $n\geq 0$ .

To prove this we adapt the proof of [4, Theorem 2.9.17].

Proof.

Fix $x\in\operatorname{X}$ and let $\Gamma=\operatorname{Stab}_{G}(x)$ .

We first claim that $\Gamma$ is discrete. Indeed, letting $h:\operatorname{\Theta}(\operatorname{X})\to\operatorname{\Theta}(\operatorname{X}_{k-1})$ be the natural continuous homomorphism defined by $h(\alpha)(\pi_{k-1}(y))=\pi_{k-1}(\alpha(y))$ for $y\in\operatorname{X}$ (see [4, Lemma 2.9.3]), note that $h(\Gamma)$ is a subgroup of the stabilizer of $\pi_{k-1}(x)$ in $\operatorname{\Theta}(\operatorname{X}_{k-1})$ , and since $\operatorname{X}_{k-1}$ is toral, this stabilizer is discrete (see the proof of [4, Theorem 2.9.17]), so $h(\Gamma)$ is discrete. Then, since $h^{-1}(h(\Gamma))$ is a union of cosets of $\ker(h)$ , it suffices to show that $\Gamma\cap\ker(h)$ is discrete. This follows from [4, Lemma 2.9.9], since no non-trivial element of $\tau(\operatorname{Z}_{k})$ stabilizes $x$ .

By [4, Corollary 2.9.12] the Lie group $\operatorname{\Theta}(\operatorname{X})^{0}$ acts transitively on the connected components of $\operatorname{X}$ , and since $\operatorname{X}_{k-1}$ is toral, it follows that $\langle\operatorname{\Theta}(\operatorname{X})^{0},\operatorname{Z}_{k}\rangle$ acts transitively on $\operatorname{X}$ . Indeed, if $x,y\in\operatorname{X}$ are in different components, then there is $g^{\prime}\in\operatorname{\Theta}(\operatorname{X}_{k-1})^{0}$ such that $g^{\prime}\pi_{k-1}(x)=\pi_{k-1}(y)$ . Then there is $g\in\operatorname{\Theta}(\operatorname{X})^{0}$ such that $h(g)=g^{\prime}$ , and since $g$ is path-connected to the identity in $G$ , it follows that $gx$ is in the same component as $x$ . Moreover, by definition of $h$ we have $\pi_{k-1}(gx)=g^{\prime}\pi_{k-1}(x)=\pi_{k-1}(y)$ . There is therefore $z\in\operatorname{Z}_{k}$ such that $zgx=y$ , which proves the claimed transitivity. Now since $G\supset\langle\operatorname{\Theta}(\operatorname{X})^{0},\operatorname{Z}_{k}\rangle$ , we have that $G$ also acts transitively on $\operatorname{X}$ , whence $\operatorname{X}$ is homeomorphic to the coset space $G/\Gamma$ (see [25, Ch. II, Theorem 3.2]). In particular, since $\operatorname{X}$ is compact, we have that $\Gamma$ is cocompact.

Recall from [3, Definition 3.2.38] that two cubes $\operatorname{c}_{1},\operatorname{c}_{2}\in\operatorname{C}^{n}(\operatorname{X})$ are said to be translation equivalent if there is an element $\operatorname{c}\in\operatorname{C}^{n}(G_{\bullet})$ such that $\operatorname{c}_{2}(v)=\operatorname{c}(v)\cdot\operatorname{c}_{1}(v)$ for every $v\in\llbracket n\rrbracket$ . We now show that $\operatorname{C}^{n}(\operatorname{X})=\pi_{\Gamma}^{\llbracket n\rrbracket}\big{(}\operatorname{C}^{n}(G_{\bullet})\big{)}$ , i.e., that every cube on $\operatorname{X}$ is translation equivalent to the constant $x$ cube. First we claim that for every cube $\operatorname{c}\in\operatorname{C}^{n}(\operatorname{X})$ there is a cube $\operatorname{c}^{\prime}\in\operatorname{C}^{n}(\operatorname{X})$ that is translation equivalent to the constant $x$ cube and such that $\pi_{k-1}\operatorname{\circ}\operatorname{c}=\pi_{k-1}\operatorname{\circ}\operatorname{c}^{\prime}$ . Indeed, given $\operatorname{c}\in\operatorname{C}^{n}(\operatorname{X})$ , we have $\pi_{k-1}\operatorname{\circ}\operatorname{c}\in\operatorname{C}^{n}(\operatorname{X}_{k-1})$ , and since $\operatorname{X}_{k-1}$ is toral the latter cube is translation equivalent to the cube with constant value $x^{\prime}=\pi_{k-1}(x)$ , i.e. $\pi_{k-1}\operatorname{\circ}\operatorname{c}=\tilde{\operatorname{c}}\cdot x^{\prime}$ for some cube $\tilde{\operatorname{c}}$ on the group $\operatorname{\Theta}(\operatorname{X}_{k-1})^{0}$ with the filtration $\big{(}\operatorname{\Theta}_{i}(\operatorname{X}_{k-1})^{0}\big{)}_{i\geq 0}$ . By the unique factorization result for these cubes [3, Lemma 2.2.5], we have $\tilde{\operatorname{c}}={\tilde{g}_{0}}^{F_{0}}\cdots{\tilde{g}_{2^{n}-1}}^{F_{2^{n}-1}}$ where $\tilde{g}_{j}\in\operatorname{\Theta}_{\operatorname{codim}(F_{j})}(\operatorname{X}_{k-1})^{0}$ . 
By [4, Theorem 2.9.10 (ii)], for each $j\in[0,2^{n})$ there is $g_{j}\in\operatorname{\Theta}_{\operatorname{codim}(F_{j})}(\operatorname{X})^{0}$ such that $h(g_{j})=\tilde{g}_{j}$ . Let $\operatorname{c}^{*}$ be the cube in $\operatorname{C}^{n}(\operatorname{\Theta}(\operatorname{X})^{0})$ defined by $\operatorname{c}^{*}={g_{0}}^{F_{0}}\cdots{g_{2^{n}-1}}^{F_{2^{n}-1}}$ . Let $\operatorname{c}^{\prime}=\operatorname{c}^{*}\cdot x$ . This is in $\operatorname{C}^{n}(\operatorname{X})$ , and is translation equivalent to the constant $x$ cube. By construction $\pi_{k-1}\operatorname{\circ}\operatorname{c}^{\prime}$ $=\pi_{k-1}^{\llbracket n\rrbracket}(\operatorname{c}^{*}\cdot x)=\big{(}\prod_{j}h(g_{j})^{F_{j}}\big{)}\cdot x^{\prime}=\big{(}\prod_{j}\tilde{g}_{j}^{F_{j}}\big{)}\cdot x^{\prime}=\tilde{\operatorname{c}}\cdot x^{\prime}=\pi_{k-1}\operatorname{\circ}\operatorname{c}$ , as we claimed.

It follows from [3, Theorem 3.2.19] and the definition of degree- $k$ bundles (in particular [3, (3.5)]) that $\operatorname{c}-\operatorname{c}^{\prime}\in\operatorname{C}^{n}(\mathcal{D}_{k}(\operatorname{Z}_{k}))$ . But then, using translations from $\tau(\operatorname{Z}_{k})=\operatorname{\Theta}_{k}(\operatorname{X})$ , we can correct $\operatorname{c}^{\prime}$ further to obtain $\operatorname{c}$ , thus showing that $\operatorname{c}$ is itself a translation cube with translations from $\operatorname{\Theta}(\operatorname{X})$ . (Such a correction procedure has been used in previous arguments, see for instance the proof of [3, Lemma 3.2.25].)

We have thus shown that $\operatorname{C}^{n}(\operatorname{X})\subset\pi_{\Gamma}^{\llbracket n\rrbracket}\big{(}\operatorname{C}^{n}(G_{\bullet})\big{)}$ . The opposite inclusion is clear, by definition of the groups $\operatorname{\Theta}_{i}(\operatorname{X})$ . ∎

We can now prove Theorem 1.10, which we restate here.

Theorem A.2.

Let $\operatorname{X}$ be a $k$ -step cfr nilspace. If $\operatorname{C}^{k}(\operatorname{X})$ is connected, then $\operatorname{X}$ is toral.

Proof.

We argue by induction on $k$ . For $k=1$ the statement is clear. For $k>1$ , first note that $\operatorname{C}^{k}(\operatorname{X}_{k-1})$ is connected (by continuity of $\pi_{k-1}$ ), and so (since projection to a $(k-1)$ -face of a $k$ -cube is a continuous map) we have also that $\operatorname{C}^{k-1}(\operatorname{X}_{k-1})$ is connected, so by induction we have that $\operatorname{X}_{k-1}$ is toral. Now suppose for a contradiction that $\operatorname{X}$ is not toral. Then the last structure group $\operatorname{Z}_{k}$ must be a disconnected compact abelian Lie group. By quotienting out the torus factor of $\operatorname{Z}_{k}$ if necessary, we can assume that the $k$ -th structure group $\operatorname{Z}_{k}$ of $\operatorname{X}$ is a finite abelian group of cardinality greater than 1. We shall now deduce that $\operatorname{C}^{k}(\operatorname{X})$ must be disconnected, a contradiction.

By Theorem A.1 we have that $\operatorname{X}$ is isomorphic to the coset nilspace $(G/\Gamma,G_{\bullet})$ where $G=\operatorname{\Theta}(\operatorname{X})$ and $\Gamma=\operatorname{Stab}_{G}(x)$ for some fixed point $x\in\operatorname{X}$. Hence $\operatorname{C}^{k}(\operatorname{X})=\operatorname{C}^{k}(G_{\bullet})/\Gamma^{\llbracket k\rrbracket}$. Let $\sigma_{k}$ be the Gray code map on $G^{\llbracket k\rrbracket}$ [3, Definition 2.2.22], and recall that restricted to $\operatorname{C}^{k}(G_{\bullet})$ this map takes values in $G_{k}$ (see [4, Proposition 2.2.25]) and that $G_{k}\cong\operatorname{Z}_{k}$ (see [3, Lemma 3.2.37]). We know that shifting any value $\operatorname{c}(v)$ of a cube $\operatorname{c}\in\operatorname{C}^{k}(G_{\bullet})$ by any element of $\operatorname{Z}_{k}$ still gives a cube in $\operatorname{C}^{k}(G_{\bullet})$ (see [3, Remark 3.2.12]). It follows that $\sigma_{k}$ maps $\operatorname{C}^{k}(G_{\bullet})$ onto $\operatorname{Z}_{k}$. On the other hand, the map $\sigma_{k}$ only takes the value $\mathrm{id}_{G}$ on $\Gamma^{\llbracket k\rrbracket}$, since $\Gamma\cap G_{k}=\{\mathrm{id}_{G}\}$ (as the action of $G_{k}\cong\operatorname{Z}_{k}$ is free). Now let $C$ denote the identity component of $\operatorname{C}^{k}(G_{\bullet})$. It is standard that $C$ is normal in $\operatorname{C}^{k}(G_{\bullet})$. We also have $\sigma_{k}(C\cdot\Gamma^{\llbracket k\rrbracket})=\{\mathrm{id}_{G}\}$. Indeed, for every element $c\cdot\gamma\in C\cdot\Gamma^{\llbracket k\rrbracket}$ we have $\sigma_{k}(\gamma)=\mathrm{id}_{G}$, and $c\cdot\gamma$ is in the same component as $\gamma$, so, since $\sigma_{k}$ is continuous and $\operatorname{Z}_{k}$ is discrete, we must also have $\sigma_{k}(c\cdot\gamma)=\mathrm{id}_{G}$. But then the product set $C\cdot\Gamma^{\llbracket k\rrbracket}$ must be a proper subgroup of $\operatorname{C}^{k}(G_{\bullet})$ (otherwise its image under $\sigma_{k}$ would be $G_{k}$).
Thus we have shown that $\operatorname{C}^{k}(G_{\bullet})/(C\cdot\Gamma^{\llbracket k\rrbracket})$ is not the one point space. Hence there are at least two disjoint cosets of $C\cdot\Gamma^{\llbracket k\rrbracket}$ forming a cover of $\operatorname{C}^{k}(G_{\bullet})$. Since the latter group is a Lie group, $C$ is open, and therefore these covering cosets of $C\cdot\Gamma^{\llbracket k\rrbracket}$ are open sets. But then the quotient map $q:\operatorname{C}^{k}(G_{\bullet})\to\operatorname{C}^{k}(G_{\bullet})/\Gamma^{\llbracket k\rrbracket}$ (which is open) sends these cosets to disjoint open sets covering $\operatorname{C}^{k}(G_{\bullet})/\Gamma^{\llbracket k\rrbracket}$, so $\operatorname{C}^{k}(\operatorname{X})$ is disconnected. ∎
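The surjectivity step in the proof above can be sanity-checked in the simplest abelian model. The following Python sketch makes assumptions for illustration that are not part of the proof: the group is $\mathbb{Z}/5$ with the degree-$k$ filtration (so that every map $\{0,1\}^{k}\to\mathbb{Z}/5$ is a $k$-cube), and the alternating sum over vertices stands in for the Gray code map $\sigma_{k}$. The names `sigma`, `zero_cube` and `images` are hypothetical.

```python
from itertools import product

def sigma(c, k):
    """Alternating sum over the vertices of {0,1}^k.

    Assumption: in an abelian group written additively, this plays the
    role of the Gray code map sigma_k.
    """
    return sum((-1) ** sum(v) * c[v] for v in product((0, 1), repeat=k))

k, m = 3, 5  # cube dimension and modulus of the toy group Z/m
zero_cube = {v: 0 for v in product((0, 1), repeat=k)}

images = set()
for z in range(m):
    c = dict(zero_cube)
    c[(0,) * k] = z              # shift the value at a single vertex by z
    images.add(sigma(c, k) % m)  # record the resulting value of sigma_k
```

In this toy model, shifting one vertex value by $z$ changes the alternating sum by $\pm z$, so every element of $\mathbb{Z}/5$ is attained, mirroring the claim that $\sigma_{k}$ maps $\operatorname{C}^{k}(G_{\bullet})$ onto $\operatorname{Z}_{k}$.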

We add the following lemma concerning the Haar measures on cube sets.

Lemma A.3.

Let $\operatorname{X}$ be a $k$ -step cfr nilspace such that $\operatorname{X}_{k-1}$ is toral. Then for every integer $n\geq 0$ the connected components of $\operatorname{C}^{n}(\operatorname{X})$ have equal positive Haar measure.

Proof.

Recall that $\operatorname{C}^{n}(\operatorname{X})$ is a compact abelian bundle with base $\operatorname{C}^{n}(\operatorname{X}_{k-1})$, bundle projection $\pi:=\pi_{k-1}^{\llbracket n\rrbracket}$, and structure group $\widetilde{\operatorname{Z}}_{k}:=\operatorname{C}^{n}(\mathcal{D}_{k}(\operatorname{Z}_{k}))$, where $\operatorname{Z}_{k}$ is the $k$-th structure group of $\operatorname{X}$ (see [4, Lemma 2.2.12]). The Haar measure $\mu$ on $\operatorname{C}^{n}(\operatorname{X})$ is invariant under the continuous action of $\widetilde{\operatorname{Z}}_{k}$, by construction (see [4, Proposition 2.2.5]). Assuming that there is more than one component of $\operatorname{C}^{n}(\operatorname{X})$, let $\operatorname{c}_{1},\operatorname{c}_{2}$ be any points in distinct components $C_{1}$, $C_{2}$ respectively. Then, since $\operatorname{X}_{k-1}$ is toral, by [4, Theorem 2.9.17] there is a cube $\operatorname{c}\in\operatorname{C}^{n}(\operatorname{\Theta}(\operatorname{X}_{k-1})^{0}_{\bullet})$ such that $\operatorname{c}\cdot\pi(\operatorname{c}_{1})=\pi(\operatorname{c}_{2})$. By [4, Theorem 2.9.10] there is a cube $\widetilde{\operatorname{c}}\in\operatorname{C}^{n}(\operatorname{\Theta}(\operatorname{X})^{0}_{\bullet})$ such that $\pi(\widetilde{\operatorname{c}}\cdot\operatorname{c}_{1})=\pi(\operatorname{c}_{2})$. There is therefore $z\in\widetilde{\operatorname{Z}}_{k}$ such that $\widetilde{\operatorname{c}}\cdot\operatorname{c}_{1}+z=\operatorname{c}_{2}$. Note that $\widetilde{\operatorname{c}}\cdot\operatorname{c}_{1}$ is still in $C_{1}$, since the map $\operatorname{c}_{1}\mapsto\widetilde{\operatorname{c}}\cdot\operatorname{c}_{1}$ is a composition of multiplications by face-group elements of the form $g^{F}$ where $F$ is a face in $\llbracket n\rrbracket$ and $g$ is in the connected Lie group $\operatorname{\Theta}_{\operatorname{codim}(F)}(\operatorname{X})^{0}$.
Hence $(C_{1}+z)\cap C_{2}$ is non-empty (containing $\operatorname{c}_{2}$ ), so $C_{1}+z\subset C_{2}$ (since $C_{1}+z$ is connected and $C_{2}$ is a maximal connected set), whence $\mu(C_{1})=\mu(C_{1}+z)\leq\mu(C_{2})$ . Similarly $\mu(C_{2})\leq\mu(C_{1})$ . ∎

Next, we prove the properties of the $U^{d}$ -seminorms from Definition 1.4.

Lemma A.4.

For every $k$ -step compact nilspace $\operatorname{X}$ and every $d\geq 2$ , the function $f\mapsto\|f\|_{U^{d}}$ is a seminorm on $L^{\infty}(\operatorname{X})$ .

The case of this lemma for compact abelian groups is given in several sources, all based essentially on the original argument of Gowers in [14, Lemma 3.9]. The case of nilmanifolds appears in [27, Ch. 12, Proposition 12]. These two cases already yield (via inverse limits) the result for the class of nilspaces concerned in our main results. Below we recall another proof from [7], which works at the more general level of cubic couplings. Let us mention also that $\|\cdot\|_{U^{d}}$ is non-degenerate (and is therefore a norm on $L^{\infty}(\operatorname{X})$ ) when the step $k$ of $\operatorname{X}$ is less than $d$ . For compact abelian groups this follows from the fact that $\|f\|_{U^{d}}\geq\|f\|_{U^{2}}=\|\widehat{f}\|_{\ell^{4}}$ , and for nilmanifolds it is given in [27, Ch. 12, Theorem 17]. For general compact nilspaces, the non-degeneracy follows from results in nilspace theory; as it is not needed in this paper, we omit the details.

Proof of Lemma A.4.

The lemma follows from results in [7], namely [7, Proposition 3.6], which shows that the Haar measures $\mu^{\llbracket n\rrbracket}$ on $\operatorname{C}^{n}(\operatorname{X})$ form a cubic coupling, and [7, Corollary 3.17], which yields the seminorm properties for a general cubic coupling. ∎

We close this appendix with a proof of Proposition 6.3. Recall the following basic useful description of polynomial sequences (see for instance [6, Lemma 2.8]).

Lemma A.5 (Taylor expansion).

Let $g\in\operatorname{poly}(\mathbb{Z},G_{\bullet})$ , where $G_{\bullet}$ has degree at most $s$ . Then there are unique Taylor coefficients $g_{i}\in G_{i}$ such that for all $n\in\mathbb{Z}$ we have $g(n)=g_{0}g_{1}^{n}g_{2}^{\binom{n}{2}}\cdots g_{s}^{\binom{n}{s}}$ . Conversely, every such expression defines a map $g\in\operatorname{poly}(\mathbb{Z},G_{\bullet})$ . Moreover, if $H\leq G$ and $g$ is $H$ -valued then $g_{i}\in H$ for each $i$ .
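To illustrate Lemma A.5 in the simplest case, the following Python sketch assumes $G=\mathbb{Z}$ written additively with $G_{0}=\dots=G_{s}=\mathbb{Z}$ and $G_{s+1}=\{0\}$, so that the product $g_{0}g_{1}^{n}\cdots g_{s}^{\binom{n}{s}}$ becomes the sum $\sum_{i}g_{i}\binom{n}{i}$ and the Taylor coefficients can be read off from iterated forward differences at $0$. The function names `binom`, `taylor_coefficients` and `evaluate`, and the sample polynomial, are hypothetical choices made for this example.

```python
def binom(n, i):
    """Binomial coefficient n(n-1)...(n-i+1)/i! for any integer n and i >= 0."""
    num, fact = 1, 1
    for j in range(i):
        num *= n - j
        fact *= j + 1
    return num // fact  # exact: i consecutive integers are divisible by i!

def taylor_coefficients(g, s):
    """Return [g_0, ..., g_s], where g_i is the i-th forward difference of g at 0."""
    values = [g(n) for n in range(s + 1)]
    coeffs = []
    for _ in range(s + 1):
        coeffs.append(values[0])
        values = [b - a for a, b in zip(values, values[1:])]
    return coeffs

def evaluate(coeffs, n):
    """Evaluate sum_i g_i * binom(n, i), the additive form of the Taylor expansion."""
    return sum(c * binom(n, i) for i, c in enumerate(coeffs))

g = lambda n: 3 * n**3 - 2 * n + 7   # sample degree-3 polynomial map Z -> Z
coeffs = taylor_coefficients(g, 3)   # [7, 1, 18, 18]
```

Here `evaluate(coeffs, n)` agrees with `g(n)` for negative as well as non-negative integers $n$, which reflects the fact that the expansion holds on all of $\mathbb{Z}$.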

Proof of Proposition 6.3.

Since $\phi\operatorname{\circ}\beta$ is a morphism $\mathbb{Z}\to G/\Gamma$, it suffices to prove the following statement: for every morphism $\phi:\mathbb{Z}\to G/\Gamma$, there is a morphism $\psi:\mathbb{Z}\to G$ (whence $\psi\in\operatorname{poly}(\mathbb{Z},G_{\bullet})$) such that $\pi_{\Gamma}\operatorname{\circ}\psi=\phi$. We prove this by descending induction on $j\in[k+1]$, showing that the statement holds for maps $\phi$ taking values in $(G_{j}\Gamma)/\Gamma$. For $j=k+1$, since $G_{k+1}=\{\mathrm{id}_{G}\}$, the map $\phi$ is constant and the statement is trivially verified letting $\psi$ be a constant $\Gamma$-valued map. For $j<k+1$, suppose that the statement holds for $j+1$ and that $\phi$ takes values in $(G_{j}\Gamma)/\Gamma$. It follows from the filtration property that $G_{j+1}\Gamma$ is a normal subgroup of $G_{j}\Gamma$ and that the quotient $G_{j}\Gamma/(G_{j+1}\Gamma)$ is an abelian group. Denoting this abelian group by $A_{j}$, let $q_{j}:(G_{j}\Gamma)/\Gamma\to A_{j}$ be the quotient map for the action of $G_{j+1}$ on $(G_{j}\Gamma)/\Gamma$. Note that $q_{j}$ is a nilspace morphism. More precisely, for every cube $\operatorname{c}\Gamma^{\llbracket n\rrbracket}$ on $(G_{j}\Gamma)/\Gamma$ (where $\operatorname{c}\in G_{j}^{\llbracket n\rrbracket}\cap\operatorname{C}^{n}(G_{\bullet})$), we have $q_{j}\operatorname{\circ}(\operatorname{c}\Gamma^{\llbracket n\rrbracket})=(\tilde{q}_{j}\operatorname{\circ}\operatorname{c})\Gamma^{\llbracket n\rrbracket}$ where $\tilde{q}_{j}$ is the quotient homomorphism $G_{j}\to G_{j}/G_{j+1}$; this implies that every $(j+1)$-face of $q_{j}\operatorname{\circ}(\operatorname{c}\Gamma^{\llbracket n\rrbracket})$ has value $0$ under the Gray-code map $\sigma_{j+1}$, so $q_{j}$ is a morphism into $\mathcal{D}_{j}(A_{j})$.
It follows that $q_{j}\operatorname{\circ}\phi$ is a morphism $\mathbb{Z}\to\mathcal{D}_{j}(A_{j})$ , and is in particular a polynomial map of degree at most $k$ , so by Lemma A.5 we have $q_{j}\operatorname{\circ}\phi(x)=\sum_{\ell=0}^{k}a_{\ell}\binom{x}{\ell}$ for $x\in\mathbb{Z}$ , for some $a_{\ell}\in A_{j}$ , and binomial coefficients $\binom{x}{\ell}$ . Since $q_{j}$ is surjective, there exist elements $b_{0},b_{1},\dots,b_{k}$ in $G_{j}$ such that $q_{j}(b_{\ell}\Gamma)=a_{\ell}$ for each $\ell$ . Let $\alpha:\mathbb{Z}\to G$ be the polynomial map $\alpha(x)=\prod_{\ell=0}^{k}b_{\ell}^{\binom{x}{\ell}}$ , and note that $q_{j}(\alpha(x)\Gamma)=q_{j}\operatorname{\circ}\phi(x)$ for all $x$ . It follows that the map $\alpha^{-1}\phi$ is a morphism $\mathbb{Z}\to(G_{j+1}\Gamma)/\Gamma$ , so by induction there is a map $\psi^{\prime}\in\operatorname{poly}(\mathbb{Z},G_{\bullet})$ such that $\alpha^{-1}(x)\phi(x)=\psi^{\prime}(x)\Gamma$ for all $x$ . Then $\psi(x):=\alpha(x)\psi^{\prime}(x)$ is a map in $\operatorname{poly}(\mathbb{Z},G_{\bullet})$ with the required property. ∎
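The lifting step used in this proof can be seen concretely in the one-step abelian case. The sketch below assumes $G=\mathbb{R}$ and $\Gamma=\mathbb{Z}$ (restricted to rational values so that the arithmetic is exact), takes the sample polynomial map $\phi(x)=x^{2}/3+1/2 \bmod 1$, reads off its Taylor coefficients $a_{\ell}\in\mathbb{R}/\mathbb{Z}$ by forward differences, lifts them to representatives $b_{\ell}\in[0,1)$, and checks that $\psi(x)=\sum_{\ell}b_{\ell}\binom{x}{\ell}$ projects back to $\phi$. The names `phi`, `taylor_mod1` and `psi`, and the choice of $\phi$, are hypothetical and only serve the illustration.

```python
from fractions import Fraction

def binom(x, l):
    """Binomial coefficient x(x-1)...(x-l+1)/l! for any integer x and l >= 0."""
    num, fact = 1, 1
    for j in range(l):
        num *= x - j
        fact *= j + 1
    return num // fact

def phi(x):
    """Sample polynomial map Z -> R/Z, with values represented in [0, 1)."""
    return (Fraction(x * x, 3) + Fraction(1, 2)) % 1

def taylor_mod1(f, s):
    """Taylor coefficients of f in R/Z via forward differences, as representatives in [0, 1)."""
    values = [f(n) for n in range(s + 1)]
    coeffs = []
    for _ in range(s + 1):
        coeffs.append(values[0] % 1)
        values = [(b - a) % 1 for a, b in zip(values, values[1:])]
    return coeffs

b = taylor_mod1(phi, 2)  # lifted coefficients b_0, b_1, b_2 in [0, 1)

def psi(x):
    """Lifted polynomial map Z -> Q (a subset of R) built from the lifted coefficients."""
    return sum(bl * binom(x, l) for l, bl in enumerate(b))
```

In this example `psi(x) % 1 == phi(x)` for every integer `x`, which is the identity $\pi_{\Gamma}\operatorname{\circ}\psi=\phi$ in this toy setting; since the group is abelian of step one, no further inductive correction is needed.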

Appendix B Miscellaneous measure-theoretic results

Lemma B.1.

Let $(\Omega,\mathcal{A},\lambda)$ be a probability space, let $\mathcal{B}$ be a sub- $\sigma$ -algebra of $\mathcal{A}$ , and suppose that $S\in\mathcal{A}$ satisfies $\|1_{S}-\mathbb{E}(1_{S}|\mathcal{B})\|_{L^{2}}\leq\epsilon$ . Then $S^{\prime}=\{x\in\Omega:\mathbb{E}(1_{S}|\mathcal{B})(x)>\epsilon^{1/2}\}$ satisfies $\lambda(S\Delta S^{\prime})<5\epsilon^{1/2}$ .

Proof.

We first observe that $\lambda(S^{\prime}\setminus S)\,\epsilon^{1/2}<\int_{\Omega}(1-1_{S})\mathbb{E}(1_{S}|\mathcal{B})\,\mathrm{d}\lambda$ , which equals $\int_{\Omega}\mathbb{E}(1_{S}|\mathcal{B})-1_{S}\mathbb{E}(1_{S}|\mathcal{B})\,\mathrm{d}\lambda=\lambda(S)-\|\mathbb{E}(1_{S}|\mathcal{B})\|_{L^{2}}^{2}$ . Moreover, from the assumption and the triangle inequality we have $\|\mathbb{E}(1_{S}|\mathcal{B})\|_{L^{2}}\geq\|1_{S}\|_{L^{2}}-\epsilon$ , whence $\|\mathbb{E}(1_{S}|\mathcal{B})\|_{L^{2}}^{2}\geq\|1_{S}\|_{L^{2}}^{2}-2\epsilon=\lambda(S)-2\epsilon$ . Therefore $\lambda(S^{\prime}\setminus S)<2\epsilon^{1/2}$ .

On the other hand, we have $\lambda(S)-2\epsilon\leq\|\mathbb{E}(1_{S}|\mathcal{B})\|_{L^{2}}^{2}=\langle\mathbb{E}(1_{S}|\mathcal{B}),\mathbb{E}(1_{S}|\mathcal{B})\rangle=\langle 1_{S},\mathbb{E}(1_{S}|\mathcal{B})\rangle\leq\int_{S\cap S^{\prime}}\mathbb{E}(1_{S}|\mathcal{B})\,\mathrm{d}\lambda+\int_{S\setminus S^{\prime}}\mathbb{E}(1_{S}|\mathcal{B})\,\mathrm{d}\lambda\leq\lambda(S\cap S^{\prime})+\epsilon^{1/2}$ , so $\lambda(S^{\prime}\cap S)\geq\lambda(S)-3\epsilon^{1/2}$ , whence $\lambda(S\setminus S^{\prime})\leq 3\epsilon^{1/2}$ .

Combining the two main inequalities above, the result follows. ∎
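Lemma B.1 can be checked numerically on a finite probability space. The sketch below assumes a uniform measure on $1000$ points, a $\sigma$-algebra $\mathcal{B}$ generated by a partition into ten blocks of $100$ points, and a set $S$ equal to five blocks with one point removed; all of these choices, and the names `conditional_expectation`, `S_prime` and `sym_diff`, are hypothetical and serve only as an illustration.

```python
from math import sqrt

def conditional_expectation(indicator, blocks):
    """E(1_S | B) on a finite uniform space, where B is generated by the given partition."""
    cond = [0.0] * len(indicator)
    for block in blocks:
        avg = sum(indicator[x] for x in block) / len(block)
        for x in block:
            cond[x] = avg
    return cond

N = 1000
blocks = [list(range(100 * b, 100 * (b + 1))) for b in range(10)]
S = set(range(500)) - {0}  # five full blocks with one point removed
ind = [1.0 if x in S else 0.0 for x in range(N)]
cond = conditional_expectation(ind, blocks)

# eps is taken to be exactly the L^2 distance between 1_S and E(1_S | B).
eps = sqrt(sum((ind[x] - cond[x]) ** 2 for x in range(N)) / N)

# S' = {E(1_S | B) > eps^(1/2)} as in the statement of the lemma.
S_prime = {x for x in range(N) if cond[x] > sqrt(eps)}
sym_diff = len(S ^ S_prime) / N  # measure of the symmetric difference
```

In this example $S'$ is the union of the five blocks, so $S\Delta S'$ is a single point, well within the bound $5\epsilon^{1/2}$.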

We use this lemma to prove the following fact about mod 0 intersections of conditionally independent $\sigma$ -algebras.

Lemma B.2.

Let $(\Omega,\mathcal{A},\lambda)$ be a probability space, let $\mathcal{B}_{0},\mathcal{B}_{1}$ be sub- $\sigma$ -algebras of $\mathcal{A}$ such that $\mathcal{B}_{0}\operatorname{\perp\!\!\!\perp}_{\lambda}\mathcal{B}_{1}$ , let $S_{i}\in\mathcal{B}_{i}$ , $i=0,1$ , and suppose that $\lambda(S_{0}\Delta S_{1})\leq\epsilon$ . Then there exists $C\in\mathcal{B}_{0}\wedge\mathcal{B}_{1}$ such that $\lambda(C\Delta S_{i})\leq 10\epsilon^{1/4}$ for $i=0,1$ .

Proof.

The assumption $\|1_{S_{0}}-1_{S_{1}}\|_{L^{2}}^{2}\leq\epsilon$ implies $\|1_{S_{0}}-\mathbb{E}(1_{S_{0}}|\mathcal{B}_{1})\|_{L^{2}}\leq\|1_{S_{0}}-1_{S_{1}}\|_{L^{2}}+\|1_{S_{1}}-\mathbb{E}(1_{S_{0}}|\mathcal{B}_{1})\|_{L^{2}}\leq\epsilon^{1/2}+\|\mathbb{E}(1_{S_{1}}-1_{S_{0}}|\mathcal{B}_{1})\|_{L^{2}}\leq 2\epsilon^{1/2}$ . The assumption $\mathcal{B}_{0}\operatorname{\perp\!\!\!\perp}_{\lambda}\mathcal{B}_{1}$ implies that $\mathbb{E}(1_{S_{0}}|\mathcal{B}_{1})$ is $\mathcal{B}_{0}\wedge\mathcal{B}_{1}$ -measurable (in particular $\mathbb{E}(1_{S_{0}}|\mathcal{B}_{1})=\mathbb{E}(1_{S_{0}}|\mathcal{B}_{0}\wedge\mathcal{B}_{1})$ ). By Lemma B.1 with $\mathcal{B}=\mathcal{B}_{0}\wedge\mathcal{B}_{1}$ and $\mathcal{A}=\mathcal{B}_{0}$ , the set $C=\{x\in\Omega:\mathbb{E}(1_{S_{0}}|\mathcal{B}_{1})>(2\epsilon^{1/2})^{1/2}\}$ is in $\mathcal{B}_{0}\wedge\mathcal{B}_{1}$ and satisfies $\lambda(C\Delta S_{0})\leq 5(2\epsilon^{1/2})^{1/2}\leq 10\epsilon^{1/4}$ . Similarly, by Lemma B.1 with $\mathcal{A}=\mathcal{B}_{1}$ instead of $\mathcal{A}=\mathcal{B}_{0}$ , this set $C$ satisfies $\lambda(C\Delta S_{1})\leq 10\epsilon^{1/4}$ . ∎

We can use this lemma in turn to prove the following fact about ultraproducts of conditionally independent $\sigma$ -algebras.

Lemma B.3.

Let $(\mathbf{X},\mathcal{A},\lambda)$ be the ultraproduct of probability spaces $(X_{i},\mathcal{A}_{i},\lambda_{i})$ . For each $i$ let $\mathcal{B}_{i,0},\mathcal{B}_{i,1}$ be sub- $\sigma$ -algebras of $\mathcal{A}_{i}$ such that $\mathcal{B}_{i,0}\operatorname{\perp\!\!\!\perp}_{\lambda_{i}}\mathcal{B}_{i,1}$ . For $j=0,1$ let $\mathcal{B}_{j}$ be the Loeb $\sigma$ -algebra corresponding to the sequence $(\mathcal{B}_{i,j})_{i\in\mathbb{N}}$ , and let $\mathcal{C}$ be the Loeb $\sigma$ -algebra corresponding to $(\mathcal{B}_{i,0}\wedge_{\lambda_{i}}\mathcal{B}_{i,1})_{i\in\mathbb{N}}$ . Then $\mathcal{B}_{0}\wedge_{\lambda}\mathcal{B}_{1}=_{\lambda}\mathcal{C}$ and $\mathcal{B}_{0}\operatorname{\perp\!\!\!\perp}_{\lambda}\mathcal{B}_{1}$ .

Proof.

The inclusion $\mathcal{B}_{0}\wedge_{\lambda}\mathcal{B}_{1}\supset_{\lambda}\mathcal{C}$ is clear, for if $A\in\mathcal{C}$ then there are sets $A_{i}\in\mathcal{B}_{i,0}\wedge_{\lambda_{i}}\mathcal{B}_{i,1}$ such that $A=_{\lambda}\prod_{i\to\omega}A_{i}$, so $\prod_{i\to\omega}A_{i}$ is in $\mathcal{B}_{j}$ up to a null set, $j=0,1$, whence $A\in\mathcal{B}_{0}\wedge_{\lambda}\mathcal{B}_{1}$. For the opposite inclusion, let $Q$ be in $\mathcal{B}_{0}\wedge_{\lambda}\mathcal{B}_{1}$, so for $j=0,1$ there are sets $Q_{i,j}\in\mathcal{B}_{i,j}$ for each $i\in\mathbb{N}$ such that $Q=_{\lambda}\prod_{i\to\omega}Q_{i,j}$. Then $0=\lambda\big((\prod_{i\to\omega}Q_{i,0})\Delta(\prod_{i\to\omega}Q_{i,1})\big)=\lambda\big(\prod_{i\to\omega}(Q_{i,0}\Delta Q_{i,1})\big)$, so letting $\epsilon_{i}=\lambda_{i}(Q_{i,0}\Delta Q_{i,1})$, we have $\lim_{\omega}\epsilon_{i}=0$. By Lemma B.2, for each $i$ there is $C_{i}\in\mathcal{B}_{i,0}\wedge_{\lambda_{i}}\mathcal{B}_{i,1}$ such that $\lambda_{i}(C_{i}\Delta Q_{i,j})\leq 10\epsilon_{i}^{1/4}$ for $j=0,1$. Let $R=\prod_{i\to\omega}C_{i}$. By construction $R\in\mathcal{C}$, and by the last inequality we have $R=_{\lambda}Q$, so the required inclusion holds. Finally, the desired conclusion $\mathcal{B}_{0}\operatorname{\perp\!\!\!\perp}_{\lambda}\mathcal{B}_{1}$ can be seen to follow from $\mathcal{B}_{i,0}\operatorname{\perp\!\!\!\perp}_{\lambda_{i}}\mathcal{B}_{i,1}$, $i\in\mathbb{N}$, using the definition of conditional independence [7, Definition 2.9] and basic facts about Loeb probability spaces. More precisely, by [7, Theorem 2.4 and Remark 2.5] it suffices to show that every function $f$ in $L^{\infty}(\mathcal{B}_{1})$ satisfies $\mathbb{E}(f|\mathcal{B}_{0})=_{\lambda}\mathbb{E}(f|\mathcal{B}_{0}\wedge_{\lambda}\mathcal{B}_{1})$.
To show this, we use first that $f$ is $\lambda$ -almost-surely equal to a measurable function of the form $f^{\prime}=\lim_{\omega}f_{i}^{\prime}$ (see [35, Corollary 5.1]), and then we prove the equality $\mathbb{E}(f^{\prime}|\mathcal{B}_{0})=_{\lambda}\mathbb{E}(f^{\prime}|\mathcal{B}_{0}\wedge_{\lambda}\mathcal{B}_{1})$ , by deducing it from the fact that, by the assumption $\mathcal{B}_{i,0}\operatorname{\perp\!\!\!\perp}_{\lambda_{i}}\mathcal{B}_{i,1}$ , the analogous equality holds for the $f^{\prime}_{i}$ . This last deduction is enabled by the fact that $\mathbb{E}(\cdot|\mathcal{B}_{0})=\lim_{\omega}\mathbb{E}(\cdot|\mathcal{B}_{i,0})$ , a fact which is confirmed in a straightforward way by checking that for any function of the form $g=\lim_{\omega}g_{i}\in L^{1}(\mathcal{A})$ (with each $g_{i}$ measurable) we have that $\lim_{\omega}\mathbb{E}(g_{i}|\mathcal{B}_{i,0})$ satisfies the defining property of the conditional expectation $\mathbb{E}(g|\mathcal{B}_{0})$ , i.e. that for every $h\in L^{1}(\mathcal{B}_{0})$ we have $\int_{\mathbf{X}}h\,g\,\mathrm{d}\lambda=\int_{\mathbf{X}}h\,\lim_{\omega}\mathbb{E}(g_{i}|\mathcal{B}_{i,0})\,\mathrm{d}\lambda$ . This last equality is seen using an $S$ -integrable lifting of $h$ (see [35, Theorem 6.4]), commuting ultralimit and integrals as afforded by [35, Theorem 6.2, part 4], and basic properties of ultralimits. ∎

We also prove the following approximation result for measure-preserving group actions.

Lemma B.4.

Let $G$ be an amenable group acting on a Borel probability space $(\Omega,\mathcal{A},\lambda)$ by measure-preserving transformations, and let $S\in\mathcal{A}$ be such that for some $\epsilon>0$ we have $\lambda\big{(}S\Delta(g\cdot S)\big{)}\leq\epsilon$ for every $g\in G$ . Then there exists $S^{\prime}\in\mathcal{A}$ such that $g\cdot S^{\prime}=_{\lambda}S^{\prime}$ for all $g\in G$ and $\lambda(S\Delta S^{\prime})\leq 5\epsilon^{1/4}$ .

Proof.

We first suppose that $G$ is countable. Let $(F_{j})_{j\in\mathbb{N}}$ be a Følner sequence in $G$ and for each $j$ let $h_{j}=\mathbb{E}_{g\in F_{j}}1_{g\cdot S}$ . By the mean ergodic theorem for amenable groups [43, Theorem 2.1], letting $\mathcal{B}$ be the $\sigma$ -algebra of $G$ -invariant sets in $\mathcal{A}$ , and $f$ be a version of $\mathbb{E}(1_{S}|\mathcal{B})$ , we have $\|f-h_{j}\|_{L^{2}}\to 0$ as $j\to\infty$ . Note that for every $j$ we have $\|1_{S}-f\|_{L^{2}}\leq\|1_{S}-h_{j}\|_{L^{2}}+\|h_{j}-f\|_{L^{2}}\leq\|h_{j}-f\|_{L^{2}}+\mathbb{E}_{g\in F_{j}}\|1_{S}-1_{g\cdot S}\|_{L^{2}}\leq\|h_{j}-f\|_{L^{2}}+\epsilon^{1/2}$ , so letting $j\to\infty$ yields $\|1_{S}-f\|_{L^{2}}\leq\epsilon^{1/2}$ . By Lemma B.1, the set $S^{\prime}=\{x\in\Omega:f(x)>\epsilon^{1/4}\}$ satisfies $\lambda(S\Delta S^{\prime})\leq 5\epsilon^{1/4}$ , and since $f$ is $G$ -invariant, we have $g\cdot S^{\prime}=_{\lambda}S^{\prime}$ for every $g\in G$ .

We now reduce the general case to the countable case. It suffices to prove that if $G$ is a group acting on a separable metric space $(X,d)$ by isometries, then there is a countable group $G_{0}\leq G$ such that if $x\in X$ is a fixed point for $G_{0}$ then it is a fixed point for $G$ (we then apply this with $X$ the measure algebra of $\mathcal{A}$). Let $(x_{i})_{i}$ be a dense sequence in $X$. For each $i$, the orbit $G\cdot x_{i}$ is itself separable, so there is a countable set $S_{i}\subset G$ such that $S_{i}\cdot x_{i}$ is dense in this orbit. Let $G_{0}$ be the subgroup of $G$ generated by $\bigcup_{i}S_{i}$. Observe that for every $i\in\mathbb{N}$, $g\in G$ and $\epsilon>0$, there is $g^{\prime}\in S_{i}\subset G_{0}$ such that $d(g\cdot x_{i},g^{\prime}\cdot x_{i})<\epsilon$. Suppose for a contradiction that there is $x\in X$ that is $G_{0}$-invariant but not $G$-invariant, so there is $g\in G$ with $d(g\cdot x,x)=\epsilon>0$. Then by the density of $(x_{i})_{i}$ there is $i$ such that $d(x,x_{i})<\epsilon/100$, so $d(g\cdot x_{i},x_{i})\geq d(g\cdot x_{i},x)-d(x,x_{i})\geq d(g\cdot x,x)-d(g\cdot x_{i},g\cdot x)-d(x,x_{i})$, which by the isometry property equals $d(g\cdot x,x)-2d(x,x_{i})\geq 98\epsilon/100$. Hence $d(g\cdot x_{i},x_{i})\geq 98\epsilon/100$. By the earlier observation, there is $g^{\prime}\in G_{0}$ such that $d(g\cdot x_{i},g^{\prime}\cdot x_{i})<\epsilon/100$, so $d(g^{\prime}\cdot x_{i},x_{i})\geq d(g\cdot x_{i},x_{i})-d(g\cdot x_{i},g^{\prime}\cdot x_{i})\geq 97\epsilon/100$. Combining this last inequality with $d(x,x_{i})<\epsilon/100$ and the triangle inequality and isometry property, we deduce that $d(g^{\prime}\cdot x,x)\geq d(g^{\prime}\cdot x_{i},x_{i})-2d(x,x_{i})\geq 95\epsilon/100$, which contradicts that $x$ is $G_{0}$-invariant. ∎
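The averaging argument in the countable case of Lemma B.4 can be illustrated with a finite group, for which the constant sequence $F_{j}=G$ is a Følner sequence and the ergodic average is simply the orbit average. The following Python sketch assumes $G=\mathbb{Z}/10$ acting by rotation on the first coordinate of $\Omega=\mathbb{Z}/10\times\{0,\dots,199\}$ with the uniform measure, and a set $S$ consisting of an invariant set plus one extra point; these choices and the names `shift`, `f` and `S_prime` are hypothetical.

```python
m, T_size = 10, 200
points = [(x, t) for x in range(m) for t in range(T_size)]
N = len(points)

T = set(range(50))
# An invariant set Z/m x T together with one extra point breaking invariance.
S = {(x, t) for x in range(m) for t in T} | {(0, 120)}

def shift(A, g):
    """Image of the set A under the action of g in Z/m (rotation of the first coordinate)."""
    return {((x + g) % m, t) for (x, t) in A}

# eps bounds the measure of S symmetric-difference g.S over all g in G.
eps = max(len(S ^ shift(S, g)) / N for g in range(m))

def f(p):
    """Orbit average of the indicators 1_{g.S} at the point p, i.e. E(1_S | invariant sets)."""
    x, t = p
    return sum(1 for g in range(m) if ((x - g) % m, t) in S) / m

# S' = {f > eps^(1/4)} as in the proof.
S_prime = {p for p in points if f(p) > eps ** 0.25}
sym_diff = len(S ^ S_prime) / N
```

In this example $S'$ is exactly the invariant set $\mathbb{Z}/10\times T$, and $S\Delta S'$ is the single extra point, so $\lambda(S\Delta S')=1/2000$ lies well within the bound $5\epsilon^{1/4}$.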

Lemma B.5.

Let $\operatorname{Y}$ be a compact Polish space, let $d$ be a metric compatible with the weak topology on $\mathcal{P}(\operatorname{Y})$ , and let $(\operatorname{X}_{i},\lambda_{i})_{i\in\mathbb{N}}$ be a sequence of Borel probability spaces. For each $i\in\mathbb{N}$ let $f_{i}:\operatorname{X}_{i}\to\operatorname{Y}$ be a Borel function, and let $\omega$ be a non-principal ultrafilter on $\mathbb{N}$ . Then, letting $f=\lim_{\omega}f_{i}$ , we have $\lim_{\omega}d(\lambda_{i}\operatorname{\circ}f_{i}^{-1},\lambda\operatorname{\circ}f^{-1})=0$ .

Proof.

As shown in [29, Theorem (17.19)], one can always metrize this space of probability measures with a metric of the form $d^{\prime}(\mu,\nu)=\sum_{r\in\mathbb{N}}\tfrac{1}{2^{r}}|\int h_{r}\,\mathrm{d}\mu-\int h_{r}\,\mathrm{d}\nu|$ , for a sequence of continuous functions $h_{r}:\operatorname{Y}\to\mathbb{C}$ with $\|h_{r}\|_{\infty}\leq 1$ , $r\in\mathbb{N}$ . Since $d$ and $d^{\prime}$ metrize the same topology, it suffices to prove that $\lim_{\omega}d^{\prime}(\lambda_{i}\operatorname{\circ}f_{i}^{-1},\lambda\operatorname{\circ}f^{-1})=0$ .

Suppose for a contradiction that for some $b\in(0,1)$ and some set $S\in\omega$ , for every $i\in S$ we have $d^{\prime}(\lambda_{i}\operatorname{\circ}f_{i}^{-1},\lambda\operatorname{\circ}f^{-1})>b$ . Then, for each $i\in S$ , a short argument by contradiction shows that there exists $r=r(i)\in[1,2\lceil\log_{2}(2/b)\rceil\,]$ such that $|\int_{\operatorname{X}_{i}}h_{r}\operatorname{\circ}f_{i}\,\mathrm{d}\lambda_{i}-\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda|\geq b/2$ . Using the ultrafilter properties, we then deduce that for some fixed integer $r$ there is a set $S^{\prime}\subset S$ with $S^{\prime}\in\omega$ such that for all $i\in S^{\prime}$ we have $|\int_{\operatorname{X}_{i}}h_{r}\operatorname{\circ}f_{i}\,\mathrm{d}\lambda_{i}-\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda|\geq b/2$ . Now we have two exhaustive possibilities. The first one is that some $S^{\prime\prime}\subset S^{\prime}$ with $S^{\prime\prime}\in\omega$ satisfies $\int_{\operatorname{X}_{i}}h_{r}\operatorname{\circ}f_{i}\,\mathrm{d}\lambda_{i}\geq\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda+b/2$ for all $i\in S^{\prime\prime}$ ; but then, commuting ultralimit and integrals (as in the proof of Lemma B.3), we obtain $\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda=\lim_{\omega}\int_{\operatorname{X}_{i}}h_{r}\operatorname{\circ}f_{i}\,\mathrm{d}\lambda_{i}\geq\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda+b/2>\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda$ , a contradiction. 
The other option is that some $S^{\prime\prime}\subset S^{\prime}$ with $S^{\prime\prime}\in\omega$ satisfies $\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda\geq$ $\int_{\operatorname{X}_{i}}h_{r}\operatorname{\circ}f_{i}\,\mathrm{d}\lambda_{i}+b/2$ for all $i\in S^{\prime\prime}$ ; then we deduce similarly that $\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda$ $=\lim_{\omega}\int_{\operatorname{X}_{i}}h_{r}\operatorname{\circ}f_{i}\,\mathrm{d}\lambda_{i}\leq\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda-b/2<\int_{\operatorname{X}}h_{r}\operatorname{\circ}f\,\mathrm{d}\lambda$ , obtaining again a contradiction. ∎

We finish with a lemma concerning the interaction of the Loeb-measure construction with products, when the underlying measures are couplings on Borel probability spaces.

Lemma B.6.

Let $(\operatorname{X}_{i})_{i\in\mathbb{N}}$ , $(\operatorname{Y}_{i})_{i\in\mathbb{N}}$ be sequences of Polish spaces, and for each $i\in\mathbb{N}$ let $\mu_{i}$ be a Borel probability measure on $\mathcal{B}(\operatorname{X}_{i})$ and $\nu_{i}$ be a Borel probability measure on $\mathcal{B}(\operatorname{X}_{i})\otimes\mathcal{B}(\operatorname{Y}_{i})$ . Let $(\mathbf{X},\mathcal{L}_{\mathbf{X}},\mu)$ , $(\mathbf{X}\times\mathbf{Y},\mathcal{L}_{\mathbf{X}\times\mathbf{Y}},\nu)$ be the corresponding Loeb probability spaces. Suppose that the projection $\pi_{i}:\operatorname{X}_{i}\times\operatorname{Y}_{i}\to\operatorname{X}_{i}$ , $(x,y)\mapsto x$ is measure preserving for every $i\in\mathbb{N}$ . Then the projection $\pi:\mathbf{X}\times\mathbf{Y}\to\mathbf{X}$ , $(x,y)\mapsto x$ is measurable with respect to $\mathcal{L}_{\mathbf{X}}$ , $\mathcal{L}_{\mathbf{X}\times\mathbf{Y}}$ , and is measure-preserving with respect to $\mu,\nu$ .

Proof.

The preimage under $\pi$ of any internal measurable set in $\mathbf{X}$ is an internal measurable set in $\mathbf{X}\times\mathbf{Y}$, and it is also clear that if $A$ is an internal measurable subset of $\mathbf{X}$ then $\nu\operatorname{\circ}\pi^{-1}(A)=\mu(A)$. (These claims follow from the fact that the projections $\pi_{i}$ are measure-preserving maps and that taking ultraproducts commutes with taking preimages under the projections.) Now $\mathcal{L}_{\mathbf{X}}$ consists precisely of sets $S$ such that for every $\epsilon>0$ there exist internal measurable sets $A_{i},A_{o}\subset\mathbf{X}$ with $A_{i}\subset S\subset A_{o}$ and $\mu(A_{o}\setminus A_{i})<\epsilon$ [35, §2.1]. This combined with the properties already established for $\pi$ for internal sets implies that $\pi^{-1}(\mathcal{L}_{\mathbf{X}})\subset\mathcal{L}_{\mathbf{X}\times\mathbf{Y}}$ and $\nu\operatorname{\circ}\pi^{-1}=\mu$, as required. ∎

Bibliography

[1] V. Bergelson, T. Tao, T. Ziegler, An inverse theorem for the uniformity seminorms associated with the action of $\mathbb{F}_{p}^{\infty}$, Geom. Funct. Anal. 19 (2010), no. 6, 1539–1596.
[2] O. A. Camarena, B. Szegedy, Nilspaces, nilmanifolds and their morphisms, preprint. arXiv:1009.3825
[3] P. Candela, Notes on nilspaces: algebraic aspects, Discrete Analysis, 2017, Paper No. 15, 59 pp.
[4] P. Candela, Notes on compact nilspaces, Discrete Analysis, 2017, Paper No. 16, 57 pp.
[5] P. Candela, D. González-Sánchez, B. Szegedy, On nilspace systems and their morphisms, Ergodic Theory Dynam. Systems 40 (2020), no. 11, 3015–3029.
[6] P. Candela, O. Sisask, Convergence results for systems of linear forms on cyclic groups, and periodic nilsequences, SIAM J. Discrete Math. 28 (2014), no. 2, 786–810.
[7] P. Candela, B. Szegedy, Nilspace factors for general uniformity seminorms, cubic exchangeability and limits, to appear in Mem. Amer. Math. Soc. arXiv:1803.08758
[8] N. J. Cutland, Nonstandard measure theory and its applications, Bull. London Math. Soc. 15 (1983), no. 6, 529–589.
