Fourier decay for self-similar measures

Boris Solomyak

arXiv:1906.12164 · math.DS · June 23, 2020

TL;DR

This paper proves that, outside an exceptional set of parameters of zero Hausdorff dimension, self-similar measures on the line have power decay of the Fourier transform at infinity, extending the classical homogeneous results to the non-homogeneous case, where the contraction ratios are not all equal.

Contribution

It establishes Fourier decay for a broad class of self-similar measures, including non-homogeneous cases, outside a zero Hausdorff dimension set of parameters.

Findings

01. Outside an exceptional set of contraction ratios, all self-similar measures on the line have power decay of the Fourier transform at infinity.

02. The exceptional set of parameters has zero Hausdorff dimension.

03. The classical Erdős-Kahane result for the homogeneous case is extended to non-homogeneous self-similar measures.

Abstract

We prove that, after removing a zero Hausdorff dimension exceptional set of parameters, all self-similar measures on the line have a power decay of the Fourier transform at infinity. In the homogeneous case, when all contraction ratios are equal, this is essentially due to Erdős and Kahane. In the non-homogeneous case the difficulty we have to overcome is the apparent lack of convolution structure.

Equations

$$\widehat{\mu}(t)=\int_{\mathbb{R}}e^{itx}\,d\mu(x).$$

$${\mathcal{D}}(\alpha)=\bigl\{\nu\ \text{finite positive measure on }{\mathbb{R}}:\ \left|\widehat{\nu}(t)\right|=O(|t|^{-\alpha}),\ \ |t|\to\infty\bigr\},$$

$$\mu=\sum_{j=1}^{m}q_{j}\,(\mu\circ f_{j}^{-1}),\quad\text{where}\ \ f_{j}(x)=\lambda_{j}x+b_{j},$$

$$0<C_{2}^{-1}\leq\min_{j}\lambda_{j}\leq\max_{j}\lambda_{j}\leq C_{1}^{-1}<1,$$

$$\widehat{\mu}_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}(t)=O\bigl(|\log|t||^{-\alpha}\bigr),\quad|t|\to\infty.$$

$$\bigl\{t\in[-T,T]:\ |\widehat{\mu}(t)|\geq T^{-\delta}\bigr\}$$

$${\mathcal{F}}_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}=\bigl\{\gamma_{j}x+a_{k}^{(j)}:\ j=1,\ldots,d;\ 1\leq k\leq k_{j}\bigr\},\quad\boldsymbol{p}=\bigl(p_{k}^{(j)}:\ j\leq d;\ 1\leq k\leq k_{j}\bigr),$$

$$0<B_{2}^{-1}\leq\gamma_{\min}<\gamma_{\max}\leq B_{1}^{-1}<1.$$

$$B_{1}^{s}>d.$$

$$a_{2}^{(1)}-a_{1}^{(1)}=\pi,$$

$$d=\binom{\ell+m-1}{m-1},$$

$$f_{1}^{\ell-1}f_{2}(x)=\lambda_{1}^{\ell-1}(\lambda_{2}x+b)\quad\text{and}\quad f_{2}f_{1}^{\ell-1}(x)=\lambda_{2}\lambda_{1}^{\ell-1}x+b.$$

$$B_{1}=C_{1}^{\ell},\quad B_{2}=C_{2}^{\ell},$$

$$C_{1}^{\ell s}>d\iff\ell>\frac{\log d}{s\log C_{1}}.$$

$$\ell>\frac{m\log(\ell+m)}{s\log C_{1}},$$

$$\mu=\sum_{j=1}^{d}\sum_{k=1}^{k_{j}}p_{k}^{(j)}\bigl(\mu\circ(f_{k}^{(j)})^{-1}\bigr),\quad\text{where}\ \ f_{k}^{(j)}(x)=\gamma_{j}x+a_{k}^{(j)}.$$

$$\widehat{\mu}(t)=\sum_{j=1}^{d}\Bigl(\sum_{k=1}^{k_{j}}p_{k}^{(j)}e^{ia_{k}^{(j)}t}\Bigr)\widehat{\mu}(\gamma_{j}t).$$

$$|\widehat{\mu}(t)|\leq\Bigl(\bigl|p_{1}^{(1)}+p_{2}^{(1)}e^{i(a_{2}^{(1)}-a_{1}^{(1)})t}\bigr|+\sum_{k=3}^{k_{1}}p_{k}^{(1)}\Bigr)\cdot|\widehat{\mu}(\gamma_{1}t)|+\sum_{j=2}^{d}\Bigl(\sum_{k=1}^{k_{j}}p_{k}^{(j)}\Bigr)|\widehat{\mu}(\gamma_{j}t)|.$$

$$p_{j}:=\sum_{k=1}^{k_{j}}p_{k}^{(j)},\quad j=1,\ldots,d.$$

$$|1+e^{\pi iz}|\leq 2\Bigl(1-\frac{\pi}{4}z^{2}\Bigr)\quad\text{for}\ \ z\in[-1/2,1/2].$$

$$|\widehat{\mu}(t)|\leq p_{1}\Bigl(1-\frac{\pi{\varepsilon}}{2}\|t\|^{2}\Bigr)|\widehat{\mu}(\gamma_{1}t)|+\sum_{j=2}^{d}p_{j}|\widehat{\mu}(\gamma_{j}t)|,$$

$$\boldsymbol{\gamma}^{\boldsymbol{n}}=\prod_{j=1}^{d}\gamma_{j}^{n_{j}},\quad{\bf p}^{\boldsymbol{n}}=\prod_{j=1}^{d}p_{j}^{n_{j}},$$

$$|\widehat{\mu}(t)|\leq\sum_{w\in{\mathcal{A}}^{N}}{\bf p}^{\ell(w)}|\widehat{\mu}(\boldsymbol{\gamma}^{\ell(w)}t)|\prod_{i:\ w_{i}=1}\Bigl(1-\frac{\pi{\varepsilon}}{2}\|\boldsymbol{\gamma}^{\ell(w[1,i-1])}t\|^{2}\Bigr).$$

$$\|\boldsymbol{\gamma}^{\boldsymbol{n}}t\|\geq\rho$$

$$\#\bigl\{d+1\leq i\leq N-d-1:\ \ell(w[1,i])\ \text{is ``on a }(\boldsymbol{\gamma},t,\rho)\text{-good track''}\bigr\}\leq\frac{N}{k_{1}}.$$

$$\rho:=\frac{1}{4(1+B_{2})(1+3B_{2})}.$$

$$X_{N}^{(d)}\geq\frac{N}{(d+1)k_{1}}.$$

$${\mathbb{P}}\,(Y_{N}<\delta N)\leq C^{\prime}\exp(-cN).$$

$$|\widehat{\mu}(t)|\leq\Bigl(1-\frac{\pi{\varepsilon}}{2}\rho^{2}\Bigr)^{\delta N}+C^{\prime}\exp(-cN).$$

$${\mathbb{P}}\,\bigl(X_{N-1}^{(r)}\leq\delta_{r}N\bigr)\leq C_{r}^{\prime}\cdot\exp(-c_{r}N).$$

Full text

Fourier decay for self-similar measures

BORIS SOLOMYAK

Boris Solomyak, Department of Mathematics, Bar-Ilan University, Ramat Gan, Israel

Abstract.

We prove that, after removing a zero Hausdorff dimension exceptional set of parameters, all self-similar measures on the line have a power decay of the Fourier transform at infinity. In the homogeneous case, when all contraction ratios are equal, this is essentially due to Erdős and Kahane. In the non-homogeneous case the difficulty we have to overcome is the apparent lack of convolution structure.

Supported in part by the Israel Science Foundation grant 396/15.

1. Introduction

For a finite positive Borel measure $\mu$ on ${\mathbb{R}}$ , consider the Fourier transform

$$\widehat{\mu}(t)=\int_{\mathbb{R}}e^{itx}\,d\mu(x).$$

The behavior of the Fourier transform at infinity is an important issue in many areas of mathematics. The measure $\mu$ is called a Rajchman measure if $\lim_{|t|\to\infty}\widehat{\mu}(t)=0$ . The Riemann-Lebesgue lemma says that absolutely continuous measures are Rajchman, but which singular measures are Rajchman is a subtle question with a long history, see [23]. For many purposes simple convergence of $\widehat{\mu}(t)$ to zero is not enough, and some quantitative decay is needed.

Definition 1.1.

For $\alpha>0$ let

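$${\mathcal{D}}(\alpha)=\bigl\{\nu\ \text{finite positive measure on }{\mathbb{R}}:\ \left|\widehat{\nu}(t)\right|=O(|t|^{-\alpha}),\ \ |t|\to\infty\bigr\},$$

and denote ${\mathcal{D}}=\bigcup_{\alpha>0}{\mathcal{D}}(\alpha)$ . A measure $\nu$ is said to have power Fourier decay if $\nu\in{\mathcal{D}}$ .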

This property has a number of applications: for instance, if $\mu$ has power Fourier decay, then $\mu$ -almost every number is normal to any base, see [7, 29], and the support of $\mu$ has positive Fourier dimension, see [24].

In this paper we focus on the most basic class of “fractal measures,” namely, self-similar measures on the line.

Definition 1.2.

Let $m\geq 2$ , $\boldsymbol{\lambda}=(\lambda_{1},\ldots,\lambda_{m})\in(0,1)^{m}$ , $\boldsymbol{b}=(b_{1},\ldots,b_{m})\in{\mathbb{R}}^{m}$ , and let $\boldsymbol{q}=(q_{1},\ldots,q_{m})$ be a probability vector. The Borel probability measure $\mu=\mu_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}$ on ${\mathbb{R}}$ satisfying

$$\mu=\sum_{j=1}^{m}q_{j}\,(\mu\circ f_{j}^{-1}),\quad\text{where}\ \ f_{j}(x)=\lambda_{j}x+b_{j},$$

is called self-similar, or invariant, for the iterated function system (IFS) $\{f_{j}\}_{j=1}^{m}$ , with the probability vector $\boldsymbol{q}$ . It is well-known that there exists a unique such measure [12]. We assume that the fixed points ${\rm Fix}(f_{j})=\frac{b_{j}}{1-\lambda_{j}}$ are not all equal (otherwise, the measure $\mu$ is a point mass) and call the corresponding pairs $(\boldsymbol{\lambda},\boldsymbol{b})$ non-trivial. We write $\boldsymbol{q}>0$ if $q_{j}>0$ for all $j$ .
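To make Definition 1.2 concrete, here is a minimal Python sketch that draws approximate samples from a self-similar measure by composing randomly chosen maps of the IFS. The function names, the truncation depth and the example parameters in the usage note are illustrative assumptions, not part of the paper. The sample mean can be compared with the exact mean, which follows from integrating $x$ against the invariance equation: $E=\sum_{j}q_{j}(\lambda_{j}E+b_{j})$ .

```python
import random


def sample_self_similar(lam, b, q, depth=60, rng=random):
    """Draw one approximate sample from the self-similar measure.

    Applies f_{w_1} o ... o f_{w_depth} to the point 0, where the indices
    w_i are chosen independently with probabilities q. The truncation error
    is of order max(lam) ** depth.
    """
    indices = rng.choices(range(len(lam)), weights=q, k=depth)
    x = 0.0
    # The innermost map is applied first, so walk through the word in reverse.
    for j in reversed(indices):
        x = lam[j] * x + b[j]
    return x


def exact_mean(lam, b, q):
    """Mean of mu, solved from E = sum_j q_j * (lam_j * E + b_j)."""
    drift = sum(qj * bj for qj, bj in zip(q, b))
    contraction = sum(qj * lj for qj, lj in zip(q, lam))
    return drift / (1.0 - contraction)
```

For instance, with `lam = [0.3, 0.5]`, `b = [0.0, 1.0]` and `q = [0.5, 0.5]` (hypothetical values), the fixed points are $0$ and $2$ , so every sample lies in $[0,2]$ and the empirical mean should be close to `exact_mean(lam, b, q)`.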

Theorem 1.3.

For $m\geq 2$ , there exists a set ${\mathcal{E}}$ of zero Hausdorff dimension in $(0,1)^{m}$ such that for all $\boldsymbol{\lambda}\in(0,1)^{m}\setminus{\mathcal{E}}$ , for all non-trivial $(\boldsymbol{\lambda},\boldsymbol{b})$ and for all $\boldsymbol{q}>0$ we have $\mu_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}\in{\mathcal{D}}$ .

The theorem is an immediate consequence of the following:

Theorem 1.4.

Fix $m\geq 2$ , $1<C_{1}<C_{2}<\infty$ and $\epsilon,s>0$ . Then there exist $\alpha>0$ and $\widetilde{{\mathcal{E}}}\subset(0,1)^{m}$ , depending on these parameters, such that $\dim_{H}(\widetilde{{\mathcal{E}}})\leq s$ and for all $\boldsymbol{\lambda}\in(0,1)^{m}\setminus\widetilde{{\mathcal{E}}}$ , with

$$0<C_{2}^{-1}\leq\min_{j}\lambda_{j}\leq\max_{j}\lambda_{j}\leq C_{1}^{-1}<1,$$

$(\boldsymbol{\lambda},\boldsymbol{b})$ non-trivial, and $\boldsymbol{q}$ satisfying $\min_{j}q_{j}\geq\epsilon$ , we have ${\mu}_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}\in{\mathcal{D}}(\alpha)$ .

We do not attempt to give specific quantitative estimates of the decay rate, although in principle, this is possible. Our proof gives extremely slow power decay.

Theorem 1.3 should be compared with a recent result of Li and Sahlsten, which was an inspiration for us.

Theorem 1.5 ([22]).

Let ${\mu}_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}$ be a self-similar measure with non-trivial $(\boldsymbol{\lambda},\boldsymbol{b})$ and $\boldsymbol{q}>0$ .

(i) If there exist $i\neq j$ such that $\log\lambda_{i}/\log\lambda_{j}$ is irrational, then $\widehat{\mu}_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}(t)\to 0$ as $|t|\to\infty$ .

(ii) If $\log\lambda_{i}/\log\lambda_{j}$ is Diophantine for some $i\neq j$ , then there exists $\alpha>0$ such that

$$\widehat{\mu}_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}(t)=O\bigl(|\log|t||^{-\alpha}\bigr),\quad|t|\to\infty.$$

The methods are quite different: [22] uses an approach based on renewal theory, whereas we develop a multi-parameter generalization of the so-called “Erdős-Kahane argument”.

1.1. Background

The best-known case is homogeneous, when all the contraction ratios are equal: $\lambda_{j}=\lambda,\ j\leq m$ . There is a vast literature devoted to it, so we will be brief. An important class of examples is the family of Bernoulli convolutions $\nu_{\lambda}$ , which is defined as the invariant measure for the IFS $\{\lambda x-1,\lambda x+1\}$ , with $\lambda\in(0,1)$ and probabilities $\{\frac{1}{2},\frac{1}{2}\}$ . One of the original motivations for studying $\widehat{\nu}_{\lambda}$ was the problem: for which $\lambda\in(\frac{1}{2},1)$ is $\nu_{\lambda}$ singular/absolutely continuous? (It follows from the “Law of Pure Type” that $\nu_{\lambda}$ cannot be of mixed type [13].) Erdős [8] proved that $\widehat{\nu}_{\lambda}(t)\not\to 0$ as $t\to\infty$ when $\theta=1/\lambda$ is a Pisot number, hence the corresponding $\nu_{\lambda}$ is singular. Recall that a Pisot number is an algebraic integer greater than one whose algebraic (Galois) conjugates are all less than one in modulus. Later Salem [32] showed that if $1/\lambda$ is not a Pisot number, then $\widehat{\nu}_{\lambda}(t)\to 0$ as $t\to\infty$ , thus providing a characterization of Rajchman Bernoulli convolution measures. In spite of the recent breakthrough results, see [11, 33, 34, 39, 38], the original problem of absolute continuity/singularity for $\nu_{\lambda}$ is still open.
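The Pisot phenomenon can be observed numerically. The sketch below assumes the standard infinite product formula $\widehat{\nu}_{\lambda}(t)=\prod_{n\geq 0}\cos(\lambda^{n}t)$ for the IFS $\{\lambda x-1,\lambda x+1\}$ with equal weights, truncates the product after a fixed number of factors (an arbitrary choice), and evaluates it along $t=\pi\theta^{k}$ for the golden ratio $\theta$ , which is a Pisot number. The function names are hypothetical.

```python
import math

GOLDEN = (1 + math.sqrt(5)) / 2  # Pisot number: root of x^2 - x - 1


def bernoulli_convolution_ft(t, lam, terms=400):
    """Truncated product prod_{n=0}^{terms-1} cos(lam^n * t).

    nu_lambda is the law of sum_{n>=0} +/- lam^n with independent fair signs,
    so its Fourier transform is the infinite product of these cosines.
    """
    value = 1.0
    scale = t
    for _ in range(terms):
        value *= math.cos(scale)
        scale *= lam
    return value


def pisot_samples(first_power=10, last_power=30):
    """|nu_hat(pi * theta^k)| for lambda = 1/theta, theta the golden ratio."""
    lam = 1 / GOLDEN
    return [
        abs(bernoulli_convolution_ft(math.pi * GOLDEN**k, lam))
        for k in range(first_power, last_power + 1)
    ]
```

Along this sequence of frequencies the factors are $\cos(\pi\theta^{j})$ with $\theta^{j}$ exponentially close to an integer for large $j$ , so the values returned by `pisot_samples()` stay bounded away from zero instead of decaying.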

The first non-trivial result on absolute continuity of $\nu_{\lambda}$ was obtained by Erdős [9] in 1940. In fact, he proved that for any $[a,b]\subset(0,1)$ there exists $\alpha>0$ such that $\nu_{\lambda}\in{\mathcal{D}}(\alpha)$ for a.e. $\lambda\in[a,b]$ . Using this and the convolution structure of $\nu_{\lambda}$ , he deduced that $\nu_{\lambda}$ is absolutely continuous for a.e. $\lambda$ sufficiently close to 1. Later, Kahane [16] realized that Erdős’ argument actually gives that $\nu_{\lambda}\in{\mathcal{D}}$ for all $\lambda\in(0,1)$ outside a set of zero Hausdorff dimension. (We should mention that only very few specific $\lambda$ are known, for which $\nu_{\lambda}$ has power Fourier decay, found by Dai, Feng, and Wang [6].) The Erdős-Kahane result plays an important role in the proof of absolute continuity for all $\lambda\in(\frac{1}{2},1)$ outside of a zero Hausdorff dimension set by Shmerkin [33, 34]. The general homogeneous case is treated analogously to Bernoulli convolutions: the self-similar measure is still an infinite convolution and most of the arguments go through with minor modifications, see [6, 36]. An exposition of the “Erdős-Kahane argument” with quantitative estimates was given in [28], and then extended and generalized. Its variants were used in a number of recent papers in fractal geometry and dynamical systems, among them [36, 35, 10, 30, 15, 3, 4].

In the non-homogeneous case not all contraction ratios are the same and the self-similar measure is not a convolution, which makes its study more difficult. First results on absolute continuity were obtained by Neunhäuserer [26] and Ngai and Wang [27]. In [30], joint with Saglietti and Shmerkin, we proved that, given a probability vector $\boldsymbol{q}>0$ and vector of translations $\boldsymbol{b}\in{\mathbb{R}}^{m}$ , with all components distinct, for a.e. $\boldsymbol{\lambda}\in(0,1)^{m}$ in the “natural” parameter region (which depends on $\boldsymbol{q}$ ), the self-similar measure $\mu_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}$ is absolutely continuous. The proof was based on a decomposition of the self-similar measure into an integral of measures having a convolution structure, that are only statistically self-similar. A variant of the Erdős-Kahane argument was used to establish power Fourier decay for the latter (for all but a zero-dimensional set of parameters), but this was not sufficient to deduce any Fourier decay for the original self-similar measure. The methods of [30] were pushed further by Käenmäki and Orponen [15], but again, no conclusion was made for the Fourier decay of non-homogeneous self-similar measures.

We note (thanks to Pablo Shmerkin for bringing this to my attention) that a measure may have power decay outside of a sparse set of frequencies, even if it is not a Rajchman measure. In fact, Kaufman [17] (in the homogeneous case) and Tsujii [37] (in the non-homogeneous case) proved that for any non-trivial self-similar measure $\mu$ on the real line, for any ${\varepsilon}>0$ there exists $\delta>0$ such that the set

$$\bigl\{t\in[-T,T]:\ |\widehat{\mu}(t)|\geq T^{-\delta}\bigr\}$$

can be covered by $T^{\varepsilon}$ intervals of length $1$ . Mosquera and Shmerkin [25] made the dependence of $\delta$ on ${\varepsilon}$ quantitative in the homogeneous case. The papers [17, 25] use a version of the Erdős-Kahane argument, whereas the proof in [37] is based on large deviation estimates.

The study of Fourier decay for other classes of dynamically defined measures has been quite active recently. We only mention a few papers, without an attempt to be comprehensive. Jordan and Sahlsten [14] obtained power Fourier decay for Gibbs measures for the Gauss map, using methods from dynamics and number theory. Bourgain and Dyatlov [2] established Fourier decay for Patterson-Sullivan measures associated to a convex co-compact Fuchsian group, using methods from additive combinatorics; see also [31, 20]. Li [19] proved that the stationary measure for a random walk on $SL_{2}({\mathbb{R}})$ has power decay, when the support of the driving measure generates a Zariski dense subgroup, following his earlier work [18] showing that such a measure is Rajchman. He initiated the approach based on renewal theory, which was later used by Li and Sahlsten [22] to prove Theorem 1.5. Recently the same authors extended their result to a class of self-affine measures in ${\mathbb{R}}^{d}$ in [21].

The rest of the paper is devoted to the proof of Theorem 1.4. As already mentioned, it is based on a generalization of the Erdős-Kahane argument, but there are many new features, mainly because we have to deal with multi-parameter families.

2. Reduction

In view of $(\boldsymbol{\lambda},\boldsymbol{b})$ being non-trivial, by a linear change of variable we can fix two translation parameters, for instance, $b_{1}=0$ and $b_{2}>0$ arbitrary (it can even depend on the other parameters; this would only change the scale on the $t$ -axis, but would not affect the rate of decay of the Fourier transform). After that, we will pass to the $\ell$ -th iterate of the IFS, which preserves the invariant measure. The reason for doing this is to obtain an IFS with many maps having the same contraction ratio (in fact, the number of maps, $m^{\ell}$ , grows exponentially with $\ell$ , whereas the number of distinct contraction ratios grows polynomially). In this sense, the proof resembles the strategy of the proof in [30], although in other aspects it is very different. We now formulate the main technical result.

Theorem 2.1.

Let $d\geq 2$ , and consider the IFS

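$${\mathcal{F}}_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}=\bigl\{\gamma_{j}x+a_{k}^{(j)}:\ j=1,\ldots,d;\ 1\leq k\leq k_{j}\bigr\},\quad\boldsymbol{p}=\bigl(p_{k}^{(j)}:\ j\leq d;\ 1\leq k\leq k_{j}\bigr),\tag{2.1}$$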

where $\boldsymbol{\gamma}=(\gamma_{1},\ldots,\gamma_{d})$ is the set of distinct contraction ratios, $k_{1}\geq 2$ (so the number of maps in the IFS is strictly greater than $d$ ), $\boldsymbol{a}$ is a vector of translations, and $\boldsymbol{p}$ is a probability vector. Let $\mu_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}$ be the corresponding self-similar measure. Fix $\epsilon>0$ and $1<B_{1}<B_{2}<\infty$ . Assume that

$$0<B_{2}^{-1}\leq\gamma_{\min}<\gamma_{\max}\leq B_{1}^{-1}<1.\tag{2.2}$$

Let $s>0$ be such that

$$B_{1}^{s}>d.\tag{2.3}$$

Then there exist $\alpha>0$ and ${\mathcal{E}}^{\prime}\subset(0,1)^{d}$ , depending on $d,B_{1},B_{2},s,{\varepsilon}$ , such that $\dim_{H}({\mathcal{E}}^{\prime})\leq s$ and for all $\boldsymbol{\gamma}\in(0,1)^{d}\setminus{\mathcal{E}}^{\prime}$ , satisfying (2.2), for all $\boldsymbol{a}$ such that

$$a_{2}^{(1)}-a_{1}^{(1)}=\pi,\tag{2.4}$$

and all $\boldsymbol{p}$ such that $\min_{j}p_{j}\geq\epsilon$ , we have ${\mu}_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}\in{\mathcal{D}}(\alpha)$ .

Derivation of Theorem 1.4 from Theorem 2.1.

As already mentioned, we may assume that the original IFS $\{\lambda_{j}x+b_{j}\}_{j=1}^{m}$ has $f_{1}(x)=\lambda_{1}x,\ f_{2}(x)=\lambda_{2}x+b$ , with $b>0$ arbitrary (we do not exclude the case $\lambda_{1}=\lambda_{2}$ ). Passing to the $\ell$ -th iterate, we obtain an IFS with the number of maps equal to $m^{\ell}$ and the number of distinct contractions less than or equal to

$$d=\binom{\ell+m-1}{m-1},$$

which is the number of ways to write $\ell$ as a sum of $m$ non-negative integers. Among the maps of the new IFS there are

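$$f_{1}^{\ell-1}f_{2}(x)=\lambda_{1}^{\ell-1}(\lambda_{2}x+b)\quad\text{and}\quad f_{2}f_{1}^{\ell-1}(x)=\lambda_{2}\lambda_{1}^{\ell-1}x+b.$$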

This way, we can let $\gamma_{1}=\lambda_{2}\lambda^{\ell-1}_{1}$ , $a_{1}^{(1)}=\lambda_{1}^{\ell-1}b,\ a_{2}^{(1)}=b$ , so that $a_{2}^{(1)}-a_{1}^{(1)}=b(1-\lambda_{1}^{\ell-1})$ , and choose $b=\pi(1-\lambda_{1}^{\ell-1})^{-1}$ to satisfy (2.4). Denote the new IFS by ${\mathcal{F}}_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}$ . Since the invariant measure remains unchanged when we pass to a higher iterate of the IFS, we have $\mu_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}=\mu_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}$ . The bounds for inverses of the contraction ratios of ${\mathcal{F}}_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}$ are

$$B_{1}=C_{1}^{\ell},\quad B_{2}=C_{2}^{\ell},$$

and the probabilities satisfy $\min_{j,k}p_{k}^{(j)}\geq(\min_{i}q_{i})^{\ell}\geq\epsilon^{\ell}$ . In order to satisfy (2.3), we need

$$C_{1}^{\ell s}>d\iff\ell>\frac{\log d}{s\log C_{1}}.$$

Since $d<(\ell+m)^{m}$ , it is enough to choose $\ell\geq 2$ so that

$$\ell>\frac{m\log(\ell+m)}{s\log C_{1}},$$

which is certainly possible. Now we apply Theorem 2.1 and obtain an exceptional set ${\mathcal{E}}^{\prime}\subset(0,1)^{d}$ of Hausdorff dimension $\leq s$ , such that for all $\boldsymbol{\gamma}\in{[C_{2}^{-\ell},C_{1}^{-\ell}]}^{d}\setminus{\mathcal{E}}^{\prime}$ , for all vectors of translations $\boldsymbol{a}$ satisfying (2.4) and all probability vectors $\boldsymbol{p}$ with $\min_{j,k}p_{k}^{(j)}\geq\epsilon^{\ell}$ , we have $\mu_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}\in{\mathcal{D}}(\alpha)$ , for some $\alpha=\alpha(d,B_{1},B_{2},\epsilon^{\ell},s)$ .

It remains to observe that we can recover $\boldsymbol{\lambda}$ from $\boldsymbol{\gamma}$ via a function which does not increase Hausdorff dimension. For instance, among the contraction ratios of ${\mathcal{F}}_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}$ there are $\lambda_{1}^{\ell},\ldots,\lambda_{m}^{\ell}$ . We can project $\boldsymbol{\gamma}$ to these coordinates and then take the $\ell$ -th root component-wise, to obtain $\boldsymbol{\lambda}$ . This map is Lipschitz outside of the neighborhood of zero of radius $C_{2}^{-\ell}$ . We obtain an exceptional set $\widetilde{{\mathcal{E}}}\subset(0,1)^{m}$ of $\boldsymbol{\lambda}$ as an image of ${\mathcal{E}}^{\prime}$ under this map, and $\dim_{H}(\widetilde{\mathcal{E}})\leq\dim_{H}({\mathcal{E}}^{\prime})\leq s$ . For all $\boldsymbol{\lambda}\in{[C_{2}^{-1},C_{1}^{-1}]}^{m}\setminus\widetilde{\mathcal{E}}$ , all $\boldsymbol{b}$ satisfying $b_{1}=0,b_{2}=\pi(1-\lambda_{1}^{\ell-1})^{-1}$ , and all probability vectors $\boldsymbol{q}$ , with $\min_{i}q_{i}\geq\epsilon$ , we have $\mu_{\boldsymbol{\lambda},\boldsymbol{b}}^{\boldsymbol{q}}=\mu_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}\in{\mathcal{D}}(\alpha)$ . This completes the derivation. ∎
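The bookkeeping in this reduction can be checked on a small example. The following Python sketch (function name and parameter values are assumptions for illustration) enumerates the $m^{\ell}$ maps of the $\ell$ -th iterate, records for each one how many times every index occurs, and thereby counts the distinct contraction ratios. It also exposes the two maps $f_{1}^{\ell-1}f_{2}$ and $f_{2}f_{1}^{\ell-1}$ , which share a contraction ratio and whose translations differ by $b(1-\lambda_{1}^{\ell-1})$ .

```python
import math
from itertools import product


def iterate_ifs(lam, b, ell):
    """Maps of the ell-th iterate of the IFS {lam[j] * x + b[j]}.

    Returns a dict sending a word (j_1, ..., j_ell), which stands for
    f_{j_1} o ... o f_{j_ell}, to (exponents, ratio, translation), where
    exponents[j] counts the occurrences of index j in the word.
    """
    m = len(lam)
    maps = {}
    for word in product(range(m), repeat=ell):
        ratio, shift = 1.0, 0.0
        # Compose from the innermost map outwards.
        for j in reversed(word):
            ratio, shift = lam[j] * ratio, lam[j] * shift + b[j]
        exponents = tuple(word.count(j) for j in range(m))
        maps[word] = (exponents, ratio, shift)
    return maps


def count_exponent_patterns(maps):
    """Number of distinct exponent vectors, an upper bound for the number of distinct ratios."""
    return len({exponents for exponents, _, _ in maps.values()})
```

With $m=3$ and $\ell=4$ there are $81$ maps but only $\binom{6}{2}=15$ exponent patterns. Choosing $b_{1}=0$ and $b_{2}=\pi(1-\lambda_{1}^{\ell-1})^{-1}$ makes the translations of the words `(1, 0, 0, 0)` and `(0, 0, 0, 1)` differ by exactly $\pi$ , as required in (2.4).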

The rest of the paper is devoted to the proof of Theorem 2.1.

3. Beginning of the Proof

We consider the Fourier transform $\widehat{\mu}(t)=\int_{\mathbb{R}}e^{itx}\,d\mu(x)$ , where $\mu=\mu_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}$ is the invariant measure for the IFS (2.1), that is,

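$$\mu=\sum_{j=1}^{d}\sum_{k=1}^{k_{j}}p_{k}^{(j)}\bigl(\mu\circ(f_{k}^{(j)})^{-1}\bigr),\quad\text{where}\ \ f_{k}^{(j)}(x)=\gamma_{j}x+a_{k}^{(j)}.$$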

It follows that

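$$\widehat{\mu}(t)=\sum_{j=1}^{d}\Bigl(\sum_{k=1}^{k_{j}}p_{k}^{(j)}e^{ia_{k}^{(j)}t}\Bigr)\widehat{\mu}(\gamma_{j}t).$$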
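The functional equation for $\widehat{\mu}$ , namely that $\widehat{\mu}(t)$ equals the sum over $j$ of $\bigl(\sum_{k}p_{k}^{(j)}e^{ia_{k}^{(j)}t}\bigr)\widehat{\mu}(\gamma_{j}t)$ , can be turned into a simple recursive approximation. The Python sketch below is an illustration under stated assumptions: it starts from the point mass at $0$ (whose Fourier transform is identically $1$ ) and applies the equation a fixed number of times; the function names and the recursion depth are hypothetical choices. As a sanity check, for the single ratio $\gamma=\frac{1}{2}$ with translations $\pm 1$ and equal weights the measure is uniform on $[-2,2]$ , whose Fourier transform is $\sin(2t)/(2t)$ .

```python
import cmath
import math


def fourier_level(t, gamma, a, p, depth):
    """Fourier transform at t of the depth-level approximation of mu.

    gamma[j] is the j-th contraction ratio; a[j] and p[j] list the
    translations and probabilities of the maps sharing that ratio.
    The cost grows like len(gamma) ** depth, so keep depth small
    when there are several distinct ratios.
    """
    if depth == 0:
        return 1.0 + 0.0j
    total = 0.0j
    for g, shifts, probs in zip(gamma, a, p):
        phase = sum(pk * cmath.exp(1j * ak * t) for ak, pk in zip(shifts, probs))
        total += phase * fourier_level(g * t, gamma, a, p, depth - 1)
    return total


def uniform_reference(t):
    """Fourier transform of the uniform probability measure on [-2, 2]."""
    return math.sin(2 * t) / (2 * t)
```

Since every level is a probability measure, the returned value always has modulus at most $1$ and equals $1$ at $t=0$ .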

We can estimate

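$$|\widehat{\mu}(t)|\leq\Bigl(\bigl|p_{1}^{(1)}+p_{2}^{(1)}e^{i(a_{2}^{(1)}-a_{1}^{(1)})t}\bigr|+\sum_{k=3}^{k_{1}}p_{k}^{(1)}\Bigr)\cdot|\widehat{\mu}(\gamma_{1}t)|+\sum_{j=2}^{d}\Bigl(\sum_{k=1}^{k_{j}}p_{k}^{(j)}\Bigr)|\widehat{\mu}(\gamma_{j}t)|.\tag{3.1}$$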

Denote

$$p_{j}:=\sum_{k=1}^{k_{j}}p_{k}^{(j)},\quad j=1,\ldots,d.$$

Recall that $a_{2}^{(1)}-a_{1}^{(1)}=\pi$ by assumption, and use an elementary inequality

$$|1+e^{\pi iz}|\leq 2\Bigl(1-\frac{\pi}{4}z^{2}\Bigr)\quad\text{for}\ \ z\in[-1/2,1/2].$$

We then obtain from (3.1), denoting by $\|t\|$ the distance from $t\in{\mathbb{R}}$ to the nearest integer:

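$$|\widehat{\mu}(t)|\leq p_{1}\Bigl(1-\frac{\pi{\varepsilon}}{2}\|t\|^{2}\Bigr)|\widehat{\mu}(\gamma_{1}t)|+\sum_{j=2}^{d}p_{j}|\widehat{\mu}(\gamma_{j}t)|,\tag{3.2}$$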

using that $\min_{k}p_{k}^{(1)}\geq{\varepsilon}$ .
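The gain in this step comes from two maps with the same contraction ratio whose translations differ by $\pi$ : their combined factor $|p_{1}^{(1)}+p_{2}^{(1)}e^{i\pi t}|$ is at most $p_{1}^{(1)}+p_{2}^{(1)}-\frac{\pi{\varepsilon}}{2}\|t\|^{2}$ whenever both weights are at least ${\varepsilon}$ . The following Python sketch checks the elementary inequality and this consequence on a grid; it is a numerical illustration with hypothetical function names and tolerances, not a proof.

```python
import cmath
import math


def dist_to_nearest_integer(t):
    """The quantity ||t|| used in the text."""
    return abs(t - round(t))


def two_atom_factor(p1, p2, t):
    """|p1 + p2 * e^{i pi t}|, the factor from two translations differing by pi."""
    return abs(p1 + p2 * cmath.exp(1j * math.pi * t))


def check_elementary_inequality(steps=1000):
    """Check |1 + e^{i pi z}| <= 2 (1 - pi z^2 / 4) on a grid in [-1/2, 1/2]."""
    for i in range(steps + 1):
        z = -0.5 + i / steps
        lhs = abs(1 + cmath.exp(1j * math.pi * z))
        if lhs > 2 * (1 - math.pi * z * z / 4) + 1e-12:
            return False
    return True


def check_gain(p1, p2, eps, t_values):
    """Check |p1 + p2 e^{i pi t}| <= p1 + p2 - (pi eps / 2) ||t||^2, assuming min(p1, p2) >= eps."""
    for t in t_values:
        bound = p1 + p2 - 0.5 * math.pi * eps * dist_to_nearest_integer(t) ** 2
        if two_atom_factor(p1, p2, t) > bound + 1e-12:
            return False
    return True
```

Both checks are expected to return `True`; the bound is attained only when $t$ is an even integer, where the two exponentials are in phase.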

Next we introduce some notation. Let ${\mathcal{A}}=\{1,\ldots,d\}$ . For a word $w\in{\mathcal{A}}^{*}$ let $\ell_{j}(w)$ be the number of $j$ ’s in $w$ , and let $\ell(w)=(\ell_{j}(w))_{j=1}^{d}\in{\mathbb{Z}}_{+}^{d}$ . For $\boldsymbol{n}=(n_{j})_{j=1}^{d}\in{\mathbb{Z}}^{d}_{+}$ we will write

$$\boldsymbol{\gamma}^{\boldsymbol{n}}=\prod_{j=1}^{d}\gamma_{j}^{n_{j}},\quad{\bf p}^{\boldsymbol{n}}=\prod_{j=1}^{d}p_{j}^{n_{j}},$$

where ${\bf p}:=(p_{1},\ldots,p_{d})$ . (Note that ${\bf p}\neq\boldsymbol{p}$ ; hopefully, this will not cause confusion; in any case, we do not need $\boldsymbol{p}$ any more.) Further, let $w[1,i]$ be the prefix of $w$ of length $i$ ; if $i=0$ , this is the empty word, by convention.

Iterating (3.2) we obtain

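$$|\widehat{\mu}(t)|\leq\sum_{w\in{\mathcal{A}}^{N}}{\bf p}^{\ell(w)}|\widehat{\mu}(\boldsymbol{\gamma}^{\ell(w)}t)|\prod_{i:\ w_{i}=1}\Bigl(1-\frac{\pi{\varepsilon}}{2}\|\boldsymbol{\gamma}^{\ell(w[1,i-1])}t\|^{2}\Bigr).\tag{3.3}$$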

Notation 3.1.

We will consider ${\mathbb{Z}}^{d}_{+}$ as the vertex set of a directed graph, with a directed edge going from $\boldsymbol{n}\in{\mathbb{Z}}^{d}_{+}$ to each of $\boldsymbol{n}^{\prime}=\boldsymbol{n}+\boldsymbol{e}_{j},\,j\leq d$ , where $\boldsymbol{e}_{j}$ is the $j$ -th unit vector. We will then write $\boldsymbol{n}\to\boldsymbol{n}^{\prime}$ . A vertex $\boldsymbol{n}^{\prime}$ is a descendant of $\boldsymbol{n}$ of level $r\geq 1$ if there is a path of length $r$ from $\boldsymbol{n}$ to $\boldsymbol{n}^{\prime}$ (the length of a path is the number of edges). We will identify a word $w\in{\mathcal{A}}^{N}$ with a path of length $N$ in ${\mathbb{Z}}^{d}_{+}$ , formed by the sequence of vertices $\{\ell(w[1,i]):\ i=0,\ldots,N\}$ and denote this path by $\Gamma(w)$ . It is clear that $\ell(w[1,i])\to\ell(w[1,i+1])$ for $i=0,\ldots,N-1$ .

We will write $\boldsymbol{n}\leadsto\boldsymbol{n}^{\prime}$ if $\boldsymbol{n}^{\prime}$ is a descendant of $\boldsymbol{n}$ and $\|\boldsymbol{n}^{\prime}-\boldsymbol{n}\|_{\infty}\leq 1$ . Equivalently, $\boldsymbol{n}\leadsto\boldsymbol{n}^{\prime}$ iff $\boldsymbol{n}^{\prime}=\boldsymbol{n}+\sum_{\kappa\in{\mathcal{G}}}\boldsymbol{e}_{\kappa}$ for some, possibly empty, subset ${\mathcal{G}}\subset{\mathcal{A}}$ . Thus $\boldsymbol{n}\leadsto\boldsymbol{n}^{\prime}$ implies that either $\boldsymbol{n}^{\prime}=\boldsymbol{n}$ , or $\boldsymbol{n}^{\prime}$ is a descendant of $\boldsymbol{n}$ of level $\leq d$ .

Definition 3.2.

Let $\rho\in(0,\frac{1}{2})$ , $t>0$ , and $\boldsymbol{\gamma}\in{[B_{2}^{-1},B_{1}^{-1}]}^{d}$ . Say that a vertex $\boldsymbol{n}\in{\mathbb{Z}}^{d}_{+}$ is $(\boldsymbol{\gamma},t,\rho)$ -good if $\boldsymbol{\gamma}^{\boldsymbol{n}}t\geq 1$ and

$$\|\boldsymbol{\gamma}^{\boldsymbol{n}}t\|\geq\rho$$

(recall that $\|\cdot\|$ denotes the distance from the nearest integer).

Further, say that a vertex $\boldsymbol{n}\in{\mathbb{Z}}^{d}_{+}$ is “on a $(\boldsymbol{\gamma},t,\rho)$ -good track” if there exists $\boldsymbol{n}^{\prime}$ that is $(\boldsymbol{\gamma},t,\rho)$ -good and $\boldsymbol{n}\leadsto\boldsymbol{n}^{\prime}$ .

Finally, we say that an edge $[\boldsymbol{n},\boldsymbol{n}^{\prime}]$ is $(\boldsymbol{\gamma},t,\rho)$ -good if $\boldsymbol{n}$ is $(\boldsymbol{\gamma},t,\rho)$ -good and $\boldsymbol{n}^{\prime}=\boldsymbol{n}+\boldsymbol{e}_{1}$ . (Notice that the 1-st coordinate, corresponding to $w_{i}=1$ , is “special” by construction, see (2.4) and (3.3).)
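The notions of good vertex, good track and good edge translate directly into code. The Python sketch below is one possible reading of Notation 3.1 and Definition 3.2, with a vertex called good when $\boldsymbol{\gamma}^{\boldsymbol{n}}t\geq 1$ and $\|\boldsymbol{\gamma}^{\boldsymbol{n}}t\|\geq\rho$ ; all function names are hypothetical, and words are tuples of letters in $\{1,\ldots,d\}$ .

```python
import math
from itertools import product


def gamma_power(gamma, n):
    """gamma^n = prod_j gamma_j ** n_j for a vertex n."""
    return math.prod(g ** k for g, k in zip(gamma, n))


def dist_to_nearest_integer(x):
    return abs(x - round(x))


def is_good_vertex(n, gamma, t, rho):
    x = gamma_power(gamma, n) * t
    return x >= 1 and dist_to_nearest_integer(x) >= rho


def on_good_track(n, gamma, t, rho):
    """True if some n' = n + (sum of distinct unit vectors, possibly none) is good."""
    for step in product((0, 1), repeat=len(gamma)):
        n_prime = tuple(k + s for k, s in zip(n, step))
        if is_good_vertex(n_prime, gamma, t, rho):
            return True
    return False


def path_vertices(word, d):
    """Vertices l(w[1,i]) for i = 0, ..., len(word)."""
    n = [0] * d
    vertices = [tuple(n)]
    for letter in word:
        n[letter - 1] += 1
        vertices.append(tuple(n))
    return vertices


def count_good_edges(word, gamma, t, rho):
    """Number of steps with letter 1 that start at a good vertex."""
    vertices = path_vertices(word, len(gamma))
    return sum(
        1
        for i, letter in enumerate(word)
        if letter == 1 and is_good_vertex(vertices[i], gamma, t, rho)
    )
```

The count returned by `count_good_edges` is the number of factors in the product over $i$ with $w_{i}=1$ in (3.3) that are guaranteed to be at most $1-\frac{\pi{\varepsilon}}{2}\rho^{2}$ .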

Consider $t\in(B_{1}^{N-1},B_{1}^{N}]$ . Then $\boldsymbol{\gamma}^{\ell(w)}t\leq 1$ for all $w\in{\mathcal{A}}^{N}$ , by the assumption $\gamma_{\max}\leq B_{1}^{-1}$ . It follows from (3.3), roughly speaking, that in order to have a power decay for $\widehat{\mu}(t)$ for $t$ at this scale, it is sufficient that for “most” (up to exponentially small number) words $w\in{\mathcal{A}}^{N}$ there is a fixed positive proportion of $(\boldsymbol{\gamma},t,\rho)$ -good edges on the path corresponding to $w$ , for some $\rho>0$ . With this in mind, we define the exceptional set of $\boldsymbol{\gamma}$ at scale $N$ as follows:

Definition 3.3.

Fix $k_{1}\in{\mathbb{N}}$ and $\rho>0$ , and let ${\mathcal{E}}_{N}={\mathcal{E}}_{N}(k_{1},\rho)$ be the set of $\boldsymbol{\gamma}\in{[B_{2}^{-1},B_{1}^{-1}]}^{d}$ such that there exists $t\in(B_{1}^{N-1},B_{1}^{N}]$ and a word $w\in{\mathcal{A}}^{N}$ with the properties:

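$$\#\bigl\{d+1\leq i\leq N-d-1:\ \ell(w[1,i])\ \text{is ``on a }(\boldsymbol{\gamma},t,\rho)\text{-good track''}\bigr\}\leq\frac{N}{k_{1}}.$$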

Further, we define the exceptional set by ${\mathcal{E}}^{\prime}:={\mathcal{E}}^{\prime}(k_{1},\rho)=\limsup{\mathcal{E}}_{N}(k_{1},\rho).$

Let

$$\rho:=\frac{1}{4(1+B_{2})(1+3B_{2})}.\tag{3.5}$$

Theorem 2.1 will follow, once we prove the next two propositions:

Proposition 3.4.

For all $k_{1}\in{\mathbb{N}}$ sufficiently large, there exists $\alpha>0$ , depending on $d,k_{1},B_{1},B_{2},{\varepsilon}$ , such that for all $\boldsymbol{\gamma}\in{[B_{2}^{-1},B_{1}^{-1}]}^{d}\setminus{\mathcal{E}}^{\prime}(k_{1},\rho)$ , for all $\boldsymbol{p}$ , with $\min_{j}p_{j}\geq{\varepsilon}$ , and all $\boldsymbol{a}$ satisfying (2.4), we have $\mu={\mu}_{\boldsymbol{\gamma},\boldsymbol{a}}^{\boldsymbol{p}}\in{\mathcal{D}}(\alpha)$ .

Proposition 3.5.

For all $k_{1}\in{\mathbb{N}}$ sufficiently large we have $\dim_{H}({\mathcal{E}}^{\prime}(k_{1},\rho))<s$ .

4. Fourier decay for non-exceptional $\boldsymbol{\gamma}$

Fix $\boldsymbol{\gamma}\not\in{\mathcal{E}}^{\prime}={\mathcal{E}}^{\prime}(k_{1},\rho)$ , where $\rho$ is given by (3.5) and $k_{1}$ is fixed, sufficiently large. (A specific value for $k_{1}$ will be chosen in (5.6).) Then $\boldsymbol{\gamma}\not\in{\mathcal{E}}_{N}={\mathcal{E}}_{N}(k_{1},\rho)$ for all $N$ sufficiently large. Fix such an $N$ . The condition $\boldsymbol{\gamma}\not\in{\mathcal{E}}_{N}$ means, by definition, that for every $t\in(B_{1}^{N-1},B_{1}^{N}]$ and for every $w\in{\mathcal{A}}^{N}$ , the number of vertices “on a $(\boldsymbol{\gamma},t,\rho)$ -good track” on the path $\Gamma(w[d+1,N-d-1])$ is greater than $N/k_{1}$ . Fix $t\in(B_{1}^{N-1},B_{1}^{N}]$ . Since $\boldsymbol{\gamma}$ , $t$ , and $\rho$ are now fixed, we will omit $(\boldsymbol{\gamma},t,\rho)$ when talking about vertices and edges that are good or “on a good track”.

We will consider ${\mathcal{A}}^{N}$ as a probability space, with the Bernoulli measure ${\mathbb{P}}\,={\bf p}^{N}$ , and the “random environment” provided by the configuration of good vertices and edges. Let $p_{\min}:=\min_{j\leq d}p_{j}$ . Let us introduce the following random variables for $i=1,\ldots,N$ :

•

for $r\geq 1$ , $X_{i}^{(r)}$ is the number of vertices on the path $\Gamma(w[1,i])$ having a good vertex among its $r$ -level descendants;

•

$X_{i}=X_{i}^{(0)}$ is the number of good vertices on the path $\Gamma(w[1,i])$ ;

•

$Y_{i}$ is the number of good edges on the path $\Gamma(w[1,i])$ .

Notice that for every vertex of $\Gamma(w[d+1,N])$ that is “on a good track”, there is a vertex of $\Gamma(w)$ that had a good vertex among its $d$ -level descendants, and this mapping is at most $(d+1)$ -to- $1$ . It follows that, with probability one,

$$X_{N}^{(d)}\geq\frac{N}{(d+1)k_{1}}.\tag{4.1}$$

Lemma 4.1.

There exist $\delta>0$ , $C^{\prime}>0$ , and $c>0$ , depending only on $p_{\min}$ and $k_{1}$ , such that, assuming $N$ is sufficiently large (depending only on $p_{\min}$ and $k_{1}$ ), we have

$${\mathbb{P}}\,(Y_{N}<\delta N)\leq C^{\prime}\exp(-cN).$$

We first deduce power Fourier decay for $\boldsymbol{\gamma}\not\in{\mathcal{E}}^{\prime}$ from the lemma.

Proof of Proposition 3.4.

Consider the sum in the inequality (3.3) and split it according to whether $Y_{N}=Y_{N}(w)<\delta N$ or $\geq\delta N$ . The sum over $w$ such that $Y_{N}(w)<\delta N$ , is bounded by ${\mathbb{P}}\,\left(Y_{N}<\delta N\right)$ . If $w$ is such that $Y_{N}\geq\delta N$ , then the corresponding term in the right-hand side of (3.3) is estimated from above by $\bigl{(}1-\frac{\pi{\varepsilon}}{2}\rho^{2}\bigr{)}^{\delta N}$ , by the definition of a good edge (we also use the fact that $|\widehat{\mu}(t)|\leq 1$ , since $\mu$ is a probability measure). Then Lemma 4.1 implies, for $N$ sufficiently large:

$$|\widehat{\mu}(t)|\leq\Bigl(1-\frac{\pi{\varepsilon}}{2}\rho^{2}\Bigr)^{\delta N}+C^{\prime}\exp(-cN).$$

Since $N$ was arbitrary, sufficiently large, and $t$ arbitrary in $(B_{1}^{N-1},B_{1}^{N}]$ , this implies that $\mu\in{\mathcal{D}}(\alpha)$ for some $\alpha>0$ . ∎
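The last step converts a bound that is exponentially small in $N$ into a power of $t$ : for $t\in(B_{1}^{N-1},B_{1}^{N}]$ we have $N\geq\log t/\log B_{1}$ , so each of the two terms is at most $t^{-\alpha}$ times a constant for a suitable $\alpha>0$ . The Python sketch below computes one admissible exponent; the function names are hypothetical, and the values of $\delta$ , $c$ and $C^{\prime}$ used to try it out are placeholders, since the paper does not give them explicitly.

```python
import math


def decay_exponent(B1, eps, rho, delta, c):
    """An exponent alpha with both terms of the bound at most t ** (-alpha), up to C'.

    Uses theta ** (delta * N) = (B1 ** N) ** (-alpha_edges) and
    exp(-c * N) = (B1 ** N) ** (-alpha_tail), together with t <= B1 ** N.
    """
    theta = 1.0 - math.pi * eps * rho ** 2 / 2.0
    alpha_edges = -delta * math.log(theta) / math.log(B1)
    alpha_tail = c / math.log(B1)
    return min(alpha_edges, alpha_tail)


def scale_bound(N, eps, rho, delta, c, C_prime):
    """Right-hand side of the estimate for t in (B1^(N-1), B1^N]."""
    theta = 1.0 - math.pi * eps * rho ** 2 / 2.0
    return theta ** (delta * N) + C_prime * math.exp(-c * N)
```

With $B_{1}=2$ , $B_{2}=4$ and ${\varepsilon}=0.1$ , the choice of $\rho$ in (3.5) gives $\rho=1/260$ , and the resulting exponent is of order $10^{-7}$ even for generous placeholder values of $\delta$ and $c$ . This is consistent with the remark in the introduction that the proof gives extremely slow power decay.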

As a step in the proof of Lemma 4.1, we will first establish the following

Lemma 4.2.

There exist $\delta_{r}>0$ , $C_{r}^{\prime}$ , and $c_{r}>0$ , for $r=0,\ldots,d$ , such that, for $N$ sufficiently large,

$${\mathbb{P}}\,\bigl(X_{N-1}^{(r)}\leq\delta_{r}N\bigr)\leq C_{r}^{\prime}\cdot\exp(-c_{r}N).\tag{4.3}$$

Proof of Lemma 4.2.

We will show this by induction in $r$ , going from $r=d$ to $r=0$ . For $r=d$ the claim trivially holds, by (4.1). Fix $r\in\{0,\ldots,d-1\}$ and assume that (4.3) holds for $r+1$ . Consider the sequence of random variables

$$Z_{i}^{(r)}:=\frac{X_{i+1}^{(r)}}{p_{\min}}-X_{i}^{(r+1)}.$$

We claim that this is a submartingale; in fact,

[TABLE]

Indeed, we have either (a) $X_{i}^{(r+1)}=X_{i-1}^{(r+1)}$ , or (b) $X_{i}^{(r+1)}=X_{i-1}^{(r+1)}+1$ . The former case occurs when $\ell(w[1,i])$ has no good descendants of level $r+1$ , and then $\ell(w[1,i+1])$ has no good descendants of level $r$ . Thus in case (a) we have $X_{i+1}^{(r)}=X_{i}^{(r)}$ and $Z_{i}^{(r)}=Z_{i-1}^{(r)}$ .

In case (b), on the other hand, $\ell(w[1,i])$ has a good descendant of level $r+1$ , and then $\ell(w[1,i+1])$ has a good descendant of level $r$ , with probability $\geq p_{\min}$ , independently of the past. Then either $X^{(r)}_{i+1}=X^{(r)}_{i}$ or $X^{(r)}_{i+1}=X^{(r)}_{i}+1$ , hence $Z^{(r)}_{i}=Z^{(r)}_{i-1}-1$ or $Z^{(r)}_{i}=Z^{(r)}_{i-1}+\frac{1}{p_{\min}}-1$ .

Formally, we obtain

[TABLE]

Since ${\mathbb{P}}\,[X_{i}^{(r+1)}=X_{i-1}^{(r+1)}]+{\mathbb{P}}\,[X_{i}^{(r+1)}=X_{i-1}^{(r+1)}+1]=1$ , we have

[TABLE]

confirming the claim that $\{Z^{(r)}_{i}\}$ is a submartingale.
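
The key step can be checked by hand. The following is a sketch, assuming $Z^{(r)}_{i}=\frac{X^{(r)}_{i+1}}{p_{\min}}-X^{(r+1)}_{i}$ (the form recalled later in the proof for $i=N-1$) and writing $p\geq p_{\min}$ for the conditional probability of the increment $X^{(r)}_{i+1}=X^{(r)}_{i}+1$ in case (b); the symbol $p$ is introduced here only for illustration. In case (a) the increment of $Z^{(r)}$ is zero. In case (b),

$${\mathbb{E}}\bigl[Z^{(r)}_{i}-Z^{(r)}_{i-1}\,\big|\,\text{past}\bigr]=p\Bigl(\frac{1}{p_{\min}}-1\Bigr)+(1-p)(-1)=\frac{p}{p_{\min}}-1\geq 0.$$

Thus the conditional expected increment is nonnegative in both cases.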

We are going to apply the Azuma-Hoeffding inequality, which says that, given that $\{Z^{(r)}_{i}\}$ is a submartingale, if $|Z^{(r)}_{i}-Z^{(r)}_{i-1}|\leq\alpha_{i}$ for all $i$ , then

[TABLE]

See, e.g., [1] for the (two-sided) Azuma-Hoeffding inequality for martingales. The one-sided inequality for submartingales is proved similarly, see e.g., [5].

We have $|Z^{(r)}_{i}-Z^{(r)}_{i-1}|\leq p_{\min}^{-1}$ , hence taking $y=\frac{\delta_{r+1}N}{3}$ yields

[TABLE]
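
To get a feel for the size of such a bound, here is a minimal numerical sketch. It assumes the standard one-sided Azuma-Hoeffding form $\exp\bigl(-y^{2}/(2\sum_{i}\alpha_{i}^{2})\bigr)$ for the lower tail of a submartingale; the function names and the parameter `n_steps` (the number of increments) are hypothetical and not taken from the paper.

```python
import math

def azuma_lower_tail_bound(y, alphas):
    """Bound exp(-y^2 / (2 * sum(alpha_i^2))) on P[Z_n - Z_0 <= -y] for a
    submartingale whose increments satisfy |Z_i - Z_{i-1}| <= alpha_i.
    (Standard form of the inequality; assumed, not copied from the paper.)"""
    if y < 0:
        raise ValueError("y must be nonnegative")
    total = sum(a * a for a in alphas)
    return math.exp(-(y * y) / (2.0 * total))

def bound_for_uniform_increments(n_steps, p_min, delta):
    """Hypothetical setup: n_steps increments, each bounded by 1/p_min,
    and deviation y = delta * n_steps / 3 as in the text."""
    alphas = [1.0 / p_min] * n_steps
    return azuma_lower_tail_bound(delta * n_steps / 3.0, alphas)
```

With these choices the bound simplifies to $\exp(-\delta^{2}p_{\min}^{2}\,n/18)$ for $n$ increments, which decays exponentially in $n$.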

Since $Z_{1}^{(r)}$ is bounded, we have for $N$ sufficiently large:

[TABLE]

Recall that $Z^{(r)}_{N-1}=\frac{X^{(r)}_{N}}{p_{\min}}-X^{(r+1)}_{N-1}$ , and

[TABLE]

by the inductive assumption. Therefore for $N$ sufficiently large,

[TABLE]

and (4.3) follows. ∎

Proof of Lemma 4.1.

Consider the sequence of random variables

[TABLE]

We claim that $\{U_{i}\}$ is a martingale; in fact,

[TABLE]

This is proved analogously to the proof of the submartingale property for $\{Z^{(r)}_{i}\}$ above. If $\boldsymbol{n}=\ell(w[1,i])$ is not a good vertex, then the edge $[\boldsymbol{n},\boldsymbol{n}^{\prime}]$, with $\boldsymbol{n}^{\prime}=\ell(w[1,i+1])$, is not good either, and we have $X_{i}=X_{i-1},\ Y_{i+1}=Y_{i},\ U_{i}=U_{i-1}$. If, on the other hand, $\boldsymbol{n}=\ell(w[1,i])$ is a good vertex, then the edge $[\boldsymbol{n},\boldsymbol{n}^{\prime}]$, with $\boldsymbol{n}^{\prime}=\ell(w[1,i+1])$, is good with probability $p_{1}$, and this is independent of the past. Thus, if $X_{i}=X_{i-1}+1$, then

[TABLE]

This implies (4.6); the formal computation, similar to the above, is left to the reader.

Applying the Azuma-Hoeffding inequality to $\{U_{i}\}$ , in view of $|U_{i}-U_{i-1}|\leq p_{1}^{-1}$ , after a computation similar to that above, we can estimate, for $N$ large enough, using (4.3) for $r=0$ :

[TABLE]

This implies the desired estimate (4.2). ∎

5. Dimension of the exceptional set

Fix $\boldsymbol{\gamma}\in{\mathcal{E}}^{\prime}$ . This means that $\boldsymbol{\gamma}\in{\mathcal{E}}_{N}$ for infinitely many $N$ . Fix such an $N$ , sufficiently large. We will show that this imposes constraints on $\boldsymbol{\gamma}$ allowing us to construct a good cover of ${\mathcal{E}}_{N}$ . By definition of ${\mathcal{E}}_{N}$ , there exists $t\in(B_{1}^{N-1},B_{1}^{N}]$ and a word $w\in{\mathcal{A}}^{N}$ , such that the number of vertices “on a good $(\boldsymbol{\gamma},t,\rho)$ -track” on the path $\Gamma(w[d+1,N-d-1])$ does not exceed $N/k_{1}$ . Fix such a $t$ and $w\in{\mathcal{A}}^{N}$ , and for $\boldsymbol{n}\in{\mathbb{Z}}^{d}_{+}$ let

[TABLE]

that is, $K_{\boldsymbol{n}}$ is the nearest integer to $\boldsymbol{\gamma}^{\boldsymbol{n}}t$ and $\|\boldsymbol{\gamma}^{\boldsymbol{n}}t\|=|{\varepsilon}_{\boldsymbol{n}}|$ . One should keep in mind that $K_{\boldsymbol{n}}$ and ${\varepsilon}_{\boldsymbol{n}}$ depend on $\boldsymbol{\gamma}$ and $t$ , but we suppress this in notation to reduce “clutter”.
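
The decomposition into nearest integer and signed remainder is easy to compute. The sketch below assumes the usual multi-index convention $\boldsymbol{\gamma}^{\boldsymbol{n}}=\prod_{j}\gamma_{j}^{n_{j}}$; the function names are hypothetical.

```python
import math

def nearest_integer_split(x):
    """Return (K, eps) with K the nearest integer to x and eps = x - K,
    so that |eps| <= 1/2 equals the distance from x to the nearest integer."""
    k = math.floor(x + 0.5)  # ties round up; built-in round() rounds ties to even
    return k, x - k

def gamma_power(gamma, n):
    """gamma^n = prod_j gamma_j ** n_j for a multi-index n (assumed convention)."""
    return math.prod(g ** e for g, e in zip(gamma, n))
```

For given $\boldsymbol{\gamma}$, $\boldsymbol{n}$ and $t$, calling `nearest_integer_split(gamma_power(gamma, n) * t)` returns the pair $(K_{\boldsymbol{n}},{\varepsilon}_{\boldsymbol{n}})$.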

The next lemma is analogous to the ones appearing in other variants of the Erdős-Kahane argument; see e.g., [28, Lemma 6.3].

Lemma 5.1.

Let $\rho$ be given by (3.5) and

[TABLE]

Let $\boldsymbol{n}\in{\mathbb{Z}}^{d}_{+}$ , $\boldsymbol{n}^{\prime}=\boldsymbol{n}+\boldsymbol{e}_{j},\ \boldsymbol{n}^{\prime\prime}=\boldsymbol{n}+2\boldsymbol{e}_{j}$ for some $j\in{\mathcal{A}}$ (in particular, $\boldsymbol{n}\to\boldsymbol{n}^{\prime}\to\boldsymbol{n}^{\prime\prime}$ ), such that $K_{\boldsymbol{n}^{\prime\prime}}\geq 1$ . The following hold:

(i) Given $K_{\boldsymbol{n}^{\prime\prime}}$ and $K_{\boldsymbol{n}^{\prime}}$, there are at most $A$ possibilities for $K_{\boldsymbol{n}}$.

(ii) Given $K_{\boldsymbol{n}^{\prime\prime}}$ and $K_{\boldsymbol{n}^{\prime}}$, the number $K_{\boldsymbol{n}}$ is uniquely determined, provided

[TABLE]

that is, provided none of the $\boldsymbol{n},\boldsymbol{n}^{\prime},\boldsymbol{n}^{\prime\prime}$ is $(\boldsymbol{\gamma},t,\rho)$ -good.

Proof.

We have, by assumption,

[TABLE]

The idea is that

[TABLE]

hence $K_{\boldsymbol{n}}$ must be not too far from $\frac{K_{\boldsymbol{n}^{\prime}}^{2}}{K_{\boldsymbol{n}^{\prime\prime}}}$ . First note that

[TABLE]

where we used the bound $1\leq\boldsymbol{\gamma}^{\boldsymbol{n}^{\prime\prime}}t=\boldsymbol{\gamma}^{\boldsymbol{n}^{\prime}}\gamma_{j}t\leq\boldsymbol{\gamma}^{\boldsymbol{n}^{\prime}}t$. Next,

[TABLE]

Therefore,

[TABLE]

using (5.3) in the last step. Now both parts of the lemma follow easily. Indeed, $K_{\boldsymbol{n}}$ is an integer.

(i) Since $\max\{|{\varepsilon}_{\boldsymbol{n}}|,|{\varepsilon}_{\boldsymbol{n}^{\prime}}|,|{\varepsilon}_{\boldsymbol{n}^{\prime\prime}}|\}\leq\frac{1}{2}$ , once $K_{\boldsymbol{n}^{\prime}}$ and $K_{\boldsymbol{n}^{\prime\prime}}$ are given, there are at most $A$ possibilities for $K_{\boldsymbol{n}}$ , see (5.1).

(ii) The choice of $K_{\boldsymbol{n}}$ will be unique, provided

[TABLE]

see (3.5). ∎
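
The mechanism behind the lemma can be illustrated numerically. Since $\boldsymbol{n}^{\prime}=\boldsymbol{n}+\boldsymbol{e}_{j}$ and $\boldsymbol{n}^{\prime\prime}=\boldsymbol{n}+2\boldsymbol{e}_{j}$, we have $\boldsymbol{\gamma}^{\boldsymbol{n}}t=(\boldsymbol{\gamma}^{\boldsymbol{n}^{\prime}}t)^{2}/(\boldsymbol{\gamma}^{\boldsymbol{n}^{\prime\prime}}t)$, so $K_{\boldsymbol{n}}$ should stay close to $K_{\boldsymbol{n}^{\prime}}^{2}/K_{\boldsymbol{n}^{\prime\prime}}$. The sketch below is only an illustration under that identity: the argument `x` stands for $\boldsymbol{\gamma}^{\boldsymbol{n}}t$, the function names are hypothetical, and no claim is made about the precise constant $A$ of the lemma.

```python
import math

def nearest_int(x):
    """Nearest integer to x (ties rounded up)."""
    return math.floor(x + 0.5)

def ratio_prediction(x, gamma_j):
    """Given x = gamma^n * t and a ratio 0 < gamma_j < 1, return
    (K_n, K_{n'}^2 / K_{n''}), where n' = n + e_j and n'' = n + 2 e_j."""
    k0 = nearest_int(x)
    k1 = nearest_int(x * gamma_j)
    k2 = nearest_int(x * gamma_j ** 2)
    if k2 < 1:
        raise ValueError("need K_{n''} >= 1, as assumed in the lemma")
    return k0, k1 * k1 / k2
```

When `x * gamma_j ** 2` is large, the two returned values differ by a quantity that is bounded in terms of $\gamma_{j}^{-1}$ alone, which is why only boundedly many integers $K_{\boldsymbol{n}}$ are compatible with given $K_{\boldsymbol{n}^{\prime}}$ and $K_{\boldsymbol{n}^{\prime\prime}}$.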

Corollary 5.2.

Suppose that $\boldsymbol{n}\to\boldsymbol{n}^{\prime}$ , and we are given $K_{\boldsymbol{v}^{\prime}}$ for all $\boldsymbol{v}^{\prime}$ such that $\boldsymbol{n}^{\prime}\leadsto\boldsymbol{v}^{\prime}$ ; assume that all of them satisfy $K_{\boldsymbol{v}^{\prime}}\geq 1$ . Then

(i) for any $\boldsymbol{v}$ such that $\boldsymbol{n}\leadsto\boldsymbol{v}$, there are at most $A$ possibilities for $K_{\boldsymbol{v}}$;

(ii) for any $\boldsymbol{v}$ such that $\boldsymbol{n}\leadsto\boldsymbol{v}$, assuming that neither $\boldsymbol{v}$ nor any of the $\boldsymbol{v}^{\prime}$ with $\boldsymbol{n}^{\prime}\leadsto\boldsymbol{v}^{\prime}$ is $(\boldsymbol{\gamma},t,\rho)$-good, $K_{\boldsymbol{v}}$ is uniquely determined.

Proof.

Fix $\boldsymbol{v}$ such that $\boldsymbol{n}\leadsto\boldsymbol{v}$ . Then $\boldsymbol{v}=\boldsymbol{n}+\sum_{\kappa\in{\mathcal{G}}}\boldsymbol{e}_{\kappa}$ , for some ${\mathcal{G}}\subset{\mathcal{A}}$ . We have $\boldsymbol{n}\to\boldsymbol{n}^{\prime}$ ; suppose $\boldsymbol{n}^{\prime}=\boldsymbol{n}+\boldsymbol{e}_{j}$ for some $j\in{\mathcal{A}}$ . If $j\in{\mathcal{G}}$ , then $\boldsymbol{v}=\boldsymbol{n}^{\prime}+\sum_{\kappa\in{\mathcal{G}}\setminus\{j\}}\boldsymbol{e}_{\kappa}$ , so $\boldsymbol{n}^{\prime}\leadsto\boldsymbol{v}$ and $K_{\boldsymbol{v}}$ is already known by assumption.

If $j\not\in{\mathcal{G}}$ , then $\boldsymbol{v}^{\prime}:=\boldsymbol{n}^{\prime}+\sum_{\kappa\in{\mathcal{G}}}\boldsymbol{e}_{\kappa}$ and $\boldsymbol{v}^{\prime\prime}:=\boldsymbol{n}^{\prime}+\sum_{\kappa\in{\mathcal{G}}\cup\{j\}}\boldsymbol{e}_{\kappa}$ satisfy

[TABLE]

Moreover, $\boldsymbol{n}^{\prime}\leadsto\boldsymbol{v}^{\prime},\ \boldsymbol{n}^{\prime}\leadsto\boldsymbol{v}^{\prime\prime},$ so $K_{\boldsymbol{v}^{\prime}}$ and $K_{\boldsymbol{v}^{\prime\prime}}$ are already given, and we are exactly in the situation of Lemma 5.1. Applying the lemma yields the desired result. ∎

Proof of Proposition 3.5.

Let $q\in{\mathbb{N}}$ be maximal, such that

[TABLE]

Note that

[TABLE]

Let

[TABLE]

be the $i$ -th vertex on the path corresponding to the word $w$ , so that $\boldsymbol{n}_{i}\to\boldsymbol{n}_{i+1}$ . We have chosen $q$ in such a way that

[TABLE]

Recall that $w\in{\mathcal{A}}^{N}$ is fixed and the number of vertices “on a good $(\boldsymbol{\gamma},t,\rho)$ -track” on the path $\Gamma(w[d+1,N-d-1])$ does not exceed $N/k_{1}$ . We are going to estimate from above the number of possible configurations of integers $K_{\boldsymbol{v}}$ , where $\boldsymbol{n}_{i}\leadsto\boldsymbol{v}$ for some $i=q,\ q-1,\ldots,0$ . Note that for any $\boldsymbol{n}\in{\mathbb{Z}}^{d}_{+}$ there are $2^{d}$ vertices $\boldsymbol{v}$ such that $\boldsymbol{n}\leadsto\boldsymbol{v}$ .
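
The count $2^{d}$ can be made concrete by enumeration. The sketch below assumes, as in the proof of Corollary 5.2, that $\boldsymbol{n}\leadsto\boldsymbol{v}$ means $\boldsymbol{v}=\boldsymbol{n}+\sum_{\kappa\in{\mathcal{G}}}\boldsymbol{e}_{\kappa}$ for some subset ${\mathcal{G}}$ of the $d$ coordinates (the empty set included); the function name is hypothetical.

```python
from itertools import combinations

def followers(n):
    """List all v = n + sum_{k in G} e_k, with G ranging over the subsets
    of the coordinate set {0, ..., d-1}, where d = len(n)."""
    d = len(n)
    result = []
    for size in range(d + 1):
        for subset in combinations(range(d), size):
            v = list(n)
            for k in subset:
                v[k] += 1  # add the unit vector e_k
            result.append(tuple(v))
    return result
```

Each subset yields a different vertex, so the list has exactly $2^{d}$ entries.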

We start with the “initial configuration” of $K_{\boldsymbol{v}}$ for $\boldsymbol{v}$ such that $\boldsymbol{n}_{q}\leadsto\boldsymbol{v}$ . By the choice of $q$ we have

[TABLE]

so $K_{\boldsymbol{v}}\in[1,B_{2}^{d+1}]$ for all $\boldsymbol{v}$ such that $\boldsymbol{n}_{q}\leadsto\boldsymbol{v}$ . It follows that the total number of possibilities for $K_{\boldsymbol{v}}$ for all $\boldsymbol{v}$ such that $\boldsymbol{n}_{q}\leadsto\boldsymbol{v}$ , is at most

[TABLE]

Now we follow the path $\boldsymbol{n}_{q},\boldsymbol{n}_{q-1},\ldots,\boldsymbol{n}_{0}$ backwards, applying Corollary 5.2 at each step. Fix $i\leq q$. Part (i) of the corollary says that for any $\boldsymbol{v}$ with $\boldsymbol{n}_{i-1}\leadsto\boldsymbol{v}$, there are at most $A$ choices for $K_{\boldsymbol{v}}$, once all the $K_{\boldsymbol{v}^{\prime}}$ with $\boldsymbol{n}_{i}\leadsto\boldsymbol{v}^{\prime}$ are determined. Part (ii) of the corollary says that if neither $\boldsymbol{n}_{i}$ nor $\boldsymbol{n}_{i-1}$ is on a “good $(\boldsymbol{\gamma},t,\rho)$-track”, those $K_{\boldsymbol{v}}$ are determined uniquely. By assumption, there are no more than $N/k_{1}$ vertices of $w[d+1,N-d-1]$ that are “on a good $(\boldsymbol{\gamma},t,\rho)$-track”, hence this will affect at most $2N/k_{1}$ transitions between $\boldsymbol{n}_{N-d-1}$ and $\boldsymbol{n}_{d+1}$. On each transition, we determine at most $2^{d}$ “new” values of $K_{\boldsymbol{v}}$ (this is an “overcount,” but we do not try to be precise here). If we fix the subset of $\{1,\ldots,q\}$ corresponding to the $\leq N/k_{1}$ vertices “on a good $(\boldsymbol{\gamma},t,\rho)$-track” on the path $\Gamma(w[d+1,N-d-1])$, we will obtain at most

[TABLE]

total configurations, where $L_{2}=L_{1}\cdot A^{2(d+1)},\ A_{1}=A^{2^{d+1}}$ . Taking into account all the possibilities for the subset in question and also possible values of $q\leq N$ yields that the total number of configurations of $K_{\boldsymbol{v}}$ , for all $\boldsymbol{v}$ under consideration, is at most

[TABLE]

Next, note that the knowledge of all $K_{\boldsymbol{v}}$ associated with the path $w$ gives a good approximation of $\gamma_{j}$ . In fact, we have $\boldsymbol{n}_{0}={\bf 0}\leadsto\boldsymbol{e}_{j}$ for $j\in{\mathcal{A}}$ , so $K_{{\bf 0}}$ and $K_{\boldsymbol{e}_{j}}$ are among the “known” ones. Estimating as in Lemma 5.1, we have

[TABLE]

and $K_{\boldsymbol{e}_{j}}\geq t\gamma_{j}-\frac{1}{2}\geq\frac{B_{1}^{N-1}}{B_{2}}-\frac{1}{2}$ . It follows that the knowledge of all $K_{\boldsymbol{v}}$ associated with the path $w$ gives a cover of the exceptional $\boldsymbol{\gamma}$ by balls of diameter $\sim B_{1}^{-N}$ . Taking into account that the number of words $w\in{\mathcal{A}}^{N}$ is equal to $d^{N}$ , we obtain that the exceptional set ${\mathcal{E}}_{N}$ at scale $N$ may be covered by

[TABLE]

balls of diameter $\sim B_{1}^{-N}$ . Recall that $B_{1}^{s}>d$ by (2.3). Thus we can choose $k_{1}\in{\mathbb{N}}$ such that

[TABLE]

Then

[TABLE]

whence $\dim_{H}({\mathcal{E}}^{\prime})\leq s$ , as desired. ∎
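
For completeness, here is the standard covering argument behind this last implication, written as a sketch with generic notation: $M_{N}$ denotes the number of balls covering ${\mathcal{E}}_{N}$ and $c>0$ a constant bounding their diameters by $cB_{1}^{-N}$ (both symbols are introduced here for illustration). Since $\boldsymbol{\gamma}\in{\mathcal{E}}^{\prime}$ means $\boldsymbol{\gamma}\in{\mathcal{E}}_{N}$ for infinitely many $N$, we have ${\mathcal{E}}^{\prime}\subset\bigcup_{N\geq N_{0}}{\mathcal{E}}_{N}$ for every $N_{0}$, and therefore

$${\mathcal{H}}^{s}_{cB_{1}^{-N_{0}}}({\mathcal{E}}^{\prime})\leq\sum_{N\geq N_{0}}M_{N}\,\bigl(cB_{1}^{-N}\bigr)^{s}.$$

If the series $\sum_{N}M_{N}B_{1}^{-Ns}$ converges, the right-hand side tends to zero as $N_{0}\to\infty$, so ${\mathcal{H}}^{s}({\mathcal{E}}^{\prime})=0$ and hence $\dim_{H}({\mathcal{E}}^{\prime})\leq s$.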

Acknowledgement. I am grateful to Ori Gurel-Gurevich for his help with the probabilistic argument, and to Tuomas Sahlsten for helpful discussions.

Bibliography


[1] Noga Alon and Joel H. Spencer. The probabilistic method. Wiley-Interscience Series in Discrete Mathematics and Optimization. John Wiley & Sons, Inc., New York, 1992. With an appendix by Paul Erdős, A Wiley-Interscience Publication.
[2] Jean Bourgain and Semyon Dyatlov. Fourier dimension and spectral gaps for hyperbolic surfaces. Geom. Funct. Anal., 27(4):744–771, 2017.
[3] Alexander I. Bufetov and Boris Solomyak. On the modulus of continuity for spectral measures in substitution dynamics. Adv. Math., 260:84–129, 2014.
[4] Alexander I. Bufetov and Boris Solomyak. The Hölder property for the spectrum of translation flows in genus two. Israel J. Math., 223(1):205–259, 2018.
[5] Fan Chung and Linyuan Lu. Concentration inequalities and martingale inequalities: a survey. Internet Math., 3(1):79–127, 2006.
[6] Xin-Rong Dai, De-Jun Feng, and Yang Wang. Refinable functions with non-integer dilations. J. Funct. Anal., 250(1):1–20, 2007.
[7] H. Davenport, P. Erdős, and W. J. Le Veque. On Weyl’s criterion for uniform distribution. Michigan Math. J., 10:311–314, 1963.
[8] Paul Erdős. On a family of symmetric Bernoulli convolutions. Amer. J. Math., 61:974–976, 1939.
