Billiards on pythagorean triples and their Minkowski functions

Giovanni Panti

arXiv:1902.00414·math.NT·January 23, 2020

Billiards on pythagorean triples and their Minkowski functions

Giovanni Panti

PDF

TL;DR

This paper explores hyperbolic billiard tables related to Pythagorean triples, analyzing their dynamics, invariant measures, and a Minkowski-like function that links algebraic properties to geometric structures.

Contribution

It unifies Pythagorean triple enumeration with hyperbolic billiards, computes invariant densities, and introduces a Minkowski question mark function analog for these systems.

Findings

01

Invariant densities of the billiard maps are computed.

02

Lagrange and Galois theorems are extended to these billiard systems.

03

A singular, Holder continuous conjugacy function is explicitly constructed.

Abstract

It has long been known that the set of primitive pythagorean triples can be enumerated by descending certain ternary trees. We unify these treatments by considering hyperbolic billiard tables in the Poincare disk model. Our tables have m>=3 ideal vertices, and are subject to the restriction that reflections in the table walls are induced by matrices in the triangle group PSU^\pm_{1,1}\Zbb[i]. The resulting billiard map \tilde B acts on the de Sitter space x_1^2+x_2^2-x_3^2=1, and has a natural factor B on the unit circle, the pythagorean triples appearing as the B-preimages of fixed points. We compute the invariant densities of these maps, and prove the Lagrange and Galois theorems: A complex number of unit modulus has a preperiodic (purely periodic) B-orbit precisely when it is quadratic (and isolated from its conjugate by a billiard wall) over Q(i). Each B as above is a (m-1)-to-1…

Figures7

Click any figure to enlarge with its caption.

Equations217

L = 11 - 1

L = 11 - 1

L

L

C*\bigl{(}\mu(\bm{y})\bigr{)}=\begin{bmatrix}1&-i\\ -i&1\end{bmatrix}*\frac{y_{1}+y_{3}i}{1-y_{2}}=\frac{y_{1}+y_{2}i}{1+y_{3}}=\tau(\bm{y}).

C*\bigl{(}\mu(\bm{y})\bigr{)}=\begin{bmatrix}1&-i\\ -i&1\end{bmatrix}*\frac{y_{1}+y_{3}i}{1-y_{2}}=\frac{y_{1}+y_{2}i}{1+y_{3}}=\tau(\bm{y}).

π ([x_{1}, x_{2}, x_{3}, 0])

π ([x_{1}, x_{2}, x_{3}, 0])

τ_{0} ([x_{1}, x_{2}, x_{3}, 0])

η ([x_{1}, x_{2}, x_{3}, 0])

\operatorname{PSU}^{\pm}_{1,1}\mathbb{C}=\biggl{\{}\begin{pmatrix}\alpha&\beta\\ \bar{\beta}&\bar{\alpha}\end{pmatrix}\in\operatorname{GL}_{2}\mathbb{C}:\bigl{|}\lvert\alpha\rvert^{2}-\lvert\beta\rvert^{2}\bigr{|}=1\biggr{\}}\Big{/}\pm I,

\operatorname{PSU}^{\pm}_{1,1}\mathbb{C}=\biggl{\{}\begin{pmatrix}\alpha&\beta\\ \bar{\beta}&\bar{\alpha}\end{pmatrix}\in\operatorname{GL}_{2}\mathbb{C}:\bigl{|}\lvert\alpha\rvert^{2}-\lvert\beta\rvert^{2}\bigr{|}=1\biggr{\}}\Big{/}\pm I,

[α \overset{ˉ}{β} β \overset{α}{ˉ}] * z = {(α z + β) / (\overset{ˉ}{β} z + \overset{α}{ˉ}), (β \overset{z}{ˉ} + α) / (\overset{α}{ˉ} \overset{z}{ˉ} + \overset{ˉ}{β}), if ∣ α ∣^{2} - ∣ β ∣^{2} = 1; if ∣ α ∣^{2} - ∣ β ∣^{2} = - 1 .

[α \overset{ˉ}{β} β \overset{α}{ˉ}] * z = {(α z + β) / (\overset{ˉ}{β} z + \overset{α}{ˉ}), (β \overset{z}{ˉ} + α) / (\overset{α}{ˉ} \overset{z}{ˉ} + \overset{ˉ}{β}), if ∣ α ∣^{2} - ∣ β ∣^{2} = 1; if ∣ α ∣^{2} - ∣ β ∣^{2} = - 1 .

W = (- w_{2} + w_{3} - w_{1} - w_{1} w_{2} + w_{3}),

W = (- w_{2} + w_{3} - w_{1} - w_{1} w_{2} + w_{3}),

[cos (t) sin (t) - sin (t) cos (t)] [exp (t /2) exp (- t /2)] [1 t 1] \mapsto cos (- 2 t) sin (- 2 t) - sin (- 2 t) cos (- 2 t) 1, \mapsto 1 cosh (t) sinh (t) sinh (t) cosh (t), \mapsto 1 t t - t 1 - t^{2} /2 - t^{2} /2 t t^{2} /2 1 + t^{2} /2 .

[cos (t) sin (t) - sin (t) cos (t)] [exp (t /2) exp (- t /2)] [1 t 1] \mapsto cos (- 2 t) sin (- 2 t) - sin (- 2 t) cos (- 2 t) 1, \mapsto 1 cosh (t) sinh (t) sinh (t) cosh (t), \mapsto 1 t t - t 1 - t^{2} /2 - t^{2} /2 t t^{2} /2 1 + t^{2} /2 .

J

J

F

P

G

K

K

⟨ x_{0}, \dots, x_{m - 1} ∣ x_{0}^{2} = \dots = x_{m - 1}^{2} = (x_{0} x_{1})^{e_{0}} = \dots = (x_{m - 1} x_{0})^{e_{m - 1}} = 1 ⟩

⟨ x_{0}, \dots, x_{m - 1} ∣ x_{0}^{2} = \dots = x_{m - 1}^{2} = (x_{0} x_{1})^{e_{0}} = \dots = (x_{m - 1} x_{0})^{e_{m - 1}} = 1 ⟩

R_{w} (x) = x - \frac{2 ⟨ w , x ⟩}{⟨ w , w ⟩} w,

R_{w} (x) = x - \frac{2 ⟨ w , x ⟩}{⟨ w , w ⟩} w,

R_{w} = I - \frac{2}{⟨ w , w ⟩} w w^{⊤} L,

R_{w} = I - \frac{2}{⟨ w , w ⟩} w w^{⊤} L,

C^{- 1} [a + bi c - d i c + d i a - bi] C = [a + d - b + c b + c a - d],

C^{- 1} [a + bi c - d i c + d i a - bi] C = [a + d - b + c b + c a - d],

μ = (a_{1} + b_{1} i)^{e_{1}} \dots (a_{q} + b_{q} i)^{e_{q}},

μ = (a_{1} + b_{1} i)^{e_{1}} \dots (a_{q} + b_{q} i)^{e_{q}},

Q = (q_{1} - q_{2} /2 - q_{2} /2 q_{3}),

Q = (q_{1} - q_{2} /2 - q_{2} /2 q_{3}),

ω = \frac{q _{2} + 1}{2 q _{1}}, α = \frac{q _{2} - 1}{2 q _{1}} .

ω = \frac{q _{2} + 1}{2 q _{1}}, α = \frac{q _{2} - 1}{2 q _{1}} .

E = ⎩ ⎨ ⎧ [1 α 1], [ω 1 - 1], ∣ ω - α ∣^{- 1/2} [ω 1 α 1] J^{e (sgn (ω - α))}, if ω = \infty; if α = \infty; otherwise;

E = ⎩ ⎨ ⎧ [1 α 1], [ω 1 - 1], ∣ ω - α ∣^{- 1/2} [ω 1 α 1] J^{e (sgn (ω - α))}, if ω = \infty; if α = \infty; otherwise;

Q^{'} = - \frac{1}{2} (E^{- 1})^{⊤} (11) E^{- 1};

Q^{'} = - \frac{1}{2} (E^{- 1})^{⊤} (11) E^{- 1};

E = \frac{1}{2 ∣ q _{1} ∣ ^{1/2}} [q_{2} + 1 2 q_{1} q_{2} - 1 2 q_{1}] J^{e (sgn q_{1})},

E = \frac{1}{2 ∣ q _{1} ∣ ^{1/2}} [q_{2} + 1 2 q_{1} q_{2} - 1 2 q_{1}] J^{e (sgn q_{1})},

E^{- 1} = \frac{1}{2 ∣ q _{1} ∣ ^{1/2}} J^{e (sgn q_{1})} [2 q_{1} - 2 q_{1} - q_{2} + 1 q_{2} + 1] .

E^{- 1} = \frac{1}{2 ∣ q _{1} ∣ ^{1/2}} J^{e (sgn q_{1})} [2 q_{1} - 2 q_{1} - q_{2} + 1 q_{2} + 1] .

Q^{'}

Q^{'}

= - (sgn q_{1}) \frac{1}{8 ∣ q _{1} ∣} (2 q_{1} - q_{2} + 1 - 2 q_{1} q_{2} + 1) (11) (2 q_{1} - 2 q_{1} - q_{2} + 1 q_{2} + 1)

= - \frac{1}{8 q _{1}} (- 8 q_{1}^{2} 4 q_{1} q_{2} 4 q_{1} q_{2} - 2 q_{2}^{2} + 2),

w_{1} w_{2} w_{3} = - 1 1 111 q_{1} q_{2} q_{3} .

w_{1} w_{2} w_{3} = - 1 1 111 q_{1} q_{2} q_{3} .

q

q

q

q

w = \frac{Lt ^{'} \times Lt}{⟨ t ^{'} , t ⟩},

w = \frac{Lt ^{'} \times Lt}{⟨ t ^{'} , t ⟩},

(\omega,\alpha)=\bigl{(}(\mu\circ\upsilon)(\bm{t}^{\prime}),(\mu\circ\upsilon)(\bm{t})\bigr{)}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Billiards on pythagorean triples

and their Minkowski functions

Giovanni Panti

Department of Mathematics, Computer Science and Physics

University of Udine

via delle Scienze 206

33100 Udine, Italy

[email protected]

Abstract.

It has long been known that the set of primitive pythagorean triples can be enumerated by descending certain ternary trees. We unify these treatments by considering hyperbolic billiard tables in the Poincaré disk model. Our tables have $m\geq 3$ ideal vertices, and are subject to the restriction that reflections in the table walls are induced by matrices in the triangle group $\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ . The resulting billiard map $\widetilde{B}$ acts on the de Sitter space $x_{1}^{2}+x_{2}^{2}-x_{3}^{2}=1$ , and has a natural factor $B$ on the unit circle, the pythagorean triples appearing as the $B$ -preimages of fixed points. We compute the invariant densities of these maps, and prove the Lagrange and Galois theorems: A complex number of unit modulus has a preperiodic (purely periodic) $B$ -orbit precisely when it is quadratic (and isolated from its conjugate by a billiard wall) over $\mathbb{Q}(i)$ .

Each $B$ as above is a $(m-1)$ -to- $1$ orientation-reversing covering map of the circle, a property shared by the group character $T(z)=z^{-(m-1)}$ . We prove that there exists a homeomorphism $\Phi$ , unique up to postcomposition with elements in a dihedral group, that conjugates $B$ with $T$ ; in particular $\Phi$ —whose prototype is the classical Minkowski question mark function— establishes a bijection between the set of points of degree $\leq 2$ over $\mathbb{Q}(i)$ and the torsion subgroup of the circle. We provide an explicit formula for $\Phi$ , and prove that $\Phi$ is singular and Hölder continuous with exponent $\log(m-1)$ divided by the maximal periodic mean free path in the associated billiard table.

2020 Math. Subj. Class.: 37D40; 11J70.

The author is partially supported by the research project SiDiA of the University of Udine.

1. Introduction

Rational points in the real projective line $\operatorname{P}^{1}\mathbb{R}$ involve two integers, a numerator and a denominator; we can enumerate them by reversing the euclidean algorithm or —equivalently— taking inverse branches of continued fraction maps. Rational points in the unit circle $S^{1}$ involve three integers, the two legs and the hypotenuse of a pythagorean triangle. As the line and the circle can be mutually parametrized with preservation of rational points, the complexity of the enumeration is the same, and there is a line of work (starting from [6], and running through [4], [11], [3], [33], [15] and references therein) describing how pythagorean triples can be generated by descending trees.

Ascending the same trees amounts to iterating continued fraction maps, and in [42] Romik analyzes one such map, relating it to the geodesic flow on the three-punctured sphere. It turns out that Romik’s map can also be seen as the Gauss map of even continued fractions; see [2, §4], [15, §5], [7, §2] for various developments.

Although there is a birational bijection with rational coefficients between the line and the circle, continued fraction maps on the two spaces are not exactly the same thing. Indeed, the rational symmetry group of the projective line is the extended modular group $\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ , while that of the circle is $\operatorname{SO}_{2,1}\mathbb{Z}$ , the stabilizer of the Lorentz form inside $\operatorname{SL}_{3}\mathbb{Z}$ . When embedded in a larger ambient group —say $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ — they appear as the $(2,3,\infty)$ and the $(2,4,\infty)$ extended triangle groups, and neither is a subgroup of the other (of course, they are commensurable).

In this paper we develop continued fraction maps (of the “slow” type, that is with parabolic fixed points) directly on the circle, as factors of billiard maps determined by ideal polygons in the hyperbolic plane. We summarize our main results as follows:

•

Let $D$ be a polygon in the Poincaré disk having $m\geq 3$ vertices, all at the boundary at infinity $S^{1}$ . Let $B:S^{1}\to S^{1}$ be the map that sends the interval between two vertices to the union of the remaining intervals via reflection in the corresponding polygon side. Let $T$ be the group character $z\mapsto z^{-(m-1)}$ . Then $B$ and $T$ are conjugate by an essentially unique homeomorphism $\Phi$ , which provides a bijection between the set of points of degree at most $2$ over $\mathbb{Q}(i)$ and the torsion subgroup of $S^{1}$ . The homeomorphism $\Phi$ is singular and Hölder continuous, of exponent $\log(m-1)$ divided by the maximal mean free path (see Definition 10.3) of periodic trajectories in the hyperbolic billiard determined by $D$ .

The route leading to the above statement is somehow long; we offer two justifications.

(1)

The end result is a flexible and applicable tool. Indeed, the maximal mean free path referred to above equals twice the logarithm of the joint spectral radius of the set $\Sigma$ of matrices expressing reflections in the billiard walls. When the vertices of $D$ determine a unimodular partition of $S^{1}$ (an arithmetical condition explained in §5), this joint spectral radius can often be explicitly computed; see Example 10.6. 2. (2)

Along that route we encounter fair landscapes.

We describe our route: in §2 we determine finite sets of reflections generating the orthogonal group $\operatorname{O}_{2,1}\mathbb{Z}$ and its subgroups $\operatorname{SO}_{2,1}\mathbb{Z}$ and $\operatorname{O}^{\uparrow}_{2,1}\mathbb{Z}$ , the latter being the stabilizer of the upper sheet of the hyperboloid $x_{1}^{2}+x_{2}^{2}-x_{3}^{2}=-1$ . Then, as a warmup, in §3 we review the construction of the Romik map using our formalism. In §4 we provide explicit $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ -equivariant bijections between the homogeneous space $\operatorname{PSL}_{2}\mathbb{R}/\{\text{diagonal matrices}\}$ , the de Sitter space $x_{1}^{2}+x_{2}^{2}-x_{3}^{2}=1$ , the space of oriented geodesics in the hyperbolic plane, and that of quadratic forms of discriminant $1$ . These correspondences are known, but since they appear scattered in the literature and some care is required to extend the acting group from the usual $\operatorname{PSL}_{2}\mathbb{R}$ to the full $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ , our brief self-contained treatment in Theorem 4.1 may have some value. In §5 we treat unimodular partitions of the circle; a reader not interested in arithmetical issues may safely skip Theorems 5.3 and 5.5.

The preliminaries being over, we introduce in §6 our continued fraction maps $B$ as factors of billiard maps $\widetilde{B}$ associated to ideal polygons whose vertices form a unimodular partition of the circle. Reflections in the table walls are expressed by elements of $\operatorname{PSU}_{1,1}^{\pm}\mathbb{Z}[i]$ —which we naturally take as matrices— in the Poincaré model, and by matrices in $\operatorname{O}^{\uparrow}_{2,1}\mathbb{Z}$ in the Klein model. Here the de Sitter space plays a twofold rôle, as the phase space of $\widetilde{B}$ as well as the space of shrinking intervals, this double nature being reflected in a double action of $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ ; see Remark 5.2. In §7 we use the bijections in §4 to characterize the natural extension and the absolutely continuous invariant measure of $B$ . In §8 we show that the map $B$ and the extended fuchsian group generated by $\Sigma$ are orbit-equivalent, and prove the following statement, which combines the classical Lagrange and Galois theorems. A complex number of unit modulus is quadratic over $\mathbb{Q}(i)$ if and only if its $B$ -orbit is eventually periodic; moreover, if this is the case, then the conjugate point has the reverse period, and the two points are purely periodic precisely when they are separated by a billiard wall.

In §9 we introduce the conjugacy alluded to above. It is a natural conjugacy; indeed, $B$ is an $(m-1)$ -to- $1$ orientation-reversing covering map of the circle, a topological property shared by precisely one group character, namely $T(z)=z^{-(m-1)}$ . We thus have a “linearized” version of a continued fraction map, precisely as the tent map on $[0,1]$ is a linearized version of the Farey map. It turns out (Lemma 9.3) that the natural symbolic coding of points via $B$ , as well as the analogous coding via $T$ , characterizes the ternary betweenness relation on the circle. Since the latter relation determines the circle topology, we obtain in Theorem 9.2 that $B$ and $T$ are conjugate by a homeomorphism $\Phi$ , unique up to postcomposition with elements of the dihedral group with $2m$ elements. This homeomorphism is the analogue of the classical Minkowski question mark function [19], [43], [27], which conjugates the Farey map with the tent map. We provide in Theorem 9.4 an explicit expression for $\Phi$ analogous to the Denjoy-Salem formula [43, p. 436] for the question mark function, and show in Examples 8.4 and 9.5 how the arithmetic properties of $B$ and $T$ are intertwined by $\Phi$ . In Theorem 10.1 we provide an ergodic-theoretic proof of the fact that $\Phi$ has zero derivative at Lebesgue-all points.

In the final Theorem 10.5 we complete the proof of the connection sketched above between the joint spectral radius of $\Sigma$ and the Hölder exponent of $\Phi$ . In all instances we examined the Lagarias-Wang finiteness conjecture ([31], see §10) turned out to be true for $\Sigma$ , and a maximizing periodic billiard trajectory was easily guessed and verified. It is plausible that the conjecture holds for all billiard tables determined by unimodular partitions of the circle, and we leave this as an interesting open problem.

2. Notation and preliminaries

Since we treat various spaces of matrices, we will distinguish them notationally, by using boldface for $3\times 3$ matrices and lightface for $2\times 2$ ones. Points in $\mathbb{R}^{3}$ are written in boldface and are always column vectors, although we may write $\bm{x}=(x_{1},x_{2},x_{3})$ for typographical reasons. We will use square or round brackets for vectors and matrices, according whether we are in a projective setting (that is, up to multiplication by nonzero scalars) or in a linear-algebra one. Zero entries in matrices are replaced by blank spaces.

Let

[TABLE]

be the matrix of the three-variable Lorentz quadratic form, and let $\langle\bm{x},\bm{y}\rangle=\bm{x}^{\top}\bm{L}\bm{y}$ be the corresponding symmetric bilinear map. The upper sheet $\mathcal{L}=\{\bm{x}:\langle\bm{x},\bm{x}\rangle=-1,\,x_{3}>0\}$ of the $2$ -sheeted hyperboloid $\langle\bm{x},\bm{x}\rangle=-1$ is one of the standard models of the hyperbolic plane, other models being the upper halfplane $\mathcal{H}=\{z\in\mathbb{C}:\operatorname{im}z>0\}$ , the Klein disk $\mathcal{K}=\{[x_{1},x_{2},x_{3}]\in\operatorname{P}^{2}\mathbb{R}:x_{1}^{2}+x_{2}^{2}<x_{3}^{2}\}$ , and the Poincaré disk $\mathcal{D}=\{z\in\mathbb{C}:\lvert z\rvert<1\}$ ; we refer the reader to [10] for an enjoyable introduction to hyperbolic geometry. We need explicit bijections between these models, so we introduce a fifth auxiliary model, namely the upper hemisphere $\mathcal{J}=\{\bm{x}\in\mathbb{R}^{3}:x_{1}^{2}+x_{2}^{2}+x_{3}^{2}=1,\,x_{3}>0\}$ , and state a lemma.

Lemma 2.1.

The spaces $\mathcal{L}$ , $\mathcal{H}$ , $\mathcal{K}$ , $\mathcal{D}$ , $\mathcal{J}$ are in bijective correspondence via the commuting diagram

[TABLE]

where

•

$\pi:\mathbb{R}^{3}\setminus\{0\}\to\operatorname{P}^{2}\mathbb{R}$ * is the natural quotient map,*

•

$\tau_{0}$ * is the stereographic projection through $(0,0,-1)$ ,*

•

$\eta(\bm{x})=(x_{1}+i)/(x_{3}-x_{2})$ ,

•

$\upsilon([x_{1},x_{2},x_{3}])=\bigl{(}x_{1}/x_{3},x_{2}/x_{3},(x_{3}^{2}-x_{1}^{2}-x_{2}^{2})^{1/2}/x_{3}\bigr{)}$ * is the “vertical” projection,*

•

$\mu$ * is the stereographic projection through $(0,1,0)$ to the halfplane $\{x_{2}=0,x_{3}>0\}$ , followed by the obvious identification of the latter with $\mathcal{H}$ ,*

•

$\tau$ * is the stereographic projection through $(0,0,-1)$ to the disk $\{x_{1}^{2}+x_{2}^{2}<1,x_{3}=0\}$ , followed by the obvious identification of the latter with $\mathcal{D}$ ,*

•

$C$ * is the Möbius transformation $z\mapsto C*z=(z-i)/(-iz+1)$ induced by the Cayley matrix $C=2^{-1/2}\bigl{[}\begin{smallmatrix}1&-i\\ -i&1\end{smallmatrix}\bigr{]}\in\operatorname{PSL}_{2}\mathbb{C}$ (as customary, we blur the distinction between matrices and the maps they induce).*

These correspondences extend to the respective ideal boundaries.

Proof.

The proof reduces to a commentary on the figure on page $70$ of [10]. The upper-left triangle commutes because $\upsilon\circ\pi$ sends $\bm{x}=(x_{1},x_{2},x_{3})\in\mathcal{L}$ to $\bigl{(}x_{1},x_{2},(x_{3}^{2}-x_{1}^{2}-x_{2}^{2})^{1/2}\bigr{)}/x_{3}=(x_{1},x_{2},1)/x_{3}=(1/x_{3})(x_{1},x_{2},x_{3})+(1-1/x_{3})(0,0,-1)$ . The upper-right triangle commutes because $\mu$ sends $(x_{1},x_{2},1)/x_{3}\in\mathcal{J}$ to $x_{1}/(x_{3}-x_{2})+i/(x_{3}-x_{2})=\eta(\bm{x})$ . The lower-right triangle commutes because, given $\bm{y}\in\mathcal{J}$ ,

[TABLE]

The fact that these correspondences extend to the ideal boundaries is obvious as soon as the boundary $\partial\mathcal{L}$ of $\mathcal{L}$ and the maps $\pi$ , $\tau_{0}$ , $\eta$ on it are properly defined. We see $\partial\mathcal{L}$ as the intersection of the projective closure of $\mathcal{L}\cup(-\mathcal{L})$ (i.e., the variety $x_{1}^{2}+x_{2}^{2}-x_{3}^{2}+x_{4}^{2}=0$ in $\operatorname{P}^{3}\mathbb{R}$ ) with the plane at infinity $x_{4}=0$ , and we set

[TABLE]

We can then view $[x_{1},x_{2},x_{3},0]\in\partial\mathcal{L}$ as the limit (in the euclidean metric of an appropriate local chart) of $\bm{x}(t)=t\bigl{(}x_{1},x_{2},(x_{1}^{2}+x_{2}^{2}+1/t^{2})^{1/2}\bigr{)}\in\mathcal{L}$ , for $t\to+\infty$ . An easy computation shows that the $\pi$ -, $\tau_{0}$ -, $\eta$ -images of $[x_{1},x_{2},x_{3},0]\in\partial\mathcal{L}$ , as defined above, agree with the limits (in the euclidean metric) of $\pi(\bm{x}(t))$ , $\tau_{0}(\bm{x}(t))$ , $\eta(\bm{x}(t))$ , for $t\to+\infty$ . This guarantees the required commutativity. ∎

It is well known that the orthogonal group $\operatorname{O}_{2,1}\mathbb{R}$ of the Lorentz form has four connected components, namely the component of the identity (which is a normal subgroup) and its cosets with respect to the diagonal matrices having diagonal entries $(-1,1,1)$ , $(1,1,-1)$ , $(-1,1,-1)$ . The union of the component of the identity with its $(-1,1,-1)$ -coset is the special orthogonal group $\operatorname{SO}_{2,1}\mathbb{R}$ , while its union with the $(-1,1,1)$ -coset is the group $\operatorname{O}_{2,1}^{\uparrow}\mathbb{R}$ of all matrices that preserve $\mathcal{L}$ ; equivalently, $\operatorname{O}_{2,1}^{\uparrow}\mathbb{R}=\{\bm{A}\in\operatorname{O}_{2,1}\mathbb{R}:\text{the$ (3,3) $-entry of$ \bm{A} $is$ >0 $}\}$ . We will write $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{R}=\operatorname{SO}_{2,1}\mathbb{R}\cap\operatorname{O}^{\uparrow}_{2,1}\mathbb{R}$ for the component of the identity.

The group of isometries (including the orientation-reversing ones) of $\mathcal{H}$ is $\operatorname{PSL}^{\pm}_{2}\mathbb{R}=\{A\in\operatorname{GL}_{2}\mathbb{R}:\lvert\det A\rvert=1\}/\{\pm I\}$ , which acts on $\mathcal{H}$ as follows: given $A=\bigl{[}\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}\bigr{]}$ , then $A*z$ equals $(az+b)/(cz+d)$ if $\det A=1$ , and equals $(a\bar{z}+b)/(c\bar{z}+d)$ if $\det A=-1$ . Conjugating $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ with the Cayley matrix we obtain the group

[TABLE]

which acts on $\mathcal{D}$ via

[TABLE]

We construct an isomorphic representation $\operatorname{PSL}^{\pm}_{2}\mathbb{R}\to\operatorname{O}^{\uparrow}_{2,1}\mathbb{R}$ by identifying the vector $\bm{w}=(w_{1},w_{2},w_{3})\in\mathbb{R}^{3}$ with the matrix

[TABLE]

on which $A\in\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ acts on the left by $W\mapsto(A^{-1})^{\top}WA^{-1}$ . This is a well defined action, independent from the lift of $A$ to $\operatorname{SL}^{\pm}_{2}\mathbb{R}$ , linear, and preserving the form $\langle\bm{w},\bm{w}\rangle=-\det W$ . Computing the images of the $1$ -parameter subgroups in the Iwasawa decomposition of $\operatorname{PSL}_{2}\mathbb{R}$ provides a geometric picture of the representation, namely

[TABLE]

Convention 2.2.

In order to simplify notation we adopt the convention that, whenever a matrix in $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ is denoted by a certain capital letter, then its image under the above representation, and its $C$ -conjugate, are denoted by the same capital letter in bold and in calligraphic fonts, respectively. With this understanding, we give names to a few matrices that will recur throughout this paper.

[TABLE]

Explicit computation —which we omit— shows that $\eta\circ\bm{A}=A\circ\eta$ on $\mathcal{L}$ , for every $A$ in the above $1$ -parameter subgroups, and also for $A=J$ ; therefore the identity $\eta\circ\bm{A}=A\circ\eta$ holds for every $A\in\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ . The action of $\operatorname{O}^{\uparrow}_{2,1}\mathbb{R}$ on $\mathbb{R}^{3}$ descends to a projective action on $\operatorname{P}^{2}\mathbb{R}$ that fixes the Klein model $\mathcal{K}$ and its boundary $\partial\mathcal{K}$ . These observations, together with Lemma 2.1, imply that for every $A\in\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ the diagram

[TABLE]

commutes. The analogous diagram involving the ideal boundaries of $\mathcal{K},\mathcal{D},\mathcal{H}$ commutes as well, and actually simplifies. Indeed, the nontrivial bijection $\tau\circ\upsilon$ reduces on $\partial\mathcal{K}$ to the obvious identification $[x_{1},x_{2},x_{3}]\mapsto(x_{1}+x_{2}i)/x_{3}$ , while $C^{-1}\circ\tau\circ\upsilon$ reduces to the stereographic projection through $[0,1,1]$ , namely $[x_{1},x_{2},x_{3}]\mapsto x_{1}/(x_{3}-x_{2})$ . We will thus switch freely between $\partial\mathcal{K}$ and $\partial\mathcal{D}$ , using $S^{1}$ as a neutral name for both.

Let $D$ be a polygon in $\mathcal{H}$ , bounded by $m\geq 3$ geodesics $l_{0},\ldots,l_{m-1}$ , and having angles at vertices $\pi/e_{0},\ldots,\pi/e_{m-1}$ , with $e_{0},\ldots,e_{m-1}$ integers $\geq 2$ or $\infty$ (if the corresponding vertex lies in $\partial\mathcal{H}$ ); the Gauss-Bonnet formula forces $m-2>\sum_{a}e_{a}^{-1}$ . The extended Coxeter group associated to $D$ is the subgroup $\Gamma^{\pm}$ of $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ generated by the reflections in the sides of $D$ . It has the presentation

[TABLE]

(with the understanding that relators $(x_{a}x_{a+1})^{\infty}$ do not appear), and $D$ is a fundamental domain for it. Its index- $2$ subgroup of orientation-preserving elements $\Gamma=\Gamma^{\pm}\cap\operatorname{PSL}_{2}\mathbb{R}$ is a fuchsian group of finite covolume; see [28], [32]. When $D$ is a triangle we write $\Delta(e_{0},e_{1},e_{2})$ and $\Delta^{\pm}(e_{0},e_{1},e_{2})$ for $\Gamma$ and $\Gamma^{\pm}$ , referring to them as a triangle group and an extended triangle group, respectively (the adjective extended stresses the fact that orientation-reversing isometries are allowed; in both cases, the action on $\mathcal{H}$ is properly discontinuous). Note that the numbers $e_{0},e_{1},e_{2}$ determine the triangle up to isometry, and hence the groups up to conjugation. We will freely use all of the above terminology when working in other models of the hyperbolic plane.

Let us return to the Lorentz form $\langle\operatorname{--},\operatorname{--}\rangle$ . We recall that, given a nonisotropic vector $\bm{w}$ , the reflection $\bm{R_{w}}$ is the unique linear involution of $\mathbb{R}^{3}$ that fixes pointwise the polar hyperplane $\{\bm{x}:\langle\bm{w},\bm{x}\rangle=0\}$ and exchanges $\bm{w}$ with $-\bm{w}$ . An easy computation (of course, all of this is well known) shows that:

(i)

[TABLE]

(ii)

$\bm{R_{w}}$ preserves $\langle\operatorname{--},\operatorname{--}\rangle$ ,

(iii)

in terms of matrices,

[TABLE]

(iv)

$\bm{R_{w}}\in\operatorname{O}_{2,1}^{\uparrow}\mathbb{R}$ if and only if $\langle\bm{w},\bm{w}\rangle>0$ .

Notation 2.3.

•

$\operatorname{O}_{2,1}\mathbb{Z}$ (respectively, $\operatorname{SO}_{2,1}\mathbb{Z}$ , $\operatorname{O}_{2,1}^{\uparrow}\mathbb{Z}$ , $\operatorname{SO}_{2,1}^{\uparrow}\mathbb{Z}$ ) is the intersection of $\operatorname{O}_{2,1}\mathbb{R}$ (respectively, $\operatorname{SO}_{2,1}\mathbb{R}$ , $\operatorname{O}_{2,1}^{\uparrow}\mathbb{R}$ , $\operatorname{SO}_{2,1}^{\uparrow}\mathbb{R}$ ) with $\operatorname{GL}_{3}\mathbb{Z}$ .

•

$\operatorname{PSL}^{\pm}_{2}\mathbb{Z}=\{A\in\operatorname{PSL}^{\pm}_{2}\mathbb{R}:\text{$ A $has entries in$ \mathbb{Z} $}\}$ .

•

$\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]=\{\mathscr{A}\in\operatorname{PSU}^{\pm}_{1,1}\mathbb{C}:\mathscr{A}\text{ has entries in }\mathbb{Z}[i]\}$ .

•

$\langle F,P,G\rangle^{+}$ (and analogously for other groups generated by involutions) is the group of all products of an even number of elements in $\{F,P,G\}$ .

The four matrices $\bm{J},\bm{F},\bm{P},\bm{G}$ in (2.3) are in $\operatorname{O}^{\uparrow}_{2,1}\mathbb{Z}$ ; in particular they are of the form $\bm{R_{w}}$ , for $\bm{w}$ equal to $(1,0,0)$ , $(0,1,0)$ , $(1,1,1)$ , $(-1,1,0)$ , respectively. In [35] it is proved that the five reflections $\bm{J}$ , $\bm{F}$ , $\bm{R}_{(0,0,1)}=\operatorname{diag}(1,1,-1)$ , $\bm{R}_{(1,1,0)}=\bm{J}\bm{G}\bm{J}$ , $\bm{P}$ generate $\operatorname{O}_{2,1}\mathbb{Z}$ (see [17] for an elementary proof which avoids the theory of Kac-Moody Lie algebras); we give an independent and expanded version in the following theorem.

Theorem 2.4.

We have $\operatorname{O}_{2,1}^{\uparrow}\mathbb{Z}=\langle\bm{F}$ , $\bm{P}$ , $\bm{G}\rangle$ , which is isomorphic to the extended triangle group $\Delta^{\pm}(2,4,\infty)$ ; adding $\bm{R}_{(0,0,1)}$ as a further generator we obtain the full group $\operatorname{O}_{2,1}\mathbb{Z}$ . The group $\langle\bm{F},\bm{P},\bm{J}\rangle$ is an index- $2$ subgroup of $\operatorname{O}_{2,1}^{\uparrow}\mathbb{Z}$ , and equals $\Delta^{\pm}(2,\infty,\infty)$ ; its image $\langle\mathscr{F},\mathscr{P},\mathscr{J}\rangle$ inside $\operatorname{PSU}^{\pm}_{1,1}\mathbb{C}$ is $\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ .

Proof.

We work in $\mathcal{H}$ . Let $\Gamma=\{A\in\operatorname{PSL}_{2}\mathbb{R}:\bm{A}\in\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}\}$ ; then, by definition, $\Gamma$ is an arithmetic fuchsian group. We observe that $\langle F,P,G\rangle^{+}$ is the triangle group $\Delta(2,4,\infty)$ . Indeed $F,P,G$ are the reflections in the three geodesics

•

$l_{0}$ , whose endpoints are $1$ and $-1$ ;

•

$l_{1}$ , whose endpoints are $\infty$ and $1$ ;

•

$l_{2}$ , whose endpoints are $1-\sqrt{2}$ and $1+\sqrt{2}$ .

These geodesics determine a triangle $D$ in $\mathcal{H}$ with vertices at $1+i\sqrt{2}$ with angle $\pi/2$ , at $i$ with angle $\pi/4$ , and at the ideal point $1$ with angle [math].

Clearly $\langle F,P,G\rangle^{+}$ is a subgroup of $\Gamma$ , and it is well-known that a fuchsian group containing a triangle group must itself be a triangle group [44, §6]. The partially ordered set of all nine non-cocompact arithmetic triangle groups has been determined by Takeuchi in [46], and $\Delta(2,4,\infty)$ is maximal in it; therefore $\Gamma=\langle F,P,G\rangle^{+}$ . Adding $F$ as a further generator to $\langle F,P,G\rangle^{+}$ we obtain $\langle F,P,G\rangle=\{A\in\operatorname{PSL}^{\pm}_{2}\mathbb{R}:\bm{A}\in\operatorname{O}^{\uparrow}_{2,1}\mathbb{Z}\}$ , as claimed.

For the second statement, observe that replacing the generator $G$ with $J$ means replacing $l_{2}$ with the geodesic $l^{\prime}_{2}$ whose endpoints are [math] and $\infty$ . The polygon determined by $l_{0},l_{1},l^{\prime}_{2}$ is the triangle $D^{\prime}=D\cup G[D]$ , with angles $\pi/2$ at $i$ , and [math] at $1$ and at $\infty$ ; hence $\langle F,P,J\rangle$ is the extended triangle group $\Delta^{\pm}(2,\infty,\infty)$ . Clearly $\langle\mathscr{F},\mathscr{P},\mathscr{J}\rangle\leq\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ , and by computing

[TABLE]

we see that $C^{-1}\bigl{(}\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]\bigr{)}C$ is a subgroup of $\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ . Taking into account the respective fundamental domains, it is easy to check that $\langle F,P,J\rangle$ has index $3$ in $\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ ; therefore $C^{-1}\bigl{(}\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]\bigr{)}C$ equals either $\langle F,P,J\rangle$ or the full $\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ . However, this second possibility is ruled out by the fact that $\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ (which is the extended $(2,3,\infty)$ triangle group) contains elements of order $3$ , and hence of trace $1$ (up to sign), while clearly no element of $\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ may have trace $1$ . ∎

3. Pythagorean triples and the Romik map

A [primitive] pythagorean triple is a point $\bm{t}=(t_{1},t_{2},t_{3})\in\mathbb{Z}^{3}$ such that $t_{3}>0$ , $\gcd(t_{1},t_{2},t_{3})=1$ , and $t_{1}^{2}+t_{2}^{2}=t_{3}^{2}$ . Pythagorean triples correspond bijectively to rational points in the unit circle, which in turn correspond, via stereographic projection, to points in $P^{1}\mathbb{Q}$ . These correspondences provide various techniques for enumerating triples, among which the one known to Euclid: given any reduced fraction $a/b$ , the triple $(a^{2}-b^{2},2ab,a^{2}+b^{2})/\gcd(a^{2}-b^{2},2ab,a^{2}+b^{2})$ is pythagorean, and every pythagorean triple is uniquely obtainable in this way (the gcd in the denominator is $1$ if $2\mid ab$ , and $2$ otherwise). As noted in the introduction, many techniques are cast in the form of the descent of a binary or ternary tree.

A remarkable connection with the theory of continued fractions is offered in [42]; as a warmup, we sketch it using our notation. We partition $S^{1}$ in four quarters $I_{0},I_{1},I_{2},I_{3}$ , with $I_{a}=\{\exp(2\pi ti):a/4\leq t\leq(a+1)/4\}$ . Let $\bm{A}=\bm{R}_{(1,-1,1)}=\bm{F}\bm{P}\bm{F}$ . Then $\bm{A}$ acts on $S^{1}$ (viewed as $\partial\mathcal{K}$ , see the diagram (2.4) and the resulting identifications) by exchanging $\bm{x}$ with the other point of intersection of $S^{1}$ with the line through $\bm{x}$ and $[1,-1,1]$ ; the interval $I_{3}$ is thus bijectively mapped to the union of the other three intervals. We fold back $I_{0}\cup I_{1}\cup I_{2}$ to $I_{3}$ via the reflection $\bm{F}$ acting on $I_{0}$ , the rotation $\bm{J}\bm{F}$ on $I_{1}$ , and the reflection $\bm{J}$ on $I_{2}$ ; see Figure 1. Conjugating this process via the stereographic projection through $[0,1,1]$ we obtain the Romik map in Figure 2. By construction, it is a continuous piecewise-projective selfmap of the real unit interval $[0,1]$ . It is composed of three pieces, each one mapping bijectively a subinterval of $[0,1]$ to the whole interval. The computation of these pieces is built-in in our formalism: indeed, since stereographic projection from $[0,1,1]$ is $C^{-1}\circ\tau\circ\upsilon$ on $\partial\mathcal{K}$ , computation amounts to switching from boldface to lightface. Thus, the first piece is induced by $J(FPF)=\bigl{[}\begin{smallmatrix}1&\\ -2&1\end{smallmatrix}\bigr{]}$ acting on $FPFJ*[0,1]=[0,1/3]$ , the second one by $(JF)(FPF)=JPF=\bigl{[}\begin{smallmatrix}-2&1\\ 1&\end{smallmatrix}\bigr{]}$ acting on $FPJ*[0,1]=[1/3,1/2]$ , and the third by $F(FPF)=PF=\bigl{[}\begin{smallmatrix}2&-1\\ 1&\end{smallmatrix}\bigr{]}$ on $FP*[0,1]=[1/2,1]$ .

We adopt another notational shorthand, by consistently writing $\bm{t},\theta$ (or $\bm{s},\sigma$ , $\ldots$ ) for pairs $\bm{t}=[t_{1},t_{2},t_{3}]\in\partial\mathcal{K}$ , $\theta=(t_{1}+t_{2}i)/t_{3}\in\partial\mathcal{D}$ , identified as in the discussion following the diagram (2.4). We recall that the residue field of the point $\bm{t}=[t_{1},t_{2},t_{3}]$ in the projective variety $\{x_{1}^{2}+x_{2}^{2}-x_{3}^{2}=0\}=\partial\mathcal{K}$ is $\mathbb{Q}(\bm{t})=\mathbb{Q}(t_{1}/t_{3},t_{2}/t_{3})$ . If $\mathbb{Q}(\bm{t})=\mathbb{Q}$ we say that $\bm{t}$ is a rational point; in this case $\bm{t}$ has a canonical presentation as a pythagorean triple. The corresponding $\theta\in\mathbb{Q}(i)$ has a canonical presentation as well, but a subtler one. For each prime integer $p\equiv 1\pmod{4}$ , write uniquely $p=a^{2}+b^{2}$ , for integers $a>b>0$ , and let $\theta_{p}=(a+bi)/(a-bi)$ (corresponding, as in Euclid’s setting, to $\bm{t}_{p}=[a^{2}-b^{2},2ab,a^{2}+b^{2}]$ ). It is well known —and easy to prove [20]— that every $\theta\in S^{1}\cap\mathbb{Q}(i)$ factors uniquely in $\mathbb{Q}(i)$ as a product of a unit in $\mathbb{Z}[i]$ and finitely many numbers $\theta_{p}$ and their inverses. This implies that the set of primitive pythagorean triples forms a multiplicative group, isomorphic to the direct sum of the cyclic group of order $4$ with countably many copies of the infinite cyclic group. We thus obtain our second canonical presentation: every $\theta\in S^{1}\cap\mathbb{Q}(i)$ can be uniquely expressed as $\theta=\kappa\mu/\bar{\mu}$ , with $\kappa\in\{1,i,-1,-i\}$ and $\mu\in\mathbb{Z}[i]$ having prime decomposition of the form

[TABLE]

with $a_{j}>\lvert b_{j}\rvert>0$ , $e_{j}>0$ for every $j$ , and the pairs $(a_{1},\lvert b_{1}\rvert),\ldots,(a_{q},\lvert b_{q}\rvert)$ all distinct.

4. The de Sitter space

The de Sitter space is the one-sheeted hyperboloid $\mathcal{S}=\{\bm{x}\in\mathbb{R}^{3}:\langle\bm{x},\bm{x}\rangle=1\}$ ; it is a lorentzian manifold of constant positive curvature [37], [36]. The de Sitter space is in natural bijection with various spaces of interest to us: these bijections are well known, albeit a bit scattered in the literature. We collect the relevant facts in Theorem 4.1, whose nonstandard feature is the rôle of $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ as the acting group, instead of the usual $\operatorname{PSL}_{2}\mathbb{R}$ .

We recall from §2 that $A\mapsto\bm{A}$ is a group isomorphism from $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ to $\operatorname{O}^{\uparrow}_{2,1}\mathbb{R}$ . We define now another isomorphism $\Lambda:\operatorname{PSL}^{\pm}_{2}\mathbb{R}\to\operatorname{SO}_{2,1}\mathbb{R}$ by $\Lambda(A)=(\det A)\bm{A}$ . In the following theorem we let $e:\{1,-1\}\to\{0,1\}$ have value [math] on $1$ , and $1$ on $-1$ ; also, we denote any group action by a star.

Theorem 4.1.

The spaces in the following list, together with the specified base points and transitive left actions of $\operatorname{PSL}_{2}^{\pm}\mathbb{R}$ , are in bijective correspondence. These correspondences preserve the base points and are equivariant with respect to the actions.

(S1)

The de Sitter space $\mathcal{S}$ , with base point $(1,0,0)$ and action $A*\bm{x}=\Lambda(A)\bm{x}$ .

(S2)

The coset space $\operatorname{PSL}_{2}\mathbb{R}/\mathfrak{A}$ , for $\mathfrak{A}$ the subgroup of diagonal matrices, with base point $\mathfrak{A}$ and action $A*E\mathfrak{A}=AEJ^{e(\det A)}\mathfrak{A}$ .

(S3)

$(\operatorname{P}^{1}\mathbb{R}\times\operatorname{P}^{1}\mathbb{R})\setminus(\operatorname{diagonal})$ , with base point $(\infty,0)$ and action $A*(\omega,\alpha)=(A*\omega,A*\alpha)$ .

(S4)

$(S^{1}\times S^{1})\setminus(\operatorname{diagonal})$ , with base point $(i,-i)$ and action $A*(\sigma,\rho)=(\mathscr{A}*\sigma,\mathscr{A}*\rho)$ .

(S5)

The space of oriented geodesics in $\mathcal{D}$ , with base point the geodesic from $-i$ to $i$ and action $A*g=\mathscr{A}[g]$ .

(S6)

The space of quadratic forms $q\bigl{(}\begin{smallmatrix}x\\ y\end{smallmatrix}\bigr{)}=q_{1}x^{2}-q_{2}xy+q_{3}y^{2}$ of discriminant $1$ , with base point $-xy$ and action $(A*q)\bigl{(}\begin{smallmatrix}x\\ y\end{smallmatrix}\bigr{)}=(\det A)q\bigl{(}A^{-1}\bigl{(}\begin{smallmatrix}x\\ y\end{smallmatrix}\bigr{)}\bigr{)}$ .

Each space carries a $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ -invariant infinite measure, which is the quotient Haar measure in (S2), and is induced by the form $(\omega-\alpha)^{-2}\,\mathrm{d}\omega\,\mathrm{d}\alpha$ in (S3). In (S1), the measure of a Borel subset $B$ of $\mathcal{S}$ is the euclidean volume of the cone $\{t\bm{x}:t\in[0,1],\,\bm{x}\in B\}$ , and analogously for (S6).

Proof.

The natural bijections among the spaces in (S3), (S4), (S5) are the obvious ones resulting from the diagram (2.4). Here we will first describe the bijections among (S2), (S3), (S6), and then the one between (S1) and (S6).

Let $q$ be a form as in (S6), associated to the symmetric matrix

[TABLE]

of determinant $-1/4$ . We obtain a pair $(\omega,\alpha)$ as in (S3) by labeling the two roots of $q(x,1)$ as follows:

(a)

if $q_{1}=0$ and $q_{2}=1$ , then $\omega=\infty$ and $\alpha=q_{3}$ ;

(b)

if $q_{1}=0$ and $q_{2}=-1$ , then $\omega=-q_{3}$ and $\alpha=\infty$ ;

(c)

if $q_{1}\not=0$ , then

[TABLE]

Given a pair $(\omega,\alpha)$ as in (S3), we set

[TABLE]

thus defining a coset $E\mathfrak{A}$ as in (S2).

Finally, any $E\mathfrak{A}$ in (S2) determines a symmetric matrix $Q^{\prime}$ of determinant $-1/4$ via

[TABLE]

note that $Q^{\prime}$ is well defined, i.e., independent from the choice of a representative in $E\mathfrak{A}$ and from the lift of this representative to $\operatorname{SL}_{2}\mathbb{R}$ .

It is clear that each of these constructions preserves the base points and is equivariant with respect to the listed actions. Therefore, the claimed correspondence between (S2), (S3), (S6) follows as soon as we prove that the final $Q^{\prime}$ equals the starting $Q$ . We check case (c), leaving the simpler cases (a) and (b) to the reader. By definition,

[TABLE]

so that

[TABLE]

Hence

[TABLE]

which is the initial $Q$ ; note the use of the identity $J^{\pm 1}\bigl{(}\begin{smallmatrix}&1\\ 1&\end{smallmatrix}\bigr{)}J^{\pm 1}=\pm 1\bigl{(}\begin{smallmatrix}&1\\ 1&\end{smallmatrix}\bigr{)}$ in the computation.

The bijection between (S1) and (S6) is a simple change of variables, namely

[TABLE]

This change of variables transforms the matrix $Q$ in (4.1) to $W/2$ , where $W$ is the matrix in (2.1). This implies that the bijection is equivariant with respect to the actions listed in (S1) and (S6); see also Remark 5.2.

The statement about invariant measures is well known; see, e.g., [22, §8]. ∎

For future reference we list here the form $q$ and the point $\bm{w}\in\mathcal{S}$ as a function of $(\omega,\alpha)$ :

[TABLE]

5. Circle intervals

The unit circle $S^{1}$ is cyclically ordered by the ternary betweenness relation $\bm{t}\prec\bm{x}\prec\bm{t}^{\prime}$ , which reads “ $\bm{t},\bm{t}^{\prime},\bm{x}$ are pairwise distinct, and traveling from $\bm{t}$ to $\bm{t}^{\prime}$ counterclockwise we meet $\bm{x}$ ”. Every pair of distinct points $\bm{t},\bm{t}^{\prime}$ determines two closed intervals, namely $[\bm{t},\bm{t}^{\prime}]=\{\bm{t},\bm{t}^{\prime}\}\cup\{\bm{x}:\bm{t}\prec\bm{x}\prec\bm{t}^{\prime}\}$ and $[\bm{t}^{\prime},\bm{t}]$ . Given $\bm{w}$ in the de Sitter space, the set $I_{\bm{w}}=\{\bm{x}\in S^{1}:x_{3}\langle\bm{w},\bm{x}\rangle\geq 0\}$ is an interval as well (the factor $x_{3}$ , i.e., the third coordinate of $\bm{x}$ , makes the definition independent from the choice of a representative for $\bm{x}$ ). Let us denote the ordinary cross product of two vectors in $\mathbb{R}^{3}$ by $\bm{x}\times\bm{y}$ .

Lemma 5.1.

Let $\bm{t},\bm{t}^{\prime}\in S^{1}$ be distinct, and let

[TABLE]

the right-hand side being independent from the chosen lifts of $\bm{t},\bm{t}^{\prime}$ to $\mathbb{R}^{3}\setminus\{0\}$ . Then the following statements hold.

(i)

$\bm{w}\in\mathcal{S}$ , and $I_{\bm{w}}=[\bm{t},\bm{t}^{\prime}]$ .

(ii)

Let $(\omega,\alpha)\in(\operatorname{P}^{1}\mathbb{R}\times\operatorname{P}^{1}\mathbb{R})\setminus(\operatorname{diagonal})$ be the pair corresponding to $\bm{w}$ according to Theorem 4.1. Then we have

[TABLE]

(iii)

For every $\bm{A}\in\operatorname{O}^{\uparrow}_{2,1}\mathbb{R}$ , we have $\bm{A}[I_{\bm{w}}]=I_{\bm{Aw}}$ , which equals $[\bm{At},\bm{At}^{\prime}]$ if $\det\bm{A}=1$ , and $[\bm{At}^{\prime},\bm{At}]$ otherwise.

(iv)

$\bm{w}\in\mathbb{Q}^{3}$ * if and only if both $\bm{t}$ and $\bm{t}^{\prime}$ are rational points.*

(v)

The arclength of $[\bm{t},\bm{t}^{\prime}]$ and the third coordinate $w_{3}$ of $\bm{w}$ are related by $\operatorname{arclength}([\bm{t},\bm{t}^{\prime}])=2\operatorname{arccot}(w_{3})$ .

(vi)

If $\bm{t}$ and $\bm{t}^{\prime}$ do not lie on the same diameter (i.e., by (v), if $w_{3}\not=0$ ), then the unique circle in $\mathbb{R}^{2}$ perpendicular to $S^{1}$ and passing through $\bm{t},\bm{t}^{\prime}$ has center $(w_{1}/w_{3},w_{2}/w_{3})$ and curvature $\lvert w_{3}\rvert$ .

(vii)

Assume that

[TABLE]

with arclength tending to [math] (i.e., $\lim_{t\to\infty}w_{t,3}=\infty$ ). Then $\lim_{t\to\infty}\operatorname{arclength}(I_{\bm{w}_{t}})\big{/}(2/w_{t,3})=1$ .

Proof.

(i) Every rotation

[TABLE]

leaves invariant the arclength of $[\bm{t},\bm{t}^{\prime}]$ and the third coordinate of $\bm{w}$ (because $\bm{S}$ belongs to $\operatorname{SO}_{3}\mathbb{R}$ as well as to $\operatorname{SO}_{2,1}\mathbb{R}$ , and hence $(\bm{L}\bm{S}\bm{t}^{\prime}\times\bm{L}\bm{S}\bm{t})\big{/}\langle\bm{S}\bm{t}^{\prime},\bm{S}\bm{t}\rangle=\bm{S}\bm{w}$ ). Therefore we assume without loss of generality $\bm{t}=[1,0,1]$ and $\bm{t}^{\prime}=[\cos r,\sin r,1]$ , for some $0<r<2\pi$ . Then, by explicit computation, $\bm{w}=\bigl{(}(\sin r)\big{/}(1-\cos r),1,(\sin r)\big{/}(1-\cos r)\bigr{)}$ , which is indeed in $\mathcal{S}$ . Let $\bm{x}(u)=[\cos u,\sin u,1]$ , and let $f(u)=\langle\bm{w},\bm{x}(u)\rangle:[0,2\pi)\to\mathbb{R}$ . Then, by elementary projective geometry, $f$ takes value [math] in precisely two points, namely in $u=0$ and in the unique solution to $\bm{x}(u)=\bm{t}^{\prime}$ . Again by explicit computation, $f$ has derivative $f^{\prime}(u)=\cos u-(\sin r)(\sin u)/(1-\cos r)$ , which is positive at [math]. This, and extending $f$ to be periodic, then implies that $\langle\bm{w},\bm{x}\rangle\geq 0$ if and only if $\bm{x}\in[\bm{t},\bm{t}^{\prime}]$ , as claimed.

(ii) We have $(\mu\circ\upsilon)^{-1}(\omega)=(2\omega,\omega^{2}-1,\omega^{2}+1)$ , and analogously for $\alpha$ . Our statement amounts then to the verification that the vector

[TABLE]

resulting from (5.1) equals the vector $\bm{w}$ given by (4.4). This is a straightforward computation.

(iii) Let $\bm{x}$ be a point in $S^{1}$ , and choose a representative for it with positive third coordinate. Then, for every $\bm{A}\in\operatorname{O}^{\uparrow}_{2,1}\mathbb{R}$ , the third coordinate of $\bm{A}^{-1}\bm{x}$ is still positive; we thus have $\bm{x}\in\bm{A}[I_{\bm{w}}]$ iff $\bm{A}^{-1}\bm{x}\in I_{\bm{w}}$ iff $\langle\bm{w},\bm{A}^{-1}\bm{x}\rangle\geq 0$ iff $\langle\bm{A}\bm{w},\bm{x}\rangle\geq 0$ iff $\bm{x}\in I_{\bm{A}\bm{w}}$ . The second statement follows from the first and the remark that $\bm{t}\prec\bm{A}^{-1}\bm{x}\prec\bm{t}^{\prime}$ is equivalent to $\bm{A}\bm{t}\prec\bm{x}\prec\bm{A}\bm{t}^{\prime}$ if $\det\bm{A}=1$ , and to $\bm{A}\bm{t}^{\prime}\prec\bm{x}\prec\bm{A}\bm{t}$ if $\det\bm{A}=-1$ .

(iv) The right-to-left implication follows from the definition of $\bm{w}$ . Conversely, if $\bm{w}\in\mathbb{Q}^{3}$ then the proof of the equivalence between (S1) and (S6) in Theorem 4.1 yields that the form $q$ corresponding to $\bm{w}$ has rational coefficients. Since $q$ has discriminant $1$ , the roots of $q(x,1)$ (given by (a), (b), (c) in the proof of the same Theorem 4.1) are rational numbers. By (ii), $\bm{t}$ and $\bm{t}^{\prime}$ are the reverse stereographic projections through $[0,1,1]$ of these roots, and thus are rational points.

(v) As in (i), we assume $\bm{t}=[1,0,1]$ and $\bm{t}^{\prime}=[\cos r,\sin r,1]$ . Then, as computed in (i), $w_{3}=(\sin r)\big{/}(1-\cos r)=\cot(r/2)$ , and our statement follows.

(vi) Looking at $\bm{w}$ as a point in $\operatorname{P}^{2}\mathbb{R}$ , the identities $\langle\bm{w},\bm{t}\rangle=\langle\bm{w},\bm{t}^{\prime}\rangle=0$ mean that $\bm{w}$ is the intersection point of the two lines tangent to $S^{1}$ at $\bm{t}$ and $\bm{t}^{\prime}$ ; thus the described circle has center $(w_{1}/w_{3},w_{2}/w_{3})$ . Upon applying the rotation in the proof of (i), the statement about the curvature follows by direct inspection.

(vii) This is clear. ∎

Remark 5.2.

Since, as it is easily seen, the map $\bm{w}\mapsto I_{\bm{w}}$ is a bijection between $\mathcal{S}$ and the space of closed circle intervals, it is tempting to add a seventh item to the list in Theorem 4.1. However this would not be correct, since the action in Lemma 5.1(iii) does not agree with the one in Theorem 4.1(S1). In other words, $\operatorname{PSL}^{\pm}_{2}\mathbb{R}$ acts on the space of intervals via the “bold” isomorphism $A\mapsto\bm{A}$ , while it acts on the de Sitter space via $\Lambda$ . The following commuting diagram may clarify the situation

[TABLE]

In (5.2), the rightmost vertical arrow is the involutive automorphism $\bm{A}\mapsto(\det\bm{A})(\operatorname{sgn}\bm{A}_{3,3})\bm{A}$ of $\operatorname{O}_{2,1}\mathbb{R}$ , which restricts to the isomorphisms $\Lambda\circ\operatorname{bold}^{-1}$ and $\operatorname{bold}\circ\Lambda^{-1}$ . Since these isomorphisms obviously preserve the fact that a matrix has integer entries, Theorem 2.4 implies that $\operatorname{SO}_{2,1}\mathbb{Z}=\Lambda\bigl{[}\langle F,P,G\rangle\bigr{]}=\langle-\bm{F},-\bm{P},-\bm{G}\rangle\simeq\Delta^{\pm}(2,4,\infty)$ and $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}=\Lambda\bigl{[}\langle F,P,G\rangle^{+}\bigr{]}=\langle\bm{F},\bm{P},\bm{G}\rangle^{+}\simeq\Delta(2,4,\infty)$ .

When working with continued fractions algorithms one naturally deals with unimodular intervals in $\operatorname{P}^{1}\mathbb{R}$ , namely intervals $[p/q,p^{\prime}/q^{\prime}]$ with rational endpoints and such that $\det\bigl{(}\begin{smallmatrix}p&p^{\prime}\\ q&q^{\prime}\end{smallmatrix}\bigr{)}=-1$ ; for example, the intervals $[1/(a+1),1/a]$ of continuity for the Gauss map $x\mapsto 1/x-\lfloor 1/x\rfloor$ are unimodular. It is a trivial —but key— fact that the modular group $\operatorname{PSL}_{2}\mathbb{Z}$ acts simply transitively on such intervals. The situation for intervals on the circle is more involved.

Theorem 5.3.

The set $\mathcal{S}\cap\mathbb{Z}^{3}$ is partitioned in two orbits, corresponding to the parity of $w_{3}$ , by the action of $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}$ . On each orbit the action is simply transitive. Replacing $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}$ with its index- $2$ subgroup $\Lambda\bigl{[}\langle F,P,J\rangle^{+}\bigr{]}$ each orbit is further split in two.

Proof.

It is easy to check that each of $-\bm{F}$ , $-\bm{P}$ , $-\bm{G}$ preserves the parity of $w_{3}$ ; hence there are at least two orbits.

Choose $\bm{w}\in\mathcal{S}\cap\mathbb{Z}^{3}$ and let $(\omega,\alpha)\in(\operatorname{P}^{1}\mathbb{Q}\times\operatorname{P}^{1}\mathbb{Q})\setminus(\operatorname{diagonal})$ be the corresponding ordered pair according to Theorem 4.1. An appropriate power $(FP)^{k}$ of the parabolic matrix $FP$ (that fixes $1$ ) sends $(\omega,\alpha)$ to a new pair $(\omega^{\prime},\alpha^{\prime})$ with $0\leq\omega^{\prime}\leq 1$ . By [42, Theorem 2(i)], the orbit $\omega^{\prime}=\omega^{\prime}_{0},\omega^{\prime}_{1},\omega^{\prime}_{2},\ldots$ of $\omega^{\prime}$ under the Romik map ends up after finitely many steps, say the $n$ th step, in one of the two parabolic fixed points [math], $1$ . For each $0\leq t<n$ , let

[TABLE]

be the matrix acting at time $t$ . Then $A=FJA_{n-1}A_{n-2}\cdots A_{0}(FP)^{k}\in\langle F,P,J\rangle$ , and $A*(\omega,\alpha)=(\omega^{\prime\prime},\alpha^{\prime\prime})$ is such that $\omega^{\prime\prime}\in\{\infty,-1\}$ . Postcomposing $A$ , if necessary, with $J$ (if $\omega^{\prime\prime}=\infty$ ) or with $F$ (if $\omega^{\prime\prime}=-1$ ), we have $A\in\langle F,P,J\rangle^{+}$ .

Suppose $\omega^{\prime\prime}=\infty$ . Then $\alpha^{\prime\prime}\in\mathbb{Z}$ because the point $\bm{w}^{\prime\prime}$ corresponding to $(\infty,\alpha^{\prime\prime})$ equals $(1,\alpha^{\prime\prime},\alpha^{\prime\prime})$ by (4.4), and also equals $\Lambda(A)\bm{w}$ , which is a point in $\mathbb{Z}^{3}$ . This implies that an appropriate power of the parabolic matrix $PJ=\bigl{[}\begin{smallmatrix}1&2\\ &1\end{smallmatrix}\bigr{]}$ maps $(\infty,\alpha^{\prime\prime})$ either to $(\infty,0)$ or to $(\infty,1)$ . If, on the other hand, $\omega^{\prime\prime}=-1$ , then the same argument with $PJ$ replaced by $(JPJ)F=\bigl{[}\begin{smallmatrix}2&1\\ -1&\end{smallmatrix}\bigr{]}$ (which is parabolic fixing $-1$ ) yields that a power of $JPJF$ maps $(-1,\alpha^{\prime\prime})$ either to $(-1,1)$ or to $(-1,\infty)$ .

Summing up, we have proved that the pair $(\omega,\alpha)$ is in the $\langle F,P,J\rangle^{+}$ -orbit of one of the pairs $(\infty,0),(\infty,1),(-1,1),(-1,\infty)$ . Now, the rotation $GF\in\langle F,P,G\rangle^{+}$ maps the first pair to the third, and the second to the fourth. By Theorem 4.1 this means that the original point $\bm{w}$ is in the $\Lambda\bigl{[}\langle F,P,G\rangle^{+}\bigr{]}$ -orbit of either $(1,0,0)$ or of $(1,1,1)$ . Since $\Lambda\bigl{[}\langle F,P,G\rangle^{+}\bigr{]}=\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}$ by Remark 5.2, our first claim is established.

Simple transitivity follows from the fact that both $(\infty,0)$ and $(\infty,1)$ have trivial stabilizer in $\langle F,P,G\rangle^{+}$ (because an element of a fuchsian group that fixes two distinct cusps must be the identity).

Finally, the pairs $(\infty,0),(\infty,1),(-1,1),(-1,\infty)$ remain distinct modulo $\langle F,P,J\rangle^{+}$ . Indeed, the latter is the triangle group $\Delta(2,\infty,\infty)$ , which has two distinct cusp orbits, and it is easy to check that any identification of the above four pairs would collapse these two orbits. ∎

We can now define unimodularity for circle intervals.

Definition 5.4.

Let $\bm{t},\bm{t}^{\prime}$ be distinct rational points in $S^{1}$ , and let $\bm{w}\in\mathcal{S}\cap\mathbb{Q}^{3}$ be the point corresponding to $[\bm{t},\bm{t}^{\prime}]$ according to Lemma 5.1. If $\bm{w}\in\mathbb{Z}^{3}$ and $w_{3}$ is even (odd), then we say that $[\bm{t},\bm{t}^{\prime}]$ is an even (odd) unimodular interval.

Theorem 5.5.

Let $\bm{t},\bm{t}^{\prime},\bm{w}$ be as in Definition 5.4; then the following conditions are equivalent.

(1)

$[\bm{t},\bm{t}^{\prime}]$ * is unimodular (either even or odd).* 2. (2)

$\bm{R}_{\bm{w}}$ * has integer entries.* 3. (3)

$[\bm{t},\bm{t}^{\prime}]$ * is the image either of $\bigl{[}[0,-1,1],[0,1,1]\bigr{]}$ or of $\bigl{[}[1,0,1],[0,1,1]\bigr{]}$ under some (necessarily unique) element of $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}$ .* 4. (4)

$\langle\bm{t},\bm{t}^{\prime}\rangle\in\{-1,-2\}$ * (here $\bm{t},\bm{t}^{\prime}$ are the canonical presentations of $\bm{t},\bm{t}^{\prime}$ as primitive pythagorean triples).*

If these conditions hold, then $[\bm{t},\bm{t}^{\prime}]$ is odd iff it is the image of $\bigl{[}[1,0,1],[0,1,1]\bigr{]}$ iff $\langle\bm{t},\bm{t}^{\prime}\rangle=-1$ . Moreover, $\bm{R}_{\bm{w}}$ belongs to $\langle\bm{F},\bm{P},\bm{J}\rangle$ , and the matrix $\mathscr{R}_{\bm{w}}\in\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ corresponding to it under Convention 2.2 is

[TABLE]

where $\theta,\theta^{\prime}\in S^{1}\cap\mathbb{Q}(i)$ are identified with $\bm{t},\bm{t}^{\prime}$ as in §3.

Proof.

(1) $\Rightarrow$ (2) Since $\langle\bm{w},\bm{w}\rangle=1$ , this is immediate from the explicit formula for $\bm{R}_{\bm{w}}$ in (2.6).

(2) $\Rightarrow$ (3) Let

[TABLE]

(see Lemma 5.1(ii)). Then, as in the proof of Theorem 5.3, we construct $A\in\langle F,P,J\rangle^{+}$ such that $A*(\omega,\alpha)$ equals either $(\infty,\alpha^{\prime\prime})$ or $(-1,\alpha^{\prime\prime})$ . Since $FG*(-1)=\infty$ , there exists $B\in\langle F,P,G\rangle^{+}$ with $B*(\omega,\alpha)=(\infty,q)$ , for some $q\in\mathbb{Q}$ . Hence, $\Lambda(B)\bm{w}=(1,q,q)=\bm{v}$ . We then have

[TABLE]

and the leftmost entry in the display is a matrix with integer entries. Multiplying through by $-1$ , subtracting the identity matrix $\bm{I}$ , and multiplying by $\bm{L}$ on the right, we see that the matrix

[TABLE]

must have integer entries. This implies that the denominator of the rational number $q$ must divide $2$ , and so must do the denominator of $q^{2}$ ; therefore $q$ is an integer. Thus, as in the proof of Theorem 5.3, an appropriate power $(\bm{P}\bm{J})^{k}$ will map $(1,q,q)$ either to $(1,0,0)$ or to $(1,1,1)$ ; therefore, $\Lambda\bigl{(}(PJ)^{k}B\bigr{)}\bm{w}\in\{(1,0,0),(1,1,1)\}$ . Now, $(PJ)^{k}B\in\langle F,P,G\rangle^{+}$ , and $\Lambda$ equals the “bold” isomorphism on $\langle F,P,G\rangle^{+}$ , with range $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}$ . Thus $\bm{w}$ is the image either of $(1,0,0)$ or of $(1,1,1)$ under some element of $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}$ , a statement equivalent to (3) by Remark 5.2.

(3) $\Rightarrow$ (4) This is clear, since $\langle(0,-1,1),(0,1,1)\rangle=-2$ and $\langle(1,0,1),(0,1,1)\rangle=-1$ .

(4) $\Rightarrow$ (1) If $\langle\bm{t},\bm{t}^{\prime}\rangle=-1$ , then $\bm{w}\in\mathbb{Z}^{3}$ by the definition of $\bm{w}$ in Lemma 5.1; assume then $\langle\bm{t},\bm{t}^{\prime}\rangle=-2$ . In every pythagorean triple one of the legs must be even, and the other leg and the hypotenuse both odd. The condition $t_{1}t^{\prime}_{1}+t_{2}t^{\prime}_{2}-t_{3}t^{\prime}_{3}=-2$ forces $t_{1},t^{\prime}_{1}$ to be both even and $t_{2},t^{\prime}_{2}$ both odd (or conversely). Since $t_{3},t^{\prime}_{3}$ are surely both odd, all the entries in $\bm{L}\bm{t}^{\prime}\times\bm{L}\bm{t}$ must be even; thus $\bm{w}\in\mathbb{Z}^{3}$ .

The stated characterization of $[\bm{t},\bm{t}^{\prime}]$ being even/odd is clear from the previous proof.

By Theorem 5.3, $\bm{w}$ is in the $\langle\bm{F},\bm{P},\bm{J}\rangle^{+}$ -orbit of one of $(1,0,0)$ , $(1,1,1)$ , $(0,1,0)$ , $(-1,1,1)$ . Hence $\bm{R}_{\bm{w}}$ is a conjugate either of $\bm{R}_{(1,0,0)}=\bm{J}$ , or of $\bm{R}_{(1,1,1)}=\bm{P}$ , or of $\bm{R}_{(0,1,0)}=\bm{F}$ , or of $\bm{R}_{(-1,1,1)}=\bm{J}\bm{P}\bm{J}$ by a matrix in $\langle\bm{F},\bm{P},\bm{J}\rangle^{+}$ ; in any case, it belongs to $\langle\bm{F},\bm{P},\bm{J}\rangle$ .

Finally, let $\mathscr{S}$ be the matrix in (5.3). By direct computation

[TABLE]

which has the form $\bigl{[}\begin{smallmatrix}\alpha&\beta\\ \bar{\beta}&\bar{\alpha}\end{smallmatrix}\bigr{]}$ , as can easily be checked; hence $\mathscr{S}\in\operatorname{PSU}^{\pm}_{1,1}\mathbb{C}$ . If we can prove that $\mathscr{S}$ has entries in $\mathbb{Z}[i]$ , then necessarily $\mathscr{S}=\mathscr{R}_{\bm{w}}$ . Indeed, the matrix $\mathscr{S}^{-1}\mathscr{R}_{\bm{w}}$ would then belong to the fuchsian group $\operatorname{PSU}_{1,1}\mathbb{Z}[i]$ , and would fix the two cusps $\theta,\theta^{\prime}$ ; hence, it must be the identity matrix.

Write uniquely $\theta=\kappa\mu/\bar{\mu}$ , $\theta^{\prime}=\lambda\nu/\bar{\nu}$ , as explained in §3. By Theorem 5.3, there exists $\mathscr{A}\in\langle\mathscr{F},\mathscr{P},\mathscr{J}\rangle^{+}=\operatorname{PSU}_{1,1}\mathbb{Z}[i]$ such that

[TABLE]

This implies that the determinant $\delta=\kappa\mu\bar{\nu}-\lambda\bar{\mu}\nu$ divides $2$ in $\mathbb{Z}[i]$ . Since

[TABLE]

we have

[TABLE]

which has entries in $\mathbb{Z}[i]$ . ∎

6. Billiard maps

Having arranged our tools in working order, we proceed to our core objects.

Definition 6.1.

A unimodular partition of the unit circle $S^{1}$ is a counterclockwise cyclically ordered $m$ -uple $\bm{t}_{0},\bm{t}_{1},\ldots,\bm{t}_{m-1}$ of pythagorean triples, of cardinality at least $3$ , such that each interval $[\bm{t}_{a},\bm{t}_{a+1}]$ is unimodular (including $[\bm{t}_{m-1},\bm{t}_{0}]$ ; here and in the following we are writing indices modulo $m$ ). We will write $\bm{w}_{a}=(\bm{L}\bm{t}_{a+1}\bm{\times}\bm{L}\bm{t}_{a})/\langle\bm{t}_{a+1},\bm{t}_{a}\rangle\in\mathcal{S}$ for the points defined by Lemma 5.1.

According to our conventions, and without further notice, we will often switch to a complex-numbers setting, thus writing $\theta_{a}$ for $\bm{t}_{a}$ .

For every $a$ , let $l_{a}$ be the geodesic in $\mathcal{D}$ of ideal endpoints $\theta_{a}$ and $\theta_{a+1}$ ; of the two halfplanes determined by $l_{a}$ , let $D_{a}$ be the one containing all other $l_{b}$ , for $b\not=a$ . Then $D=\bigcap\{D_{a}:a=0,\ldots,m-1\}$ is a polygon with sides $l_{0},\ldots,l_{m-1}$ and ideal vertices $\theta_{0},\ldots,\theta_{m-1}$ , on which we can play billiards in the usual way. Namely, any unit velocity vector attached to an infinitesimal ball in the interior of $D$ determines an oriented geodesic $g$ starting from an ideal point $\rho$ and ending at $\sigma$ . The ball travels along $g$ at unit speed, until it hits the side $l_{a}$ determined by the half-open interval $[\theta_{a},\theta_{a+1})$ to which $\sigma$ belongs (unless $\sigma$ is one of the vertices, in which case the ball is lost at infinity). When hitting $l_{a}$ , the ball rebounces with angle of reflection equal to the angle of incidence, and continues its trajectory along the geodesic $g^{\prime}$ which is the image of $g$ with respect to the reflection with mirror $l_{a}$ . This reflection is induced by the matrix $\mathscr{R}_{\bm{w}_{a}}$ in (5.3) (with $\theta=\theta_{a}$ and $\theta^{\prime}=\theta_{a+1}$ ), and thus has ideal initial and terminal points $\mathscr{R}_{\bm{w}_{a}}*\rho$ and $\mathscr{R}_{\bm{w}_{a}}*\sigma$ , respectively. All of this naturally suggests the following standard definition [18, Chapter 6], [16, §IV.1].

Definition 6.2.

The billiard map determined by the unimodular partition $\theta_{0},\ldots,\theta_{m-1}$ is the map $\widetilde{B}$ from $(S^{1}\times S^{1})\setminus(\operatorname{diagonal})$ to itself defined by $\widetilde{B}(\sigma,\rho)=(\mathscr{A}_{a}*\sigma,\mathscr{A}_{a}*\rho)$ , where $a$ is the index of the unique half-open interval $I_{a}={[}\theta_{a},\theta_{a+1})$ containing $\sigma$ , and $\mathscr{A}_{a}=\mathscr{R}_{\bm{w}_{a}}$ . The map $\widetilde{B}$ is continuous, and determines a topological dynamical system. We denote by $(S^{1},B)$ the factor system naturally induced by the projection $(\sigma,\rho)\mapsto\sigma$ ; in short, $B(\sigma)=\mathscr{A}_{a}*\sigma$ for $\sigma\in I_{a}$ .

We will freely use Theorem 4.1 to conjugate $\widetilde{B}$ to a map acting on any of the spaces (S1)–(S6); we will still denote the conjugated map by $\widetilde{B}$ , slightly abusing notation. For ease of visualization (and crucially in §9 and §10) we will also conjugate $\widetilde{B}$ and $B$ to maps on ${[}0,1)^{2}\setminus(\operatorname{diagonal})$ and $[0,1)$ , respectively; these last conjugations are realized through the normalized (i.e., the image is divided by $2\pi$ ) argument function $\arg:\partial\mathcal{D}\to{[}0,1)$ .

Example 6.3.

The ordered $6$ -uple

[TABLE]

is a unimodular partition, whose corresponding billiard table is shown in Figure 3 (left). The matrices $\mathscr{A}_{0},\ldots,\mathscr{A}_{5}$ are

[TABLE]

The graph of the $\arg$ -conjugate of $B$ is shown in Figure 3 (right); it requires caution in two respects. First, $B$ is a continuous map on $S^{1}$ and, second, it is piecewise-defined via six pieces, whose endpoints are given by the six $B$ -fixed points ( $0=1$ included). We plot in Figure 4 (left) $5000$ points of the $\widetilde{B}$ -orbit of a “typical” point in the de Sitter space $\mathcal{S}$ , and in Figure 4 (right) their $\arg$ -images. The cluster points apparent in this latter figure correspond to the six fixed points cited above. These are indifferent fixed points (i.e., the derivative of $B$ has absolute value $1$ ), and this forces the unique $B$ -invariant measure absolutely continuous with respect to the Lebesgue measure to be infinite; see Theorem 7.2 and Figure 5. Note that $\widetilde{B}$ is not injective: the points $(\theta_{0},\mathscr{A}_{0}*\theta_{2})$ and $(\mathscr{A}_{2}*\theta_{0},\theta_{2})$ are different, but both get mapped to $(\theta_{0},\theta_{2})$ (see however Theorem 7.1(i)).

We let $\varGamma^{\pm}_{B}$ be the group generated by $\mathscr{A}_{0},\ldots,\mathscr{A}_{m-1}$ , and $\varGamma_{B}=\varGamma^{\pm}_{B}\cap\operatorname{PSU}_{1,1}\mathbb{Z}[i]$ the associated fuchsian group. By conjugating with an appropriate element of $\operatorname{PSU}_{1,1}\mathbb{Z}[i]$ we always assume, without loss of generality, that $\theta_{0}=1$ . As noted in §2, $\varGamma^{\pm}_{B}$ admits the presentation $\langle x_{0},\ldots,x_{m-1}\mid x_{0}^{2}=x_{1}^{2}=\cdots=x_{m-1}^{2}=1\rangle$ , and hence is isomorphic to the free product of $m$ copies of the group of order two. Equivalently stated, each element of $\varGamma^{\pm}_{B}$ can be uniquely written as a word in the generators $\mathscr{A}_{0},\ldots,\mathscr{A}_{m-1}$ , subject to the only condition that the same generator does not appear in two consecutive positions. Since $D$ has finite hyperbolic area, $\varGamma_{B}$ and $\varGamma^{\pm}_{B}$ have finite index in $\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ .

Definition 6.4.

Let $B,I_{0},\ldots,I_{m-1}$ be as in Definition 6.2. For each $t=0,1,2,\ldots$ , let $a_{t}$ be determined by $B^{t}(\sigma)\in I_{a_{t}}$ ; the point $\varphi(\sigma)=a_{0}a_{1}a_{2}\ldots=\mathbf{a}$ in the Cantor space $\{0,\ldots,m-1\}^{\omega}$ is the $B$ -symbolic sequence of $\sigma$ .

Lemma 6.5.

The $B$ -symbolic-sequence map $\varphi:S^{1}\to\{0,\ldots,m-1\}^{\omega}$ is injective. Its range is the set of all sequences $\mathbf{a}$ such that:

(i)

if $a_{t}=a_{t+1}$ for some $t$ , then $a_{t}=a_{t+h}$ for every $h\geq 0$ ;

(ii)

for any $a\in\{0,\ldots,m-1\}$ , the tail of $\mathbf{a}$ is neither of the form $\overline{a(a+1)}$ , nor of the form $(a-1)\overline{a}$ (the bar denoting periodicity).

Remark 6.6.

Since we are considering half-open intervals, each $\sigma$ has precisely one $B$ -symbolic sequence; thus $\varphi$ is well defined. This differs slightly form other treatments of Gauss-like maps (see, e.g., [29, §2.1] or [45, §1.2.1]), in which rational points have two symbolic sequences. Note that $\varphi$ is not continuous; indeed, if it were it would have compact image, which is not the case (e.g., all sequences of the form $(01)^{n}\overline{0}$ lie in the image, but the resulting sequence of sequences does not have a limit point in $\varphi[S^{1}]$ ).

Proof of Lemma 6.5.

Each $\mathscr{A}_{a}$ is an involution, and exchanges $\overline{I}_{a}$ with $\bigcup_{b\not=a}\overline{I}_{b}$ , the bar denoting topological closure. However, in this proof we carefully distinguish $B$ (which maps bijectively $\overline{I}_{a}$ to $\bigcup_{b\not=b}\overline{I}_{b}$ ) from $\mathscr{A}_{a}$ (which is one of the branches of $B^{-1}$ , the one that maps bijectively $\bigcup_{b\not=a}\overline{I}_{b}$ to $\overline{I}_{a}$ ). We do so in order to prepare the ground for the proof of Theorem 9.2, where the argument we are going to provide will be adapted to another $(m-1)$ -to- $1$ covering map of $S^{1}$ .

Let $\mathbf{a}=\varphi(\sigma)$ . If $a_{t}=a_{t+1}=a$ , then $B^{t}(\sigma)\in I_{a}\cap B^{-1}[I_{a}]=\{\theta_{a}\}$ . Since $\theta_{a}$ is a $B$ -fixed point, we have $a_{t+h}=a$ for every $h\geq 0$ . Moreover, if $t\geq 1$ and $a_{t-1}\not=a$ , then we have $\theta_{a}=B^{t}(\sigma)\in B[I_{a_{t-1}}]$ , which implies $a_{t-1}\not=a-1$ , because $\theta_{a}\notin B[I_{a-1}]$ . Hence $\mathbf{a}$ cannot have tail $(a-1)\overline{a}$ . The fact that $\mathbf{a}$ cannot have tail $\overline{a(a+1)}$ is proved in [12, Theorem 2.1]. We conclude that every $B$ -symbolic sequence must satisfy (i) and (ii).

Conversely, we fix $\mathbf{a}$ satisfying (i) and (ii) and show that there exists a unique point having $\mathbf{a}$ as $B$ -symbolic sequence. We need a preliminary remark: suppose we know that $\sigma$ is the unique point having $B$ -symbolic sequence $\mathbf{b}$ . Then, by direct inspection, we have:

(a)

if $\sigma$ is in the interior of $I_{b_{0}}$ and $b\not=b_{0}$ , then $\mathscr{A}_{b}*\sigma$ is in the interior of $I_{b}$ and is the unique point having $B$ -symbolic sequence $b\mathbf{b}$ ;

(b)

the same conclusion holds if $\sigma=\theta_{b_{0}}$ , provided that $b\notin\{b_{0},b_{0}-1\}$ .

Case 1. The sequence $\mathbf{a}$ has tail $\overline{a}$ , say from time $t$ on. If $t=0$ , then there exists a unique point having $B$ -symbolic sequence $\overline{a}$ , namely $\theta_{a}$ . If $t>0$ , then the previous remark and induction show that $\mathscr{A}_{a_{0}}\cdots\mathscr{A}_{a_{t-1}}*\theta_{a}$ is the only point having $B$ -symbolic sequence $\mathbf{a}$ .

Case 2. The sequence $\mathbf{a}$ does not have tail $\overline{a}$ , for any $a$ . Since $a_{t}\not=a_{t+1}$ for every $t$ , we have strict inclusions $\overline{I}_{a_{t}}\supset\mathscr{A}_{a_{t}}[\overline{I}_{a_{t+1}}]$ for every $t$ , and hence a strictly decreasing sequence of nested intervals

[TABLE]

We claim that this sequence shrinks to a singleton. Indeed, each set in (6.1) is a unimodular interval, strictly containing the following one. By Lemma 5.1(v) the third coordinates of the corresponding points $\bm{w}_{a_{0}},\bm{A}_{a_{0}}\bm{w}_{a_{1}},\bm{A}_{a_{0}}\bm{A}_{a_{1}}\bm{w}_{a_{2}},\ldots$ on the de Sitter space form a strictly increasing sequence. Since we are dealing with unimodular intervals, these third coordinates are integer numbers, and a strictly increasing sequence of integers must go to infinity. Therefore the arclengths of the intervals go to [math], and the intersection of the sequence in (6.1) contains at least one point —by compactness— but no more than one.

Let $\sigma$ be the shrinking point of (6.1) and let $\varphi(\sigma)=\mathbf{b}$ ; we prove $\mathbf{a}=\mathbf{b}$ by induction (note that, clearly, no point other than $\sigma$ may have $B$ -symbolic sequence $\mathbf{a}$ ). We have $\sigma\in\overline{I}_{a_{0}}\cap I_{b_{0}}$ ; if $a_{0}$ were different from $b_{0}$ , then necessarily $\sigma=\theta_{b_{0}}$ and $b_{0}=a_{0}+1$ . Therefore, for every $t\geq 1$ we have $\sigma=B^{t}(\sigma)\in B^{t}[\mathscr{A}_{a_{0}}\cdots\mathscr{A}_{a_{t-1}}\bigl{[}\overline{I}_{a_{t}}]\bigr{]}=\overline{I}_{a_{t}}$ , and thus $\sigma$ belongs to $\overline{I}_{a_{t}}$ . This implies $\mathbf{a}=\overline{a_{0}(a_{0}+1)}$ , which contradicts (ii); hence $a_{0}=b_{0}$ . For the inductive step, assume $a_{r}=b_{r}$ for $0\leq r<t$ . Then $B^{t}(\sigma)$ has $B$ -symbolic sequence $b_{t}b_{t+1}\ldots$ and is the unique shrinking point of the chain

[TABLE]

Applying the base step above to $B^{t}(\sigma)$ we get $a_{t}=b_{t}$ . ∎

7. Natural extension and invariant measures

If $\varphi(\sigma)$ has constant tail $\overline{a}$ for some $a\in\{0,\ldots,m-1\}$ , i.e., $B^{h}(\sigma)=\theta_{a}$ for some $h$ , we say that $\sigma$ is $B$ -terminating. If $\varphi(\sigma)$ has periodic tail $\overline{a_{h}\cdots a_{h+p-1}}$ with minimal preperiod $h$ and period $p\geq 2$ , we say that $\sigma$ is $B$ -periodic or $B$ -preperiodic, according whether $h$ is [math] or greater than [math].

We will push the identification of the de Sitter space with $(S^{1}\times S^{1})\setminus(\operatorname{diagonal})$ a bit further by using the symbol $\mathcal{S}$ for both; this is unambiguous since writing $\bm{w}\in\mathcal{S}$ or $(\sigma,\rho)\in\mathcal{S}$ clearly distinguishes the two uses. With this understanding, we denote by $\mathcal{S}_{B}$ the set of all pairs $(\sigma,\rho)$ such that:

(i)

both $\sigma$ and $\rho$ are $B$ -nonterminating;

(ii)

$\sigma$ and $\rho$ belong to different intervals.

For the map $B$ of Example 6.3, the orbit in Figure 4 is dense in $\mathcal{S}_{B}$ .

Theorem 7.1.

The following facts hold.

(i)

$\widetilde{B}\restriction\mathcal{S}_{B}$ * is a bijection on $\mathcal{S}_{B}$ .*

(ii)

If $(\sigma,\rho)\in\mathcal{S}$ is such that both $\sigma$ and $\rho$ are $B$ -nonterminating, then $\widetilde{B}^{t}(\sigma,\rho)\in\mathcal{S}_{B}$ for some $t\geq 0$ .

(iii)

Let $\tilde{\mu}$ be the $\operatorname{PSU}^{\pm}_{1,1}\mathbb{C}$ -invariant measure on $(S^{1}\times S^{1})\setminus(\operatorname{diagonal})$ given by Theorem 4.1. Then $(\mathcal{S}_{B},\tilde{\mu},\widetilde{B})$ is a measure-preserving system, and so is its factor $(S^{1},\mu,B)$ , where $\mu=\pi_{*}\tilde{\mu}$ is the pushforward measure induced by the projection $\pi(\sigma,\rho)=\sigma$ .

(iv)

The invertible system $(\mathcal{S}_{B},\tilde{\mu},\widetilde{B})$ is the natural extension of $(S^{1},\mu,B)$ .

Proof.

(i) The fact that $\widetilde{B}$ maps $\mathcal{S}_{B}$ into itself is clear. Writing $f$ for the involution $(\sigma,\rho)\mapsto(\rho,\sigma)$ of $\mathcal{S}_{B}$ , it is also clear that $f\circ\widetilde{B}\circ f=\widetilde{B}^{-1}$ on $\mathcal{S}_{B}$ . In terms of symbolic sequences, all of this just amounts to $\widetilde{B}:(a_{0}a_{1}\ldots,b_{0}b_{1}\ldots)\mapsto(a_{1}\ldots,a_{0}b_{0}b_{1}\ldots)$ and $f\circ\widetilde{B}\circ f:(a_{0}a_{1}\ldots,b_{0}b_{1}\ldots)\mapsto(b_{0}a_{0}a_{1}\ldots,b_{1}\ldots)$ .

(ii) Let $\sigma\not=\rho$ be both $B$ -nonterminating. By Lemma 6.5 there exists $t\geq 0$ such that $B^{t}(\sigma)$ and $B^{t}(\rho)$ belong to different intervals. By the definitions of $\widetilde{B}$ and of $\mathcal{S}_{B}$ , we have $\widetilde{B}^{t}(\sigma,\rho)\in\mathcal{S}_{B}$ .

(iii) Any measurable $M\subseteq\mathcal{S}_{B}$ is the disjoint union $M=\mathop{\dot{\bigcup}}\{M_{a}:a\in\{0,\ldots,m-1\}\}$ , where $M_{a}=\{(\sigma,\rho)\in M:\rho\in I_{a}\}$ . Thus ${\widetilde{B}}^{-1}M=\mathop{\dot{\bigcup}}_{a}{\widetilde{B}}^{-1}M_{a}=\mathop{\dot{\bigcup}}_{a}\mathscr{A}_{a}[M_{a}]$ and, as $\tilde{\mu}\bigl{(}\mathscr{A}_{a}[M_{a}]\bigr{)}=\tilde{\mu}(M_{a})$ , we have $\tilde{\mu}({\widetilde{B}}^{-1}M)=\tilde{\mu}(M)$ .

(iv) The set $\{\sigma\in S^{1}:\text{$ \sigma $is$ B $-terminating}\}$ is clearly $B$ -invariant and has $\mu$ -measure [math]; modulo this nullset and its $\pi$ -counterimage, we have the commuting square

[TABLE]

By the very definition of the natural extension [41, p. 22], the metric system $(\mathcal{S}_{B},\tilde{\mu},\widetilde{B})$ is the natural extension of its factor $(S^{1},\mu,B)$ if the supremum of the family of measurable partitions

[TABLE]

is —modulo nullsets— the partition of $\mathcal{S}_{B}$ in singletons. This condition amounts to the request that if $(\sigma,\rho)\not=(\sigma^{\prime},\rho^{\prime})$ , then there exists $t\geq 0$ such that $\pi\bigl{(}\widetilde{B}^{-t}(\sigma,\rho)\bigr{)}\not=\pi\bigl{(}\widetilde{B}^{-t}(\sigma^{\prime},\rho^{\prime})\bigr{)}$ . This request is clearly satisfied: if $\sigma\not=\sigma^{\prime}$ we take $t=0$ , while if $\sigma=\sigma^{\prime}$ we take $t=h+1$ , there $h$ is the least nonnegative integer such that $B^{t}(\rho)$ and $B^{t}(\rho^{\prime})$ lie in different intervals. ∎

As usual in the context of Gauss-like maps, once a model of the natural extension has been determined the computation of the (unique) absolutely continuous $B$ -invariant measure is easy; we state the result for the $\arg$ -conjugates of $\widetilde{B}$ and $B$ .

Theorem 7.2.

Let $X=\{(\arg\sigma,\arg\rho):(\sigma,\rho)\in\mathcal{S}_{B}\}\subset[0,1)^{2}$ and write —abusing language— $\widetilde{B}$ and $B$ for $\arg\circ\widetilde{B}\circ\arg^{-1}$ and $\arg\circ B\circ\arg^{-1}$ , respectively. For $a=0,\ldots,m-1$ , let $x_{a}=\arg\theta_{a}$ , and let $h_{a}:[0,1)\to\mathbb{R}_{\geq 0}$ be the function defined by

[TABLE]

on $(x_{a},x_{a+1})$ , and having value [math] elsewhere. Then the following facts hold.

(i)

The unique (up to constants) $\widetilde{B}$ -invariant measure on $X$ absolutely continuous with respect to the Lebesgue measure is $\,\mathrm{d}\tilde{\mu}=\pi^{2}\bigl{(}\sin(\pi(x-y))\bigr{)}^{-2}\,\mathrm{d}x\,\mathrm{d}y$ .

(ii)

The unique (up to constants) $B$ -invariant measure on $[0,1)$ absolutely continuous with respect to the Lebesgue measure is $\,\mathrm{d}\mu=\bigl{(}\sum_{a}h_{a}\bigr{)}\,\mathrm{d}x$ .

(iii)

Both systems $(X,\tilde{\mu},\widetilde{B})$ , $([0,1),\mu,B)$ are ergodic and conservative.

Proof.

(i) This is just a change of variables, easily performed in two steps. Let $F_{1},F_{2}:\mathbb{R}^{2}\to\mathbb{R}^{2}$ be defined by

[TABLE]

Then $F_{2}\circ F_{1}$ is a bijection from $[0,1)^{2}\setminus\{\operatorname{diagonal}\}$ to $(\operatorname{P}^{1}\mathbb{R}\times\operatorname{P}^{1}\mathbb{R})\setminus\{\operatorname{diagonal}\}$ ; indeed, it amounts to the componentwise application of $C^{-1}\circ\arg^{-1}$ , with $C$ the Cayley matrix. This implies that the pushforward of the infinite invariant measure $(\omega-\alpha)^{-2}\,\mathrm{d}\omega\,\mathrm{d}\alpha$ of Theorem 4.1 via $\arg\circ\,C$ is $(F_{2}\circ F_{1})^{*}\bigl{(}(\omega-\alpha)^{-2}\,\mathrm{d}\omega\,\mathrm{d}\alpha\bigr{)}$ . One now computes

[TABLE]

(ii) Let $x\in(x_{a},x_{a+1})$ . Then $h_{a}(x)$ is the integral

[TABLE]

of the invariant density in (i) along the fiber $\{x\}\times\bigl{(}[0,x_{a}]\cup[x_{a+1},1]\bigr{)}$ .

(iii) It is easy to check that $B^{2}$ satisfies Thaler’s conditions [47, p. 69(1)–(4)]. This implies that $B^{2}$ is ergodic and conservative; therefore so is $B$ and its natural extension $\widetilde{B}$ [1, Theorem 3.1.7]. ∎

We draw in Figure 5 the invariant density $\sum_{a}h_{a}$ for the map $B$ of Example 6.3. We note that, in case $m=3$ , a direct geometric proof of Theorem 7.2(ii) was given by Kołodziej and Misiurewicz, using Ptolemy’s theorem on quadrilaterals inscribed in a circle [30], [34].

8. The Lagrange theorem

Our next result is a version of Serret’s theorem (two real numbers have the same tail in their continued fraction expansion precisely when they are $\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ -equivalent [24, §10.11], [39]) in modern language.

Theorem 8.1.

The map $B$ and the group $\varGamma^{\pm}_{B}$ are orbit equivalent. More precisely, given $\sigma,\sigma^{\prime}\in S^{1}$ , there exists $\mathscr{A}\in\varGamma^{\pm}_{B}$ such that $\sigma^{\prime}=\mathscr{A}*\sigma$ if and only if there exist $h,k\geq 0$ such that $B^{h}(\sigma)=B^{k}(\sigma^{\prime})$ . In particular, if $\sigma$ belongs to $\mathbb{Q}(i)$ then it is $B$ -terminating, its orbit landing in the unique vertex of $D$ which is $\varGamma_{B}$ -equivalent to $\sigma$ .

Proof.

We begin proving the last assertion, for which the $\partial\mathcal{K}$ setting is expedient. Let then $\bm{s}$ be a rational point, and let $(\bm{w}_{0})_{3},\ldots,(\bm{w}_{m-1})_{3}\in\mathbb{Z}$ be the third coordinates of the points $\bm{w}_{0},\ldots\bm{w}_{m-1}$ of Definition 6.1. We need a preliminary step.

Claim

By conjugating $B$ by an appropriate element of $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}$ , we may assume that $(\bm{w}_{0})_{3},\ldots,(\bm{w}_{m-1})_{3}$ are all greater than [math], with at most one exception that may equal [math].

Proof of Claim

By Lemma 5.1(v), the greater is the arclength of $I_{a}$ , the smaller is $(\bm{w}_{a})_{3}$ , with $(\bm{w}_{a})_{3}=0$ corresponding to arclength $\pi$ . This implies that no more than one of the above third coordinates may be negative or [math]. Say that $(\bm{w}_{a})_{3}<0$ . If $I_{a}$ is even, then by Theorem 5.3 we may conjugate $B$ by the matrix in $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{Z}$ that sends $\bm{w}_{a}$ to $(0,1,0)$ , and we are through. If $I_{a}$ is odd, than we conjugate by the matrix that sends $\bm{w}_{a}$ to $(1,1,1)$ ; the image of $I_{a}$ will then have arclength $\pi/2$ . One of the new third coordinates may now have value [math], but none may have value $-1$ or less, since value $-1$ already corresponds to an arclength of $3\pi/2$ , and the sum of the arclengths would exceed $2\pi$ .

Having proved our claim we perform, if needed, this preliminary conjugation, which does not affect the validity of our statement; renaming indices, we assume $(\bm{w}_{0})_{3}\geq 0$ and $(\bm{w}_{1})_{3},\ldots,(\bm{w}_{m-1})_{3}>0$ . If $\bm{s}$ is one of $\bm{t}_{0},\ldots,\bm{t}_{m-1}$ , we are through. Otherwise, $\bm{s}$ is in the interior of precisely one interval, say $I_{a}$ ; let $\bm{s}^{\prime}=B(\bm{s})$ . Then, lifting $\bm{s}$ and $\bm{s}^{\prime}$ to their canonical representatives (i.e., to pythagorean triples), we have the identity in $\mathbb{Z}^{3}$

[TABLE]

Now, $\langle\bm{w}_{a},\bm{w}_{a}\rangle=1$ since $\bm{w}_{a}\in\mathcal{S}$ , and $\langle\bm{w}_{a},\bm{s}\rangle>0$ since $\bm{s}$ is in the interior of $I_{a}$ . This implies that the third coordinate of $\bm{s}^{\prime}$ is strictly less than the third coordinate of $\bm{s}$ , unless $a=0$ and $(\bm{w}_{0})_{3}=0$ , in which case we have equality. But the third coordinates of $\bm{s}$ and $\bm{s}^{\prime}$ are positive integers, and the exceptional case of equality is always preceded and followed by nonexceptional cases. Hence the process must stop, and this may happen only when the $B$ -orbit of $\bm{s}$ lands in one of the interval endpoints $\bm{t}_{0},\ldots,\bm{t}_{m-1}$ .

For the first assertion, the “if” implication is clear. Assume $\sigma^{\prime}=\mathscr{A}*\sigma$ . If one of $\sigma,\sigma^{\prime}$ is in $\mathbb{Q}(i)$ then so is the other, and by the first part of the proof both $\sigma$ and $\sigma^{\prime}$ land in one of $\theta_{0},\ldots,\theta_{m-1}$ . Since the vertices of $D$ are $\varGamma^{\pm}_{B}$ -inequivalent, they must land in the same $\theta_{a}$ . Let then $\sigma,\sigma^{\prime}\notin\mathbb{Q}(i)$ and $\varphi(\sigma)=\bm{a}$ . As noted in §6, $\mathscr{A}$ factors uniquely as $\mathscr{A}=\mathscr{A}_{b_{0}}\ldots\mathscr{A}_{b_{r-1}}$ , for certain $b_{0},\ldots,b_{r-1}\in\{0,\ldots,m-1\}$ . Let $0\leq h\leq r$ be minimum such that $a_{h}\not=b_{r-1-h}$ . Then

[TABLE]

By (a) in the proof of Lemma 6.5, $\varphi(\sigma^{\prime})=b_{0}\ldots b_{r-1-h}a_{h}a_{h+1}\ldots$ , and $B^{r-h}(\sigma^{\prime})=B^{h}(\sigma)$ . ∎

The bijection between $\partial\mathcal{D}\cap\mathbb{Q}(i)$ and rational points in $\partial\mathcal{K}$ extends to higher degrees.

Lemma 8.2.

Let $\bm{s}=[s_{1},s_{2},s_{3}]\in\partial\mathcal{K}$ correspond to $\sigma=(s_{1}+s_{2}i)/s_{3}\in\partial\mathcal{D}$ as usual, and let $\omega=C^{-1}*\sigma=(\mu\circ\upsilon)(\bm{s})\in\operatorname{P}^{1}\mathbb{R}$ . Then $\mathbb{Q}(\bm{s})=\mathbb{Q}(\omega)$ and $[\mathbb{Q}(\omega):\mathbb{Q}]=[\mathbb{Q}(i)(\sigma):\mathbb{Q}(i)]$ . If $\mathbb{Q}(\omega)/\mathbb{Q}$ is Galois totally real, then the Galois groups $\operatorname{Gal}(\mathbb{Q}(\omega)/\mathbb{Q})$ and $\operatorname{Gal}(\mathbb{Q}(i)(\sigma)/\mathbb{Q}(i))$ are naturally isomorphic. In particular, assume that $\sigma$ is quadratic over $\mathbb{Q}(i)$ and let $\sigma^{\prime}$ be its Galois conjugate. Then $\sigma^{\prime}\in\partial\mathcal{D}$ and $\omega^{\prime}=C^{-1}*\sigma^{\prime}$ is the Galois conjugate of $\omega$ with respect to the quadratic extension $\mathbb{Q}(\omega)/\mathbb{Q}$ .

Proof.

Since the stereographic projection through $[0,1,1]$ is a rational map with rational coefficients, the identity $\mathbb{Q}(\bm{s})=\mathbb{Q}(\omega)$ holds (with the convention that $\mathbb{Q}(\infty)=\mathbb{Q}$ ). All statements follow from elementary Galois theory, as soon as one realizes that $\mathbb{Q}(i,\sigma)=\mathbb{Q}(i,s_{1}/s_{3},s_{2}/s_{3})$ . In this identity the left-to-right containment is obvious, and the other one follows from $s_{1}/s_{3}=(\sigma+\sigma^{-1})/2$ . ∎

The question of the validity of Lagrange’s theorem (preperiodic points correspond to quadratic irrationals) for the Romik map is left open in [42, §5.1]. It can be settled in the affirmative by the result in [38]; see also [14] for this issue, and [13] for diophantine approximation aspects of the Romik map. Here we provide a different proof, valid not only for the Romik map but for all maps based on unimodular partitions. Note that our proof covers not only Lagrange’s, but Galois’s theorem [40, Chapter III]: periodic points correspond to reduced irrationals.

Theorem 8.3.

The point $\sigma\in S^{1}$ is $B$ -preperiodic if and only if it is quadratic over $\mathbb{Q}(i)$ . If this is the case and $a_{0}\ldots a_{h-1}\overline{a_{h}\ldots a_{h+p-1}}$ is the $B$ -symbolic sequence of $\sigma$ (with $p$ the minimal period and $h$ the minimal preperiod, so that $a_{h-1}\not=a_{h+p-1}$ ), then the $B$ -symbolic sequence of the Galois conjugate $\sigma^{\prime}$ is $a_{0}\ldots a_{h-1}\overline{a_{h+p-1}\ldots a_{h}}$ . In particular, the preperiodic $\sigma$ is periodic iff so is $\sigma^{\prime}$ iff $(\sigma,\sigma^{\prime})\in\mathcal{S}_{B}$ .

Proof.

Let $\sigma$ be $B$ -preperiodic. Clearly, for every $\mathscr{A}\in\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ , we have $\mathbb{Q}(i)(\mathscr{A}*\sigma)=\mathbb{Q}(i)(\sigma)$ ; we can then assume that $\sigma$ is $B$ -periodic, with $B$ -symbolic sequence $\overline{a_{0}a_{1}\ldots a_{p-1}}$ . Let $\mathscr{B}=\mathscr{A}_{a_{0}}\mathscr{A}_{a_{1}}\cdots\mathscr{A}_{a_{p-1}}$ . By looking at the decreasing sequence (6.1) in the proof of Lemma 6.5, we obtain

[TABLE]

Since $\mathscr{B}*\sigma$ is also in the above intersection, it equals $\sigma$ , and this yields a quadratic polynomial with coefficients in $\mathbb{Q}(i)$ and having $\sigma$ as root. This polynomial is not the zero polynomial, as $\mathscr{B}$ is not the identity matrix, and is irreducible over $\mathbb{Q}(i)$ because $\sigma$ is $B$ -nonterminating and Theorem 8.1 applies.

Conversely, let $\sigma\in S^{1}$ be quadratic over $\mathbb{Q}(i)$ . By Lemma 8.2 the conjugate $\sigma^{\prime}$ is in $S^{1}$ as well. For $t\geq 0$ , let $\widetilde{B}^{t}(\sigma,\sigma^{\prime})=(\sigma_{t},\sigma^{\prime}_{t})$ , and let $g_{t}$ be the oriented geodesic of origin $\sigma^{\prime}_{t}$ and endpoint $\sigma_{t}$ . By Theorem 7.1 there exists $h\geq 0$ such that, for $0\leq t<h$ , the points $\sigma_{t}$ and $\sigma^{\prime}_{t}$ belong to the same interval (so that $g_{t}$ does not cut the billiard table $D$ ), while $g_{t}$ cuts $D$ for every $t\geq h$ . In particular, the $B$ -symbolic sequences of $\sigma$ and $\sigma^{\prime}$ agree up to time $h-1$ included, and disagree at time $h$ . Let $\omega=C^{-1}*\sigma_{h}$ , $\omega^{\prime}=C^{-1}*\sigma^{\prime}_{h}$ ; since $\sigma_{t}$ and $\sigma^{\prime}_{t}$ are still conjugate in $\mathbb{Q}(i)(\sigma)/\mathbb{Q}(i)$ , by Lemma 8.2 $\omega$ and $\omega^{\prime}$ are conjugate in $\mathbb{Q}(\omega)/\mathbb{Q}$ . Let $O=\{\xi\in\mathbb{Q}(\omega):\xi(\mathbb{Z}\omega+\mathbb{Z})\subseteq\mathbb{Z}\omega+\mathbb{Z}\}$ be the coefficient ring of the module $\mathbb{Z}\omega+\mathbb{Z}$ [8, Chapter 2 §2.2]. Then $O$ is an order in $\mathbb{Q}(\omega)$ with fundamental unit $\varepsilon>1$ , and thus the matrix

[TABLE]

(where $\varepsilon^{\prime}$ is the conjugate of $\varepsilon$ ) is in $\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ .

Now, $\langle F,P,J\rangle=C^{-1}\bigl{(}\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]\bigr{)}C$ is an index- $3$ subgroup of $\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ (see the end of the proof of Theorem 2.4), and $\varGamma^{\pm}_{B}$ is a finite-index subgroup of $\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ (see §6). Hence, replacing $H$ with an appropriate power, we obtain a matrix $\mathscr{H}^{l}=CH^{l}C^{-1}\in\varGamma^{\pm}_{B}$ which induces on $\mathcal{D}$ either a hyperbolic translation of axis $g_{h}$ (if $\det\mathscr{H}^{l}=1$ ), or a glide reflection, again of axis $g_{h}$ (if $\det\mathscr{H}^{l}\not=1$ ). As noted in §6, $\mathscr{H}^{l}$ can be uniquely written as $\mathscr{H}^{l}=\mathscr{A}_{b_{0}}\cdots\mathscr{A}_{b_{q-1}}$ for certain $b_{0},\ldots,b_{q-1}\in\{0,\ldots,m-1\}$ . We claim that $\overline{b_{0}\cdots b_{q-1}}$ and $\overline{b_{q-1}\cdots b_{0}}$ are the $B$ -symbolic sequences of $\sigma_{h}$ and $\sigma^{\prime}_{h}$ , respectively ( $q$ might be a proper multiple of the minimal period $p$ ); this will conclude the proof of Theorem 8.3.

We must have $b_{0}\not=b_{q-1}$ . Indeed, if not, then $\mathscr{H}^{l}$ would factor as

[TABLE]

for some $k\geq 2$ , with $t=(q-k)/2$ and $b_{t}\not=b_{t+k-1}$ . Hence $g_{h}$ would be the $(\mathscr{A}_{b_{0}}\cdots\mathscr{A}_{b_{t-1}})$ -image of the geodesic stabilized by $(\mathscr{A}_{b_{t}}\cdots\mathscr{A}_{b_{t+k-1}})$ , which has endpoints in the two distinct intervals $I_{b_{t}}$ and $I_{b_{t+k-1}}$ . Since $b_{t}$ and $b_{t+k-1}$ are different from $b_{t-1}$ , the endpoints of $g_{h}$ would both lie in $I_{b_{0}}$ , which is impossible since $g_{h}$ cuts $D$ ; therefore $b_{0}\not=b_{q-1}$ .

The sequence $\overline{b_{0}\cdots b_{q-1}}$ satisfies (i) in Lemma 6.5 (because $b_{0}\not=b_{q-1}$ ), as well as (ii) (because otherwise $\mathscr{H}^{l}$ would be a power of some $\mathscr{A}_{a}\mathscr{A}_{a+1}$ and thus would be parabolic, which is not possible because any power of the matrix in (8.2) has trace of absolute value greater than $2$ ). Therefore, $\overline{b_{0}\cdots b_{q-1}}$ is the $B$ -symbolic sequence of a unique point of $S^{1}$ , and this point is necessarily $\sigma_{h}$ , because $\sigma_{h}$ is the ideal endpoint of $g_{h}$ , and thus the shrinking point of

[TABLE]

The same argument, applied to $\mathscr{H}^{-1}=\mathscr{A}_{b_{q-1}}\cdots\mathscr{A}_{b_{0}}$ , shows that $\sigma^{\prime}_{h}$ has $B$ -symbolic sequence $\overline{b_{q-1}\cdots b_{0}}$ . ∎

Example 8.4.

Consider the unimodular partition given by the pythagorean triples

[TABLE]

in Figure 6 we draw the corresponding billiard table by thick geodesics.

Let $q(x,y)=4091x^{2}+1302xy+101y^{2}$ , which has discriminant $D=42440$ . The roots of $q(x,1)$ are

[TABLE]

We work directly on the de Sitter space; by (4.3), $q$ corresponds to

[TABLE]

Since we may safely multiply by a constant, and we prefer working with integer vectors, we multiply by $\sqrt{D}/2$ and define

[TABLE]

By the equivariance between (S1) and (S5) in Theorem 4.1, the billiard map $\widetilde{B}$ on [any dilated copy of] $\mathcal{S}$ is piecewise defined by the following matrices in $\operatorname{SO}_{2,1}\mathbb{Z}$ :

[TABLE]

In order to apply $\widetilde{B}$ we must determine the pair $(\bm{s},\bm{r})\in(S^{1}\times S^{1})\setminus(\operatorname{diagonal})$ associated to $\bm{v}$ , and the interval $I_{a}$ to which $\bm{s}$ belongs. The intervals $I_{0},\ldots,I_{5}$ correspond as in Definition 6.1 to the points in $\mathcal{S}$

[TABLE]

A straightforward computation along the lines of the proof of Theorem 4.1 shows that $\bm{s},\bm{r}$ are given, as a function of $\bm{v}\in(\sqrt{D}/2)\mathcal{S}$ , by

[TABLE]

and that the $3$ rd coordinates $s_{3},r_{3}$ displayed above are always strictly positive. This implies that all values $\langle\bm{w}_{0},\bm{s}\rangle,\ldots,\langle\bm{w}_{5},\bm{s}\rangle$ are strictly negative, with precisely one strictly positive exception. The index $a$ of that exception is the index of the interval $I_{a}$ to which $\bm{s}$ belongs, and thus the index of the matrix $-\bm{A}_{a}$ to be applied.

In our case, $\langle\bm{w}_{4},\bm{s}\rangle=1.64125\ldots$ and $\langle\bm{w}_{4},\bm{r}\rangle=1.94758\ldots$ ; thus both $\bm{s}=\bm{s}_{0}$ and $\bm{r}=\bm{r}_{0}$ lie in $I_{4}$ , and the $\widetilde{B}$ -image of $\bm{v}=\bm{v}_{0}$ is $-\bm{A}_{4}\bm{v}_{0}=(-247,199,-300)=\bm{v}_{1}$ . Repeating the computation we see that both $\bm{s}_{1}$ and $\bm{r}_{1}$ are in $I_{5}$ , so that $\bm{v}_{2}=-\bm{A}_{5}\bm{v}_{1}=(-45,93,8)$ . Now $\bm{s}_{2}$ and $\bm{r}_{2}$ belong to different intervals, namely the $3$ rd and the [math]th; thus $\bm{v}_{2}$ belongs to $\mathcal{S}_{B}$ and the periodicity starts. Proceeding with the computation we obtain

[TABLE]

The $B$ -symbolic sequence of $\omega_{0}$ is thus $45\overline{35420}$ , and that of $\alpha_{0}$ is $45\overline{02453}$ . We draw in Figure 6 the resulting billiard trajectory, along with the two geodesics corresponding to the preperiodic points $\bm{v}_{0}$ and $\bm{v}_{1}$ .

9. Minkowski functions

Let $B:S^{1}\to S^{1}$ be the factor of some fixed billiard map as in Definition 6.2. Clearly $B$ is an orientation-reversing $(m-1)$ -to- $1$ covering map of $S^{1}$ onto itself. The same properties are shared by precisely one continuous group homomorphism $T:S^{1}\to S^{1}$ , namely $T(z)=z^{-(m-1)}$ . In this section we prove that there exists a self-homeomorphism $\Phi$ of $S^{1}$ that conjugates $B$ with $T$ . We provide an explicit expression for $\Phi$ , and prove that $\Phi$ is unique up to postcomposition with the elements of the dihedral group of order $2m$ . In the final section we will show that $\Phi$ is purely singular with respect to the Lebesgue measure on $S^{1}$ , and Hölder continuous with exponent equal to $\log(m-1)$ divided by the maximal periodic mean free path in the hyperbolic billiard associated to $\widetilde{B}$ .

Example 9.1.

The prototype of such homeomorphisms is the Minkowski question mark function, which conjugates the Farey map $x\mapsto\min(x/(1-x),(1-x)/x)$ on $[0,1]$ with the tent map $x\mapsto\min(2x,-2x+2)$ , see [43], [27], [7] and references therein. For an example in our setting, let us consider the unimodular partition determined by $1,i,-1,-i$ ; we have then a “square billiard table”. For ease of visualization we look at $B$ and $T$ as maps from $[0,1)$ to itself; in particular, $T(x)=-3x\pmod{1}$ . We show in Figure 7 (left) the superimposed graphs of $B$ and $T$ , and the resulting function $\Phi$ (right).

As noted in Example 6.3, $B$ is defined via $4$ pieces, with endpoints the indifferent fixed points [math], $1/4$ , $1/2$ , $3/4$ , and has (apparent) discontinuities at [math], $\arg(\mathscr{A}_{1}*1)=\arccos(-3/5)/(2\pi)=0.35241\ldots$ , $\arg(\mathscr{A}_{2}*1)=1-\arg(\mathscr{A}_{1}*1)$ . In this quite specific case $T$ shares the set of fixed points (which of course are now expansive) with $B$ ; the graph of $T$ has (apparent) discontinuities at [math], $1/3$ , $2/3$ . We will return to this example at the end of the paper.

In order to state the next result, we recall that the torsion subgroup $S^{1}_{\text{tor}}$ of $S^{1}$ is the internal direct sum of the Prüfer groups $S^{1}_{\text{$ p $-tor}}=\{\sigma\in S^{1}:\operatorname{ord}(\sigma)\text{ is a power of }p\}$ , for $p$ ranging over the primes. We let $\zeta=\exp(2\pi i/(m-1))$ .

Theorem 9.2.

There exists a homeomorphism $\Phi:S^{1}\to S^{1}$ such that $\Phi\circ B=T\circ\Phi$ . This homeomorphism is unique up to postcomposition with elements of the dihedral group $z\mapsto\zeta^{h}z^{e}$ , with $h\in\{0,\ldots,m-1\}$ and $e\in\{-1,1\}$ . The map $\Phi$ establishes a bijection between the set of points in $S^{1}$ of degree $\leq 2$ over $\mathbb{Q}(i)$ and $S^{1}_{\text{tor}}$ , the set $S^{1}\cap\mathbb{Q}(i)$ corresponding to the direct sum of the subgroup $\langle\zeta\rangle$ generated by $\zeta$ and the finitely many $S^{1}_{\text{$ p $-tor}}$ , for $p\mid m-1$ .

Before proving Theorem 9.2 we need some preliminaries. We already encountered the ternary betweenness relation on $S^{1}$ in §5, and we now introduce the same relation on the index set $\{0,\ldots,m-1\}$ , cyclically ordered in the natural way. The powers of $\zeta$ determine a partition of $S^{1}$ in the half-open intervals $J_{a}=\{\zeta^{a}\}\cup\{x:\zeta^{a}\prec x\prec\zeta^{a+1}\}={[}\zeta^{a},\zeta^{a+1})$ . We define a binary relation $<_{B}$ on $S^{1}$ as follows: $\sigma<_{B}\sigma^{\prime}$ if and only if $\sigma$ and $\sigma^{\prime}$ lie in the same interval $I_{a}$ , for some $a\in\{0,\ldots,m-1\}$ , and $\arg(\sigma)<\arg(\sigma^{\prime})$ . The relation $<_{T}$ is defined in the analogous way, using the intervals $J_{a}$ . Precisely as in Definition 6.4, but using the intervals $J_{a}$ , we introduce the $T$ -symbolic-sequence map $\psi:S^{1}\to\{0,\ldots,m-1\}^{\omega}$ .

Lemma 9.3.

All statements in Lemma 6.5 hold for $\psi$ ; in particular $\varphi$ and $\psi$ have identical range $X\subset\{0,\ldots,m-1\}^{\omega}$ , which is described by (i) and (ii) in that lemma. The betweenness and the $<_{B}$ relations on $S^{1}$ are characterized in terms of $B$ -symbolic sequences and the betweenness relation on $\{0,\ldots,m-1\}$ as follows: let $\varphi(\sigma)=\bm{a}$ , $\varphi(\sigma^{\prime})=\bm{a}^{\prime}$ , $\varphi(\sigma^{\prime\prime})=\bm{a}^{\prime\prime}$ . Then:

(1)

$\sigma<_{B}\sigma^{\prime}$ * if and only if there exists $t\geq 0$ such that:*

(1.1)

$a_{h}=a^{\prime}_{h}$ * for every $0\leq h\leq t$ ,*

(1.2)

$a_{t+1}\not=a^{\prime}_{t+1}$ ,

(1.3)

one of the following mutually exclusive conditions holds:

(1.3.1)

$t$ * is even and ( $a_{t+1}=a_{t}$ or $a_{t+1}\prec a_{t}\prec a^{\prime}_{t+1}$ ),*

(1.3.2)

$t$ * is odd and ( $a^{\prime}_{t+1}=a^{\prime}_{t}$ or $a^{\prime}_{t+1}\prec a_{t}\prec a_{t+1}$ );*

(2)

$\sigma\prec\sigma^{\prime}\prec\sigma^{\prime\prime}$ * if and only if one of the following mutually exclusive conditions holds:*

(2.1)

$a_{0}\prec a^{\prime}_{0}\prec a^{\prime\prime}_{0}$ ,

(2.2)

$a_{0}=a^{\prime}_{0}\not=a^{\prime\prime}_{0}$ * and $\sigma<_{B}\sigma^{\prime}$ ,*

(2.3)

$a_{0}\not=a^{\prime}_{0}=a^{\prime\prime}_{0}$ * and $\sigma^{\prime}<_{B}\sigma^{\prime\prime}$ ,*

(2.4)

$a_{0}=a^{\prime}_{0}=a^{\prime\prime}_{0}$ * and $\sigma<_{B}\sigma^{\prime}$ and $\sigma^{\prime}<_{B}\sigma^{\prime\prime}$ .*

We have an analogous characterization of betweenness and $<_{T}$ in terms of $T$ -symbolic sequences.

Proof.

The proof of Lemma 6.4 easily extends to the case of the map $T$ . Apart from the obvious modifications (use $J_{a}$ for $I_{a}$ , and $\zeta^{a}$ for $\theta_{a}$ ), one has to replace the occurrences of $B$ with occurrences of $T$ , and those of $\mathscr{A}_{a}$ with $T_{a}^{-1}$ , the latter being the $a$ th inverse branch of $T$ , i.e., the map that associates to $\sigma\in\bigcup_{b\not=a}\overline{J}_{b}$ its unique $-(m-1)$ th root lying in $\overline{J}_{a}$ . The fact that no $T$ -symbolic sequence has tail $\overline{a(a+1)}$ is easy; indeed, any point having that symbolic sequence should jump forever from $J_{a}$ to $J_{a+1}$ . But at each jump its arclength distance from the fixed point $\zeta^{a+1}$ increases by a factor $m-1$ , so the point will eventually escape from $J_{a}\cup J_{a+1}$ . Finally, the analogue of the sequence (6.1) surely shrinks to a singleton, because at each step the arclengths shrink by a factor $m-1$ . With these modifications, the proof carries through verbatim.

We prove statement (1). Suppose $\sigma$ and $\sigma^{\prime}$ are different, but lie in the same interval $I_{a_{0}}$ . Then there exists $t\geq 0$ such that for $t$ steps the successive $B$ -images of $\sigma$ and $\sigma^{\prime}$ keep on lying in the same interval, while $B^{t+1}(\sigma)$ and $B^{t+1}(\sigma^{\prime})$ lie in the different intervals $I_{a_{t+1}}$ and $I_{a^{\prime}_{t+1}}$ , respectively. Since $B$ is orientation-reversing, $\sigma<_{B}\sigma^{\prime}$ if and only if either $t$ is even and $B^{t}(\sigma)<_{B}B^{t}(\sigma^{\prime})$ , or $t$ is odd and $B^{t}(\sigma^{\prime})<_{B}B^{t}(\sigma)$ . We can then assume without loss of generality $t=0$ , and observe that $\sigma<_{B}\sigma^{\prime}$ holds if and only if $\sigma=\theta_{a_{0}}$ (which is equivalent to $a_{1}=a_{0}$ ), or $B(\sigma)\prec\theta_{a_{0}}\prec B(\sigma^{\prime})$ (which is equivalent to $a_{1}\prec a_{0}\prec a^{\prime}_{1}$ , since now $B(\sigma)$ and $B(\sigma^{\prime})$ lie in different intervals, both different from $I_{a_{0}}$ ).

Statement (2) is clear, as is the fact that all of the proof applies to the map $T$ . ∎

Proof of Theorem 9.2.

Let $S$ be the shift on $X=\varphi[S^{1}]=\psi[S^{1}]$ , and define $\Phi=\psi^{-1}\circ\varphi$ . Then the inner squares in

[TABLE]

commute, so the outer rectangle commutes as well. Let $\sigma$ , $\sigma^{\prime}$ , $\sigma^{\prime\prime}$ be distinct points of $S^{1}$ . Then $\sigma\prec\sigma^{\prime}\prec\sigma^{\prime\prime}$ holds if and only if the conditions of Lemma 9.3 apply to $\varphi(\sigma)$ , $\varphi(\sigma^{\prime})$ , $\varphi(\sigma^{\prime\prime})$ . By construction, $\varphi(\sigma)=\psi\bigl{(}\Phi(\sigma)\bigr{)}$ and analogously for $\sigma^{\prime}$ and $\sigma^{\prime\prime}$ ; therefore $\sigma^{\prime}$ is between $\sigma$ and $\sigma^{\prime\prime}$ if and only if $\Phi(\sigma^{\prime})$ is between $\Phi(\sigma)$ and $\Phi(\sigma^{\prime\prime})$ . Since the topology of $S^{1}$ is definable in terms of betweenness, $\Phi$ is a homeomorphism.

Let $\Phi_{1}$ be any homeomorphism that makes the outer rectangle in (9.1) commute. For every $h\in\{0,\ldots,m-1\}$ and every $e\in\{1,-1\}$ , the map $Q(z)=\zeta^{h}z^{e}$ commutes with $T$ , so that $Q\circ\Phi_{1}$ too makes the outer rectangle commute. We therefore assume that $\Phi_{1}$ is orientation-preserving and fixes $1$ , and prove $\Phi_{1}=\Phi$ . As $\Phi_{1}$ and $\Phi$ are homeomorphisms and the set of $B$ -terminating points is dense in $S^{1}$ , it is enough to show that $\Phi_{1}$ agrees with $\Phi$ on this set; in other words, that if $\sigma$ has $B$ -symbolic sequence $a_{0}\ldots a_{t-1}\overline{a_{t}}$ with $a_{t-1}\not=a_{t}$ , then $\Phi_{1}(\sigma)$ has $T$ -symbolic sequence $a_{0}\ldots a_{t-1}\overline{a_{t}}$ .

We work by induction on $t$ . If $t=0$ , then $\sigma=\theta_{a_{0}}$ . Since $\Phi_{1}$ is orientation-preserving, sends the set $\{\theta_{0},\ldots,\theta_{m-1}\}$ of $B$ -fixed points to the set $\{\zeta^{0},\ldots,\zeta^{m-1}\}$ of $T$ -fixed points, and fixes $1=\theta_{0}=\zeta^{0}$ , we have $\Phi_{1}(\theta_{a})=\zeta^{a}$ for every $a$ . In particular, $\Phi_{1}(\sigma)=\zeta^{a_{0}}$ , which has $T$ -symbolic sequence $\overline{a_{0}}$ . Let $t>0$ ; then $a_{0}\not=a_{1}$ , which implies $\sigma\not=\theta_{a_{0}}$ and $\Phi_{1}(\sigma)\not=\zeta^{a_{0}}$ . By the inductive hypothesis, the statement is true for all points that land in a $B$ -fixed point in $t-1$ steps. Since $B(\sigma)$ is one of these points, we have

[TABLE]

Thus $\psi\bigr{(}\Phi_{1}(\sigma)\bigr{)}=ba_{1}\ldots a_{t-1}\overline{a_{t}}$ for some $b$ , and we must show $b=a_{0}$ . Suppose not; then we have $\zeta^{a_{0}}\prec\zeta^{b}\prec\Phi_{1}(\sigma)$ , while $\zeta^{a_{0}}\prec\Phi(\sigma)\prec\zeta^{b}$ . Applying the order-preserving homeomorphism $\Phi_{1}^{-1}$ to the former relation, and $\Phi^{-1}$ to the latter, we get $\theta_{a_{0}}\prec\theta_{b}\prec\sigma$ and $\theta_{a_{0}}\prec\sigma\prec\theta_{b}$ , which is impossible; therefore $b=a_{0}$ and our first statement is proved.

By Theorems 8.1 and 8.3 the set of points in $S^{1}$ of degree $1$ (respectively, $2$ ) over $\mathbb{Q}(i)$ is the set of $B$ -terminating (respectively, $B$ -preperiodic) points. Their $\Phi$ -images are then the $T$ -terminating (respectively, $T$ -preperiodic) points. It is easily seen the every $T$ -terminating or $T$ -preperiodic point must have the form $\exp(2\pi iq)$ for some rational number $q$ , i.e., must lie in $S^{1}_{\text{tor}}$ . We have the decomposition $S^{1}_{\text{tor}}=H_{1}\cdot H_{2}$ , where $H_{1}$ (respectively, $H_{2}$ ) is the inner sum of all Prüfer groups $S^{1}_{\text{$ p $-tor}}$ with $p\nmid m-1$ (respectively, $p\mid m-1$ ). Now, given $\sigma\in S^{1}_{\text{tor}}$ , repeated applications of $T$ kill the $H_{2}$ part, and as soon as this happens the periodicity starts. More precisely, let $h\geq 0$ be minimum such that $T^{h}(\sigma)\in H_{1}$ . Then $T^{h}(\sigma)$ is $T$ -periodic, because raising to the $-(m-1)$ th power is an automorphism of $H_{1}$ of finite order. In particular, $\sigma$ is $T$ -terminating if and only if $T^{h}(\sigma)$ is a fixed point, i.e., a power of $\zeta$ . Thus, $\sigma$ is $T$ -terminating precisely when it belongs to $\langle\zeta\rangle\cdot H_{2}$ . ∎

We note as an aside that the pushforward probability measure $\Phi^{-1}_{*}\lambda$ , where $\lambda$ is the Lebesgue measure on the circle, is $B$ -invariant, and is the measure of maximal entropy for $B$ .

For the rest of this paper we consider $B$ , $T$ , $\Phi$ as selfmaps of $[0,1)$ , as in Figure 7. This improves visualization, and makes $\Phi=\psi^{-1}\circ\varphi$ the unique homeomorphism of $[0,1)$ (with the topology inherited from $\mathbb{R}$ , not from $S^{1}$ ) that conjugates $B$ with $T$ . Accordingly, $<$ will now denote the standard non-circular orders on $[0,1)$ and on $\{0,\ldots,m-1\}$ . We will abuse language by writing $I_{a}$ and $J_{a}$ for the $\arg$ -images in $[0,1)$ of the intervals $I_{a}$ and $J_{a}$ of $S^{1}$ .

In the next Theorem 9.4 we provide an explicit formula for $\Phi(x)$ , analogous to the Denjoy-Salem formula for the classical case [19], [43, pp. 435-436], and to the formula in [7, Theorem 1] for the Minkowski function induced by the Romik map. We define a function $d:\{0,\ldots,m-1\}^{2}\setminus\{\operatorname{diagonal}\}\to\{0,\ldots,m-1\}$ by

[TABLE]

Theorem 9.4.

Let $x\in[0,1)$ have $B$ -symbolic sequence $\bm{a}$ . Then

[TABLE]

Proof.

The statement amounts to saying that $\psi^{-1}(\bm{a})$ equals the value of the absolutely convergent series on the right-hand side of (9.2). By construction,

[TABLE]

where $T_{a_{t}}^{-1}$ is the $a_{t}$ th inverse branch of $T$ discussed in the proof of Lemma 9.3 (instead of [math], any point in $[0,1)$ would do). We recall that, by definition, $T_{a}^{-1}$ is that inverse branch of $T$ that sends $\bigcup_{b\not=a}\overline{J}_{b}$ onto $\overline{J}_{a}$ . Here a picture may help: rotate the graph of $T$ in Figure 7 (left) along the diagonal, and look at its $m=4$ inverse branches, the first two being

[TABLE]

A brief pondering over such a picture shows that $T_{a}^{-1}(x)$ equals $-x/(m-1)+(a+1)/(m-1)$ on $\bigcup_{b>a}\overline{J}_{b}$ , and equals $-x/(m-1)+a/(m-1)$ on $\bigcup_{b<a}\overline{J}_{b}$ ; in short,

[TABLE]

Applying induction to the above formula one easily proves that

[TABLE]

(where we set $a_{n}=0$ ), and the statement follows by letting $n$ tend to infinity. ∎

If $x$ is $B$ -preperiodic, (9.2) yields a finite expression for $\Phi(x)$ . Indeed, writing for short $d_{t}=d(a_{t},a_{t+1})$ and $\bm{d}=d_{0}d_{1}\ldots$ , we have that the map $\bm{a}\mapsto\bm{d}$ is shift-invariant; in particular, it sends preperiodic sequences to preperiodic ones. Hence, for $\bm{a}=\varphi(x)$ and $\bm{d}=\bm{d}(\bm{a})=d_{0}\ldots d_{h-1}\overline{d_{h}\ldots d_{h+p-1}}$ we set

[TABLE]

and obtain by a straightforward computation

[TABLE]

Example 9.5.

The point $\omega_{0}$ of Example 8.4 has $B$ -symbolic sequence $\bm{a}=45\overline{35420}$ , and $m=6$ . Thus $\bm{d}=55\overline{45421}$ and, applying (9.3),

[TABLE]

Multiplying successively by $-(m-1)=-5$ , and working in $\mathbb{Q}/\mathbb{Z}\simeq S^{1}_{\text{tor}}$ , the summand $1/3$ is fixed (because $-5\equiv 1$ modulo $3$ ), and $11/25$ gets killed in two steps. So it only remains the summand $27/521$ , which yields a periodic orbit of length $5$ (because $-5$ has order $5$ modulo $521$ ), as expected.

The Galois conjugate $\alpha_{0}$ of $\omega_{0}$ has $B$ -symbolic sequence $\bm{a}^{\prime}=45\overline{02453}$ and

[TABLE]

with identical dynamical behaviour. The appearance of the same primes at the denominators is not surprising. Indeed, given a periodic orbit of length $p$ , a simple computation shows that the only primes whose powers may appear as denominators of summands are those dividing $(m-1)^{p}+(-1)^{p+1}$ , in our case $2$ , $3$ , $521$ .

10. Singularity and Hölder exponent

We maintain the setting described before Theorem 9.4. Since $\Phi$ is a monotonically increasing homeomorphism of $[0,1)$ , it is differentiable $\lambda$ -a.e. ( $\lambda$ referring to the Lebesgue measure) with finite derivative.

Theorem 10.1.

The function $\Phi$ is purely singular (i.e., $\Phi^{\prime}=0$ $\lambda$ -a.e.).

We need a preliminary lemma, for which we refer to the notation introduced in Definition 6.1.

Lemma 10.2.

For every $a$ , we have $\bm{w}_{a-1}+\bm{w}_{a}=q_{a}\bm{t}_{a}$ for some $q_{a}\in\mathbb{Z}_{>0}$ . Moreover, the identities

[TABLE]

hold.

Proof.

It is easy to show that $\langle\bm{w}_{a-1},\bm{w}_{a}\rangle=-1$ ; for example, applying an appropriate element of $\operatorname{SO}^{\uparrow}_{2,1}\mathbb{R}$ we may assume $\bm{t}_{a-1}=[0,-1,1]$ , $\bm{t}_{a}=[1,0,1]$ , $\bm{t}_{a+1}=[0,1,1]$ , and compute directly. As a consequence, $\langle\bm{w}_{a-1}+\bm{w}_{a},\bm{w}_{a-1}+\bm{w}_{a}\rangle=1-2+1=0$ , and $\bm{w}_{a-1}+\bm{w}_{a}$ lies on the isotropic cone of the Lorentz form. By the formula (5.1), the plane tangent to this cone at $\bm{t}_{a}$ contains both $\bm{w}_{a-1}$ and $\bm{w}_{a}$ ; hence $\bm{w}_{a-1}+\bm{w}_{a}$ must be an integer multiple of $\bm{t}_{a}$ . We thus have $\bm{w}_{a-1}+\bm{w}_{a}=q_{a}\bm{t}_{a}$ for some $q_{a}\in\mathbb{Z}$ , and must prove $q_{a}>0$ . Now, we can surely construct a parabolic transformation $\bm{P}\in\operatorname{SO}^{\uparrow}_{2,1}\mathbb{R}$ that fixes $\bm{t}_{a}$ and is such that $I_{\bm{P}\bm{w}_{a-1}}$ and $I_{\bm{P}\bm{w}_{a}}$ have both arclength strictly less than $\pi$ . By Lemma 5.1(v), $\bm{P}\bm{w}_{a-1}$ and $\bm{P}\bm{w}_{a}$ have both strictly positive third coordinate. Since $\bm{P}\bm{w}_{a-1}+\bm{P}\bm{w}_{a}=q_{a}\bm{t}_{a}$ and $\bm{t}_{a}$ has positive third coordinate too, $q_{a}$ must be strictly positive.

For the second statement we observe that $\bm{t}_{a}$ is a fixed point of $\bm{A}_{a-1}=\bm{R}_{\bm{w}_{a-1}}$ , as well as of $\bm{A}_{a}=\bm{R}_{\bm{w}_{a}}$ . We thus compute $\bm{A}_{a-1}\bm{w}_{a}=\bm{A}_{a-1}(-\bm{w}_{a-1}+q_{a}\bm{t}_{a})=\bm{w}_{a-1}+q_{a}\bm{t}_{a}$ , and analogously for the other identity in (10.1). ∎

Let $x\in[0,1)$ have $B$ -symbolic sequence $\bm{a}$ . If, for some $t\geq 0$ , we have $a_{t}=a_{t+2}$ while $a_{t+1}\in\{a_{t}-1,a_{t}+1\}$ , then we say that $x$ moves parabolically at time $t$ .

Proof of Theorem 10.1.

Let $\mu$ be the infinite measure induced by the density $\sum_{a}h_{a}$ of Theorem 7.2(ii). Since $([0,1),\mu,B)$ is ergodic and conservative, by the Halmos version of the Poincaré recurrence theorem the set $P$ of points that move parabolically at infinitely many times has full $\mu$ -measure. As $\sum_{a}h_{a}$ is bounded from below by some positive constant, $\mu(P^{c})=0$ implies $\lambda(P^{c})=0$ . In particular, the set $P^{\prime}$ of points $x$ that move parabolically at infinitely many times, and are such that $\Phi^{\prime}(x)$ exists finite, has full Lebesgue measure. We claim that $\Phi^{\prime}(x)=0$ for every $x\in P^{\prime}$ .

Fix such an $x$ , and let $\bm{a}$ be its $B$ -symbolic sequence. Then, for each $t\geq 0$ , $x$ belongs to the cylinder $B_{a_{0}}^{-1}\cdots B_{a_{t-1}}^{-1}[I_{a_{t}}]$ , whose closure is the $\arg$ -image of $\bm{A}_{a_{0}}\cdots\bm{A}_{a_{t-1}}[I_{\bm{w}_{a_{t}}}]$ . To be fully precise we clarify that, according to Definition 6.2, $I_{a}$ is the half-open interval $[\bm{t}_{a},\bm{t}_{a+1})$ (or, here, its $\arg$ -image), while $I_{\bm{w}_{a}}$ is, as defined in §5, the closed interval $[\bm{t}_{a},\bm{t}_{a+1}]$ . However, our fixed $x$ is surely not $B$ -terminating, so interval endpoints are of no concern here.

It is easy to show that

[TABLE]

Suppose by contradiction that the above limit is different from [math]. Then, taking the quotient of two consecutive terms and multiplying by $m-1$ , we obtain

[TABLE]

Up to a factor of $2\pi$ , the length of $B_{a_{0}}^{-1}\cdots B_{a_{t}}^{-1}[I_{a_{t+1}}]$ equals the arclength of $\bm{A}_{a_{0}}\cdots\bm{A}_{a_{t}}[I_{\bm{w}_{a_{t+1}}}]$ which, by Lemma 5.1(vii), is asymptotic to the inverse of $(\bm{A}_{a_{0}}\cdots\bm{A}_{a_{t}}\bm{w}_{a_{t+1}})_{3}$ , the index $3$ referring to the $3$ rd coordinate. Therefore, writing $\bm{A}_{a_{0}}\cdots\bm{A}_{a_{t-1}}=\bm{C}_{t-1}$ for short, we have

[TABLE]

Assume now that $t$ is a parabolic time and write $a_{t}=a_{t+2}=a$ ; without loss of generality $a_{t+1}=a-1$ . Using Lemma 10.2 and observing that $\bm{A}_{a}\bm{t}_{a}=\bm{t}_{a}$ , we compute

[TABLE]

Since $(\bm{C}_{t-1}\bm{w}_{a})_{3}$ is eventually positive (actually, it goes to infinity for $t\to\infty$ ), the last term in the above chain of equalities is less than $2$ for all sufficiently large parabolic times. If $m\geq 4$ this contradicts (10.2) and establishes Theorem 10.1.

If $m=3$ we need one more parabolic iteration. Namely, we redefine a parabolic time as a time $t$ at which the $B$ -symbolic sequence of $x$ has the form either $a(a-1)a(a-1)a$ or $a(a+1)a(a+1)a$ . Then the chain of equalities in (10.3) starts with

[TABLE]

and ends up with

[TABLE]

which is eventually less than $4/3$ , again contradicting (10.2). ∎

In §6 we set $\varGamma^{\pm}_{B}=\langle\mathscr{A}_{0},\ldots,\mathscr{A}_{m-1}\rangle<\operatorname{PSU}^{\pm}_{1,1}\mathbb{Z}[i]$ ; let us now define $\Gamma^{\pm}_{B}=C^{-1}\varGamma^{\pm}_{B}C=\langle A_{0},\ldots,A_{m-1}\rangle<\operatorname{PSL}^{\pm}_{2}\mathbb{Z}$ and $\bm{\Gamma}^{\pm}_{B}=\langle\bm{A}_{0},\ldots,\bm{A}_{m-1}\rangle<\operatorname{O}^{\uparrow}_{2,1}\mathbb{Z}$ ; see the diagram (5.2). Let $A\in\Gamma^{\pm}_{B}$ ; then $A^{2}$ has positive determinant and is conjugate to a matrix either of the form $\bigl{[}\begin{smallmatrix}\exp(t/2)&\\ &\exp(-t/2)\end{smallmatrix}\bigr{]}$ or of the form $\bigl{[}\begin{smallmatrix}1&t\\ &1\end{smallmatrix}\bigr{]}$ ( $\Gamma_{B}$ does not contain elliptic elements). The formulas in (2.2) show immediately that the spectral radius $\rho(\bm{A}^{2})$ of $\bm{A}^{2}$ is the square of the spectral radius of $A^{2}$ ; taking square roots we obtain $\rho(\bm{A})=\rho(A)^{2}$ .

We fix a lifting —whose choice is irrelevant— of $A_{0},\ldots,A_{m-1}$ to $\operatorname{SL}^{\pm}_{2}\mathbb{Z}$ , and we denote by $\Sigma^{k}$ (respectively, $\bm{\Sigma}^{k}$ ) the set of all products of $k$ elements of $\Sigma=\Sigma^{1}=\{A_{0},\ldots,A_{m-1}\}$ (respectively, $\{\bm{A}_{0},\ldots,\bm{A}_{m-1}\}$ ), repetitions allowed. We recall that the joint spectral radius of $\Sigma$ is the number

[TABLE]

where $\lVert\phantom{A}\rVert$ is the operator norm induced by some vector norm, whose choice is irrelevant; see [5], [21], [23] for a detailed treatment. By the Berger-Wang theorem

[TABLE]

and the previous remarks imply that $\rho(\bm{\Sigma})=\rho(\Sigma)^{2}$ .

The finiteness conjecture [31, p. 19] states the following:

•

For every finite set of matrices $\Pi$ there exists $k\geq 1$ and $A\in\Pi^{k}$ such that $\rho(\Pi)=\rho(A)^{1/k}$ .

Although the conjecture has been refuted in [9], counterexamples are difficult to construct, and are widely believed to be rare; see [26] for a detailed discussion and references to the literature. We do not know if the sets $\Sigma=\{A_{0},\ldots,A_{m-1}\}$ defining our billiard maps always satisfy the conjecture. However, for any specific example we examined it was easy to guess an appropriate $k$ and $A\in\Sigma^{k}$ , and the guess was proved correct by explicitly constructing an appropriate matrix norm; see Example 10.6.

Definition 10.3.

Let $(\sigma,\rho)\in\mathcal{S}_{B}$ , and let $\gamma:\mathbb{R}\to\mathcal{D}$ be the geodesic path of ideal endpoints $\gamma(-\infty)=\rho$ and $\gamma(+\infty)=\sigma$ , parametrized by arclength, and entering the table $D$ at $t=0$ . Then $\gamma$ descends to a billiard trajectory $\bar{\gamma}:\mathbb{R}\to D=\varGamma^{\pm}_{B}\backslash\mathcal{D}$ , and we define the mean free path of $\bar{\gamma}$ to be

[TABLE]

provided that the limit exists (it surely does if $\bar{\gamma}$ is periodic).

Theorem 10.4.

For $\tilde{\mu}$ -every $(\sigma,\rho)$ , the mean free path of $\bar{\gamma}$ equals [math]. The supremum of the family of mean free paths of periodic trajectories equals $2\log(\rho(\Sigma))$ , and this supremum is a maximum if and only if the finiteness conjecture holds for $\Sigma$ .

Proof.

Let $f:\mathcal{S}_{B}\to\mathbb{R}_{>0}$ be defined by $f(\sigma,\rho)=\sup\{t>0:\gamma(t)\in D\}$ , where $\gamma$ depends on $(\sigma,\rho)$ as in Definition 10.3. Then the integral of $f$ with respect to $\tilde{\mu}$ is finite, since it equals one half of the volume of the unit tangent bundle of $\varGamma_{B}\backslash\mathcal{D}$ . Since the measure-preserving system $(\mathcal{S}_{B},\tilde{\mu},\widetilde{B})$ is conservative, a basic result of infinite ergodic theory [25, §4] yields that for $\tilde{\mu}$ -every $(\sigma,\rho)$ we have

[TABLE]

As the limit above is precisely the free mean path of $\bar{\gamma}$ , our first statement follows.

Let $M=\sup\{\operatorname{mfp}(\bar{\gamma}):\text{$ \bar{\gamma} $is a periodic billiard trajectory}\}$ . Given $k\geq 3$ , let $A$ have maximum spectral radius in $\Sigma^{k}$ . Surely $A^{2}$ cannot be parabolic and, by the unique factorization of $A$ as a product of elements in $\Sigma$ , we see that there exists $B=A_{b_{0}}\cdots A_{b_{h-1}}\in\Sigma^{h}$ such that $2\leq h\leq k$ , $b_{0}\not=b_{h-1}$ , and $A$ is conjugate to $B$ . Define $\gamma:\mathbb{R}\to\mathcal{D}$ by $\gamma(t)=CB*\exp(ti)$ , where $C$ is the Cayley matrix. Then $\gamma$ descends to a $h$ -bounces periodic billiard trajectory $\bar{\gamma}$ on $D$ , which we claim to have length $2\log(\rho(B))$ . Indeed, if $h$ is even then $B$ is hyperbolic; thus, by the proof of [12, Proposition 1], $\bar{\gamma}$ has length $2\operatorname{arccosh}(\lvert\operatorname{tr}B\rvert/2)$ , which is indeed $2\log(\rho(B))$ . If $h$ is odd, then we replace $B$ with $B^{2}$ and obtain that $\bar{\gamma}$ has length $\log(\rho(B^{2}))$ , which again equals $2\log(\rho(B))$ . As $\bar{\gamma}$ involves $h$ bounces, we have $\operatorname{mfp}(\bar{\gamma})=2\log(\rho(B)^{1/h})$ ; we conclude that $2\log(\rho(A)^{1/k})\leq 2\log(\rho(B)^{1/h})=\operatorname{mfp}(\bar{\gamma})$ , and thus $2\log(\rho(\Sigma))\leq M$ .

Conversely, any periodic trajectory $\bar{\gamma}$ involving $k$ bounces can be lifted (nonuniquely) to a unit speed geodesic path $\gamma:\mathbb{R}\to\mathcal{D}$ . The $B$ -symbolic sequence $\bm{a}$ of $\gamma(+\infty)=\sigma\in S^{1}$ is periodic of period $k$ and the argument above, applied to $A=A_{a_{0}}\cdots A_{a_{k-1}}$ , shows that $\bar{\gamma}$ has mean free path $2\log(\rho(A)^{1/k})$ ; therefore $M\leq 2\log(\rho(\Sigma))$ . ∎

Theorem 10.5.

The function $\Phi$ is Hölder continuous of exponent

[TABLE]

If the finiteness conjecture holds for $\Sigma$ , then $\alpha$ is the best Hölder exponent (i.e., $\Phi$ is not Hölder continuous of exponent $\beta$ , for any $\beta>\alpha$ ).

Proof.

Let $\lVert\bm{x}\rVert=\max\{\lvert x_{1}\rvert,\lvert x_{2}\rvert,\lvert x_{3}\rvert\}$ denote the $\infty$ -norm in $\mathbb{R}^{3}$ ; note that $\lVert\bm{x}\rVert=\lvert x_{3}\rvert$ on $\mathcal{S}\cap\mathbb{Z}^{3}$ , exception being made for the four points $(\pm 1,0,0)$ , $(0,\pm 1,0)$ only. As noted in the proof of Theorem 10.1, the closure of the cylinder $B_{a_{0}}^{-1}\cdots B_{a_{k-1}}^{-1}[I_{a_{k}}]$ is the $\arg$ -image of $\bm{A}_{a_{0}}\cdots\bm{A}_{a_{k-1}}[I_{\bm{w}_{a_{k}}}]$ . Taking into account Lemma 5.1(iii) and (vii), the length of the former is asymptotic, as $k$ increases, to $\pi^{-1}\lVert\bm{A}_{a_{0}}\cdots\bm{A}_{a_{k-1}}\bm{w}_{a_{t}}\rVert^{-1}$ . Once fixed a constant $C>\pi\max\{\lVert\bm{w}_{a_{0}}\rVert,\ldots,\lVert\bm{w}_{a_{m-1}}\rVert\}>1$ , this implies that there exists a level $k_{0}$ such that, for every $k\geq k_{0}$ and every cylinder $B_{a_{0}}^{-1}\cdots B_{a_{k-1}}^{-1}[I_{a_{k}}]$ of level $k$ , we have

[TABLE]

where the matrix norm is the one induced by the vector norm.

Fix now $\varepsilon>0$ . Then there exists $k_{1}\geq k_{0}$ such that, for every $k\geq k_{1}$ and every matrix $\bm{A}_{a_{0}}\cdots\bm{A}_{a_{k-1}}\in\bm{\Sigma}^{k}$ , we have $\rho(\bm{\Sigma})+\varepsilon>\lVert\bm{A}_{a_{0}}\cdots\bm{A}_{a_{k-1}}\rVert^{1/k}$ . Let $0\leq x<x^{\prime}<1$ be such that

[TABLE]

Let $k\geq k_{1}$ be minimum such that the interval $[x,x^{\prime}]$ contains a cylinder $B_{a_{0}}^{-1}\cdots B_{a_{k-1}}^{-1}[I_{a_{k}}]$ of level $k$ ; then we have

[TABLE]

which implies

[TABLE]

On the other hand, the interval $[x,x^{\prime}]$ may contain at most $1+(m-2)+(m-2)=2m-3$ endpoints of cylinders of level $k$ ; therefore

[TABLE]

which implies

[TABLE]

Eliminating $k$ from (10.4) and (10.5) and rearranging terms, we obtain

[TABLE]

whence

[TABLE]

where

[TABLE]

We thus obtained

[TABLE]

Since $E$ does not depend on $\varepsilon$ , we let $\varepsilon$ tend to [math] and obtain the Hölder condition $\Phi x^{\prime}-\Phi x\leq E(x^{\prime}-x)^{\alpha}$ , valid for $x^{\prime}-x\leq l_{1}$ (remember that $\rho(\bm{\Sigma})=\rho(\Sigma)^{2}$ ). Replacing $E$ with $\max\{E,l_{1}^{-\alpha}\}$ , the condition holds for every pair $x<x^{\prime}$ .

Assume now that the finiteness conjecture holds for $\bm{\Sigma}$ , and let $\bm{A}=\bm{A}_{a_{0}}\cdots\bm{A}_{a_{k-1}}\in\bm{\Sigma}^{k}$ be a maximizing matrix (i.e., $\rho(\bm{\Sigma})=\rho(\bm{A})^{1/k}$ ). We must have $a_{0}\not=a_{k-1}$ , since otherwise $\bm{A}$ would be conjugate to a matrix $\bm{B}$ in $\bm{\Sigma}^{k-2}$ and we would have $\rho(\bm{B})^{1/(k-2)}>\rho(\bm{A})^{1/k}=\rho(\bm{\Sigma})$ , which is impossible. The eigenvalues of $\bm{A}$ are $(-1)^{k}$ , $\rho(\bm{A})$ , and $\rho(\bm{A})^{-1}$ ; let $\bm{v}_{1},\bm{v}_{2},\bm{v}_{3}$ be the corresponding eigenvectors. The vector $\bm{w}_{a_{0}}$ cannot lie in the subspace spanned by $\bm{v}_{1}$ and $\bm{v}_{3}$ , because $\lVert\bm{A}^{n}\bm{w}_{a_{0}}\rVert\to\infty$ for $n\to\infty$ . This easily implies that the length of the cylinder $(B_{a_{0}}^{-1}\cdots B_{a_{k-1}}^{-1})^{n}[I_{a_{0}}]$ , of level $kn$ and endpoints $x_{n}<x^{\prime}_{n}$ , is asymptotic to $C\rho(\bm{A})^{-n}$ as $n\to\infty$ , for some constant $C$ . But then, for any $\varepsilon>0$ ,

[TABLE]

because $\rho(\bm{\Sigma})^{\alpha+\varepsilon}>m-1$ implies $\rho(\bm{A})^{\alpha+\varepsilon}>(m-1)^{k}$ , and thus $(m-1)^{-k}/\rho(\bm{A})^{-(\alpha+\varepsilon)}>1$ . ∎

Example 10.6.

Consider the square billiard table of Example 9.1. By the symmetries of the table, the graph of the induced Minkowski function $\Phi$ in Figure 7 (right) results from the gluing of four identical pieces, the fourth piece corresponding to the interval $[-i,1]$ in $S^{1}$ . Since the foldings $\bm{F},\bm{J}\bm{F},\bm{J}$ involved in the construction of the Romik map in §3 are isometries, it is not difficult to realize that this fourth piece is conjugate via stereographic projection from $[0,1,1]$ to the Minkowski function $Q_{E}$ introduced in [7] for the Romik map. As the above stereographic projection is a Lipschitz bijection with Lipschitz inverse between $[-i,1]$ and $[0,1]$ , the Hölder exponents of $\Phi$ and of $Q_{E}$ must agree.

The set $\Sigma$ contains the four matrices

[TABLE]

By looking at our square billiard table, we obviously conjecture that the maximum periodic mean free path should be realized by bouncing between two opposite walls; in other words, that the finiteness conjecture should hold for $\Sigma$ , with witnessing matrix $A_{3}A_{1}\in\Sigma^{2}$ (or its conjugate $A_{2}A_{0}$ ).

Denote by $\lVert\phantom{A}\rVert_{2}$ the spectral norm on $2\times 2$ real matrices induced by the euclidean norm on $\mathbb{R}^{2}$ . Then, as it is well known, $\lVert A\rVert_{2}=\rho(A^{\top}A)^{1/2}$ , and one checks immediately that $\lVert A_{a}\rVert_{2}=\sqrt{3+\sqrt{8}}$ for every $a\in\{0,1,2,3\}$ . Since $\rho(A_{3}A_{1})^{1/2}\leq\rho(\Sigma)\leq\max\{\lVert A\rVert_{2}:A\in\Sigma^{1}\}$ , and $\rho(A_{3}A_{1})^{1/2}$ equals $\sqrt{3+\sqrt{8}}=1+\sqrt{2}$ as well, our conjecture is confirmed. Theorem 10.4 now yields that $\Phi$ , and thus $Q_{E}$ , has Hölder best exponent $\log(3)/(2\log(1+\sqrt{2}))$ , in agreement with [7, Theorem 2].

Bibliography47

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] J. Aaronson. An introduction to infinite ergodic theory , volume 50 of Mathematical Surveys and Monographs . American Mathematical Society, Providence, RI, 1997.
2[2] J. Aaronson and M. Denker. The Poincaré series of 𝐂 \ 𝐙 \ 𝐂 𝐙 \mathbf{C}\backslash\mathbf{Z} . Ergodic Theory Dynam. Systems , 19(1):1–20, 1999.
3[3] R. C. Alperin. The modular tree of Pythagoras. Amer. Math. Monthly , 112(9):807–816, 2005.
4[4] F. J. M. Barning. On Pythagorean and quasi-Pythagorean triangles and a generation process with the help of unimodular matrices. Math. Centrum Amsterdam Afd. Zuivere Wisk. , 1963(ZW-011):37, 1963.
5[5] M. A. Berger and Y. Wang. Bounded semigroups of matrices. Linear Algebra Appl. , 166:21–27, 1992.
6[6] B. Berggren. Pytagoreiska trianglar. Tidskr. Elementär Mat. Fys. Kemi , 17:129–139, 1934.
7[7] F. P. Boca and C. Linden. On Minkowski type question mark functions associated with even or odd continued fractions. Monatsh. Math. , 187(1):35–57, 2018.
8[8] A. I. Borevich and I. R. Shafarevich. Number theory . Academic Press, 1966.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Billiards on pythagorean triples

Abstract.

1. Introduction

2. Notation and preliminaries

Lemma 2.1**.**

Proof.

Convention 2.2**.**

Notation 2.3**.**

Theorem 2.4**.**

Proof.

3. Pythagorean triples and the Romik map

4. The de Sitter space

Theorem 4.1**.**

Proof.

5. Circle intervals

Lemma 5.1**.**

Proof.

Remark 5.2**.**

Theorem 5.3**.**

Proof.

Definition 5.4**.**

Theorem 5.5**.**

Proof.

6. Billiard maps

Definition 6.1**.**

Definition 6.2**.**

Example 6.3**.**

Definition 6.4**.**

Lemma 6.5**.**

Remark 6.6**.**

Proof of Lemma 6.5.

7. Natural extension and invariant measures

Theorem 7.1**.**

Proof.

Theorem 7.2**.**

Proof.

8. The Lagrange theorem

Theorem 8.1**.**

Proof.

Claim

Proof of Claim

Lemma 8.2**.**

Proof.

Theorem 8.3**.**

Proof.

Example 8.4**.**

9. Minkowski functions

Example 9.1**.**

Theorem 9.2**.**

Lemma 9.3**.**

Proof.

Proof of Theorem 9.2.

Theorem 9.4**.**

Proof.

Example 9.5**.**

10. Singularity and Hölder exponent

Theorem 10.1**.**

Lemma 10.2**.**

Proof.

Proof of Theorem 10.1.

Definition 10.3**.**

Theorem 10.4**.**

Proof.

Theorem 10.5**.**

Proof.

Example 10.6**.**

Lemma 2.1.

Convention 2.2.

Notation 2.3.

Theorem 2.4.

Theorem 4.1.

Lemma 5.1.

Remark 5.2.

Theorem 5.3.

Definition 5.4.

Theorem 5.5.

Definition 6.1.

Definition 6.2.

Example 6.3.

Definition 6.4.

Lemma 6.5.

Remark 6.6.

Theorem 7.1.

Theorem 7.2.

Theorem 8.1.

Lemma 8.2.

Theorem 8.3.

Example 8.4.

Example 9.1.

Theorem 9.2.

Lemma 9.3.

Theorem 9.4.

Example 9.5.

Theorem 10.1.

Lemma 10.2.

Definition 10.3.

Theorem 10.4.

Theorem 10.5.

Example 10.6.