Phylogenetic complexity of the Kimura 3-parameter model

Mateusz Micha{\l}ek; Emanuele Ventura

arXiv:1704.02584·math.CO·April 11, 2017

Phylogenetic complexity of the Kimura 3-parameter model

Mateusz Micha{\l}ek, Emanuele Ventura

PDF

TL;DR

This paper proves that the algebraic ideals of the Kimura 3-parameter phylogenetic model are generated in degree four, confirming a longstanding conjecture in algebraic statistics.

Contribution

It establishes that the ideals for this model are generated in degree four, resolving a key conjecture by Sturmfels and Sullivant.

Findings

01

Ideals are generated in degree four

02

Confirmed a conjecture by Sturmfels and Sullivant

03

Advances understanding of algebraic structure of phylogenetic models

Abstract

In algebraic statistics, the Kimura 3-parameter model is one of the most interesting and classical phylogenetic models. We prove that the ideals associated to this model are generated in degree four, confirming a conjecture by Sturmfels and Sullivant.

Equations74

P(M,{\bf s})=\sum_{\begin{subarray}{c}f:\mathcal{V}\rightarrow S\\ f_{|\mathcal{L}}={\bf s}\end{subarray}}\prod_{(v_{1},v_{2})\in\mathcal{E}}\Big{(}M((v_{1},v_{2}))\Big{)}_{(f(v_{1}),f(v_{2}))},

P(M,{\bf s})=\sum_{\begin{subarray}{c}f:\mathcal{V}\rightarrow S\\ f_{|\mathcal{L}}={\bf s}\end{subarray}}\prod_{(v_{1},v_{2})\in\mathcal{E}}\Big{(}M((v_{1},v_{2}))\Big{)}_{(f(v_{1}),f(v_{2}))},

Ψ : Rep (T) ∋ M \to s : L \to S \sum P (M, s) l \in L ⨂ l_{s (l)} \in l \in L ⨂ V_{l} .

Ψ : Rep (T) ∋ M \to s : L \to S \sum P (M, s) l \in L ⨂ l_{s (l)} \in l \in L ⨂ V_{l} .

T = α 0 γ \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T = α 0 γ \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

\tilde{T} = 0 γ α \dots 0 α β \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

\tilde{T} = 0 γ α \dots 0 α β \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

α α + 0 β + γ 0 = 00 + γ α + α β .

α α + 0 β + γ 0 = 00 + γ α + α β .

r_{0} = (g_{1} + a_{1}, \dots, g_{n} + a_{n}) \mbox an d r_{1} = (g_{1}, \dots, g_{n}) .

r_{0} = (g_{1} + a_{1}, \dots, g_{n} + a_{n}) \mbox an d r_{1} = (g_{1}, \dots, g_{n}) .

ϕ_{g} : Z_{2} \times Z_{2} \to Z_{2},

ϕ_{g} : Z_{2} \times Z_{2} \to Z_{2},

T_{0} = α 0 x \dots β 0 y \dots γ γ 0 \dots 0 α z \dots 0 β w \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} = α 0 x \dots β 0 y \dots γ γ 0 \dots 0 α z \dots 0 β w \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 y \dots α x 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α w \dots 0 z α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 y \dots α x 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α w \dots 0 z α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

\noindent Case I : x = β, y = α, z = β, w = 0; \noindent Case II : x = β, y = α, z = β, w = β; \noindent Case III : x = β, y = β, z = β, w = β .

\noindent Case I : x = β, y = α, z = β, w = 0; \noindent Case II : x = β, y = α, z = β, w = β; \noindent Case III : x = β, y = β, z = β, w = β .

T_{0} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} = α 0 β \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} = α 0 β \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α \dots \dots 0 γ \dots \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α \dots \dots 0 γ \dots \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α γ \dots 00 α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α γ \dots 00 α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α β \dots 00 α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α β \dots 00 α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 β \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α γ \dots 0 γ α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 β \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α γ \dots 0 γ α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 β \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α 0 \dots 00 α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 β \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α 0 \dots 00 α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α 0 \dots 00 α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T_{0} - T_{1} = α 0 α \dots α β 0 \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots - 0 α 0 \dots 00 α \dots 0 \dots \dots \dots \dots \dots \dots \dots 0 \dots \dots \dots .

T = q x z \dots q y x \dots \dots \dots \dots \dots,

T = q x z \dots q y x \dots \dots \dots \dots \dots,

N (t) = t^{15} + 1005 t^{14} + 230763 t^{13} + 11423223 t^{12} + 197336781 t^{11} + 1476133641 t^{10} + 5369113631 t^{9} + 10097960379 t^{8} + 10077653595 t^{7} + 5323111487 t^{6} + 1442513865 t^{5} + 187603341 t^{4} + 10384023 t^{3} + 198795 t^{2} + 1005 t + 1.

N (t) = t^{15} + 1005 t^{14} + 230763 t^{13} + 11423223 t^{12} + 197336781 t^{11} + 1476133641 t^{10} + 5369113631 t^{9} + 10097960379 t^{8} + 10077653595 t^{7} + 5323111487 t^{6} + 1442513865 t^{5} + 187603341 t^{4} + 10384023 t^{3} + 198795 t^{2} + 1005 t + 1.

H (t) = \frac{22261501}{4168212048000} t^{18} + \frac{799045380}{4168212048000} t^{17} + \frac{13381457673}{4168212048000} t^{16} + \frac{138721353336}{4168212048000} t^{15}

H (t) = \frac{22261501}{4168212048000} t^{18} + \frac{799045380}{4168212048000} t^{17} + \frac{13381457673}{4168212048000} t^{16} + \frac{138721353336}{4168212048000} t^{15}

+ \frac{995839168812}{4168212048000} t^{14} + \frac{5247736051320}{4168212048000} t^{13} + \frac{21011354421226}{4168212048000} t^{12} + \frac{65366574541632}{4168212048000} t^{11}

+ \frac{995839168812}{4168212048000} t^{14} + \frac{5247736051320}{4168212048000} t^{13} + \frac{21011354421226}{4168212048000} t^{12} + \frac{65366574541632}{4168212048000} t^{11}

+ \frac{160636901283573}{4168212048000} t^{10} + \frac{316408365264420}{4168212048000} t^{9} + \frac{507035368484229}{4168212048000} t^{8} + \frac{671227146881928}{4168212048000} t^{7}

+ \frac{160636901283573}{4168212048000} t^{10} + \frac{316408365264420}{4168212048000} t^{9} + \frac{507035368484229}{4168212048000} t^{8} + \frac{671227146881928}{4168212048000} t^{7}

+ \frac{744003206327314}{4168212048000} t^{6} + \frac{695859081785280}{4168212048000} t^{5} + \frac{545170528162872}{4168212048000} t^{4} + \frac{340981469563104}{4168212048000} t^{3}

+ \frac{744003206327314}{4168212048000} t^{6} + \frac{695859081785280}{4168212048000} t^{5} + \frac{545170528162872}{4168212048000} t^{4} + \frac{340981469563104}{4168212048000} t^{3}

+ \frac{151089754960800}{4168212048000} t^{2} + \frac{38894674089600}{4168212048000} t + 1.

+ \frac{151089754960800}{4168212048000} t^{2} + \frac{38894674089600}{4168212048000} t + 1.

\tilde{N} (t) = t^{13} + 1007 t^{12} + 107752 t^{11} + + 2813176 t^{10} + 26622909 t^{9} + 109147219 t^{8} + 211160560 t^{7} + 199302992 t^{6} + 91202787 t^{5} + 19336749 t^{4} + 1724040 t^{3} + 54360 t^{2} + 495 t + 1.

\tilde{N} (t) = t^{13} + 1007 t^{12} + 107752 t^{11} + + 2813176 t^{10} + 26622909 t^{9} + 109147219 t^{8} + 211160560 t^{7} + 199302992 t^{6} + 91202787 t^{5} + 19336749 t^{4} + 1724040 t^{3} + 54360 t^{2} + 495 t + 1.

\tilde{N}^{'} (t) = 3 t^{13} + 2253 t^{12} + 211288 t^{11} + + 5060488 t^{10} + 44891401 t^{9} + 174437831 t^{8} + 321990512 t^{7} + 291183248 t^{6} + 127959653 t^{5} + 26052683 t^{4} + 2223560 t^{3} + 66520 t^{2} + 559 t + 1.

\tilde{N}^{'} (t) = 3 t^{13} + 2253 t^{12} + 211288 t^{11} + + 5060488 t^{10} + 44891401 t^{9} + 174437831 t^{8} + 321990512 t^{7} + 291183248 t^{6} + 127959653 t^{5} + 26052683 t^{4} + 2223560 t^{3} + 66520 t^{2} + 559 t + 1.

T_{0} - T_{1} = α 0 \dots \dots α β \dots \dots 0 β \dots \dots \dots \dots \dots \dots 00 γ \dots 00 γ \dots - 0 α \dots \dots 0 β \dots \dots 00 \dots \dots \dots \dots \dots \dots 00 \dots \dots 00 \dots \dots .

T_{0} - T_{1} = α 0 \dots \dots α β \dots \dots 0 β \dots \dots \dots \dots \dots \dots 00 γ \dots 00 γ \dots - 0 α \dots \dots 0 β \dots \dots 00 \dots \dots \dots \dots \dots \dots 00 \dots \dots 00 \dots \dots .

000 + β β α + β γ β = 0 β β + β γ 0 + β 0 α .

000 + β β α + β γ β = 0 β β + β γ 0 + β 0 α .

T_{0} - T_{1} = α 0 \dots \dots α β \dots \dots 0 α \dots \dots 0 γ \dots \dots 0 x \dots \dots \dots \dots \dots \dots 00 γ \dots 00 γ \dots - 0 α \dots \dots 0 β \dots \dots 00 \dots \dots 0 y \dots \dots 0 z \dots \dots \dots \dots \dots \dots 00 \dots \dots 00 \dots \dots .

T_{0} - T_{1} = α 0 \dots \dots α β \dots \dots 0 α \dots \dots 0 γ \dots \dots 0 x \dots \dots \dots \dots \dots \dots 00 γ \dots 00 γ \dots - 0 α \dots \dots 0 β \dots \dots 00 \dots \dots 0 y \dots \dots 0 z \dots \dots \dots \dots \dots \dots 00 \dots \dots 00 \dots \dots .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Phylogenetic complexity of the Kimura $3$ -parameter model

Mateusz Michałek and Emanuele Ventura

Abstract

In algebraic statistics, the Kimura $3$ -parameter model is one of the most interesting and classical phylogenetic models. We prove that the ideals associated to this model are generated in degree four, confirming a conjecture by Sturmfels and Sullivant.

2010 Mathematics Subject Classification. Primary 52B20, Secondary 14M25, 13P25

1 Introduction

The part of computational biology that models evolution and describes mutations in this process is called phylogenetics [40]. This is a fertile subject witnessing many connections to several parts of mathematics such as algebraic geometry [8, 23], combinatorics [4, 15, 34], and representation theory [9, 31]. The methods used in this context of research are powerful and do not only apply to biology, but are employed in several other fields [2] such as modeling changes of words in languages [21], literary studies [3] or linguistics itself [37] with ideas going back to Darwin [14].

A crucial object in phylogenetics is a tree model, which is a parametric family of probability distributions. It consists of a tree $\mathcal{T}$ , a finite set of states $S$ and a family $\mathcal{M}$ of transition matrices, usually given by a linear subspaces of all $|S|\times|S|$ matrices. The case of particular interest is when $S=\{\textnormal{A,C,G,T}\}$ , where the basis elements correspond to the four nucleobases of DNA: adenine (A), cytosine (C), guanine (G), and thymine (T).

The models for which $\mathcal{M}$ is a proper subspace of matrices reflect some symmetries among elements of $S$ . These symmetries are usually encoded by the action of a finite group $G$ on $S$ . In these terms, $\mathcal{M}$ can be regarded as the space of $G$ -invariant matrices or tensors. Such models constitute a class of interest and they are called equivariant [18]. If $G$ is the trivial group, we obtain the general Markov model, corresponding, on the algebraic geometry side, to secant varieties of Segre products. When the elements of $S$ can be identified with those of $G$ , the model is called group-based. Henceforth we assume $G$ to be abelian.

The simplest among the equivariant, and group-based, models is the Cavender-Farris-Neyman model. This is the instance for $S=G=\mathbb{Z}_{2}$ , the group with two elements. A good understanding of this model from the algebraic geometry point of view has led to tremendous advances in this field. Sturmfels and Sullivant [41, Theorem 28] showed that the algebraic varieties arising from it are defined by quadrics. Additionally, Buczyńska and Wiśniewski described many of its remarkable algebro-geometric properties [8]. Consequently, Sturmfels and Xu [44], and Manon [32] described the connections of the model to toric degenerations of moduli spaces of rank two vector bundles on marked curves of fixed genus. For more relations to conformal field theory, we refer to [29, 31].

The Cavender-Farris-Neyman model is the simplest among the hyperbinary models [6, Section 3], that are given by $S=G=(\mathbb{Z}_{2})^{n}$ . The most biologically meaningful example of those is the Kimura $3$ -parameter model; this corresponds to $n=2$ . In this case, $S=\{\textnormal{A,C,G,T}\}$ , and, moreover, the action of $G$ reflects the pairing between purines (A,G) and pyrimidines (C,T). This model was introduced by Kimura [28] much before the setting above was developed. Using numerical experiments, Sturmfels and Sullivant conjectured that the ideals of the algebraic varieties associated to this model are generated by polynomials of degree at most four [41, Conjecture 30]. The confirmation of this conjecture is the main result of the present article. For any group $G$ , Sturmfels and Sullivant defined the phylogenetic complexity $\phi(G)$ of $G$ .

Definition 1.1 (Phylogenetic complexity [41]).

Let $K_{1,n}$ be the star with $n$ leaves, and $X(G,K_{1,n})$ the variety associated to the group-based model. Let $\phi(G,K_{1,n})$ be the maximal degree of a generator in a minimal generating set of the ideal $I(X(G,K_{1,n}))$ . The phylogenetic complexity $\phi(G)$ of $G$ is $\sup_{n\in\mathbb{N}}\{\phi(G,K_{1,n})\}$ .

In [35], it was shown that for any abelian group $G$ , its phylogenetic complexity $\phi(G)$ is finite. The main contribution of this article is a more detailed study of the phylogenetic complexity of $G=\mathbb{Z}_{2}\times\mathbb{Z}_{2}$ .

Main Theorem.

The phylogenetic complexity of the Kimura $3$ -parameter model $\phi(\mathbb{Z}_{2}\times\mathbb{Z}_{2})$ equals four.

For more interesting results on the Kimura $3$ -parameter model we refer to [9, 10, 11, 30].

Algebraic varieties associated to a model.

We recall the explicit construction of the algebraic variety associated to a model. It is the Zariski closure of the locus of all probability distributions on the states of leaves allowed in the model.

A representation of a model on a tree $\mathcal{T}$ is an association $\mathcal{E}\rightarrow\mathcal{M}$ of transition matrices to edges $\mathcal{E}$ of $\mathcal{T}$ . The set of all representations is denoted by $\operatorname{Rep}(\mathcal{T})$ . (Here we do not mention the root distribution, since it does not affect the family of probability distributions we obtain.) To each vertex $v$ of $\mathcal{T}$ we associate an $|S|$ dimensional vector space $V_{v}$ with basis $(v_{s})_{s\in S}$ . We may regard an element of $\mathcal{M}$ associated to an edge $(v_{1},v_{2})=e\in\mathcal{E}$ as an element of the tensor product $V_{v_{1}}\otimes V_{v_{2}}$ . We fix a representation $M\in\operatorname{Rep}(\mathcal{T})$ and an association ${\bf s}:\mathcal{L}\rightarrow S$ . Here $\mathcal{L}$ is the set of leaves, i.e. vertices of degree one, of $\mathcal{T}$ . Following the usual Markov rule, we may compute the probability of $\bf s$ :

[TABLE]

where $\mathcal{V}$ is the set of vertices of $\mathcal{T}$ . We may identify ${\bf s}$ with a basis element $\bigotimes_{l\in\mathcal{L}}l_{{\bf s}(l)}$ of $\bigotimes_{l\in\mathcal{L}}V_{l}$ . This provides the map:

[TABLE]

The image of this map is the family of probability distributions described by the model and its Zariski closure is the algebraic variety that represents the model. For group-based models, we denote this variety $X(G,\mathcal{T})$ , where $G$ is the group defining the model and $\mathcal{T}$ is the tree as above.

Earlier contributions.

Our proof of the main theorem relies on previous results by many authors that we now recall.

The first fundamental tool is the Discrete Fourier Transform. This is a linear change of coordinates, based on the representation theory of $G$ . For special cases in phylogenetics, it was first used by Hendy and Penny [26], and by Erdös, Székely, and Steel [42]. In higher generality, it is treated in [33, 41]. For group-based models, the DFT turns $\Psi$ into a monomial map, proving that the associated algebraic variety $X(G,\mathcal{T})$ is a toric variety. This translates the classical algebraic problem of finding defining equations of a variety into a combinatorial one. For more information about toric methods we refer to [12, 25, 43].

Another key result is the reduction from arbitrary trees to the so-called stars or claw-trees $K_{1,n}$ , i.e., trees with one inner vertex and $n$ leaves. The general procedure for group-based models to obtain ideals arising from arbitrary trees, knowing the ideals for $K_{1,n}$ , was discovered in [41]. Again, this turned out to be very influential, leading, on one hand, to the general constructions of toric fiber products [31, 45], and, on the other, to generalizations for equivariant models [18].

Combinatorial and computational methods in toric geometry are very well developed. As a starting point in our article we need to compute algebraic invariants of toric varieties embedded in very high dimensional ambient spaces. Here the computer algebra packages Normaliz [7], 4ti2 [47], along with previous computational results from [16] and [41] are used. In particular, Castenluovo-Mumford regularity plays a crucial role in the proof for $n=6$ . These classical invariants are briefly discussed in the Appendix 4, for the sake of completeness.

This work may be also seen in the framework of the stabilisation of equations of a family of algebraic varieties. Indeed, our proof not only bounds the degrees of the generators, but in principle provides an inductive procedure to obtain all generators in case of $K_{1,n+1}$ , assuming the generators for $K_{1,n}$ to be known. Finding equations of an infinite sequence of algebraic varieties, that come naturally in families, is an interesting current theme of research. This usually involves classical varieties such as secants of Segre varieties [19] and Grassmannians [20]. Indeed, the main result of Draisma and Eggermont in [17] shows that for equivariant models the associated algebraic variety can always be defined set-theoretically in some bounded degree, once $G$ and $S$ are both fixed. The fact that $\phi(G)$ is finite constitutes the main result of [35]. Recently, another ideal-theoretic result was proved by Sam [38] showing that the ideal of $k$ th secant variety of $d$ th Veronese embeddings is generated in bounded degree that is independent of $d$ . Interestingly, the ideal-theoretic generation in bounded degree for secants of Segre varieties and Grassmannians are still central open problems. Finiteness issues are strongly connected with the theory of twisted commutative algebras and $\Delta$ -modules by Sam and Snowden [39], and the theory of noetherianity by Draisma and Kuttler [19], Hillar and Sullivant [27], and others.

Apart from beautiful results of existence, that are quite often non-constructive or very far from optimal, it is of interest finding an explicit description of phylogenetic algebraic varieties. One of the most well-known examples is the salmon conjecture [1], since the prize offered by Allman for the hypothetical solver would be a smoked Copper river salmon. It asks for the description of $\sigma_{4}(\mathbb{P}^{3}\times\mathbb{P}^{3}\times\mathbb{P}^{3})$ , the algebraic variety representing the general Markov model for $|S|=4$ and $\mathcal{T}=K_{1,3}$ . The generators of the ideal are still unknown, however a set-theoretic description was found by Friedland and Gross [24]. More recently, Daleo and Hauenstein [13] gave a numerical proof of the salmon conjecture.

As far as we know, our result is the only ideal-theoretic description, apart from the Jukes-Cantor model, where $|S|=4$ and $\mathcal{T}$ is an arbitrary tree.

Plan of the article.

The whole article is devoted solely to the proof of the main theorem. In Section 2 we introduce the notation that is used throughout the proof. As the proof consists of several parts, some of them very technical, we present the overview of its structure in Section 3.1. The main result is established in Sections 3.2 and 3.3.

2 Preliminaries and notation

In this section we collect all the notation and terminology we will use in the rest of the paper. We divide this section into paragraphs to facilitate the reading.

**Groups.

**Henceforth we set $G=\mathbb{Z}_{2}\times\mathbb{Z}_{2}$ , unless otherwise stated. We denote the elements of $G$ by $0,\alpha,\beta$ , and $\gamma$ . To denote unknown elements of $G$ , we use letters $g,x,y,w,p,q\ldots$ We also refer to an unknown element, that is not relevant in a specific argument, with question mark “?”.

Apart from $G$ , the most natural groups that enter the picture are the symmetric group on $n$ leaves $\mathfrak{S}_{n}$ , the group of flows $\mathfrak{G}$ , and the automorphism group $\textnormal{Aut}(G)$ . The group of flows is the following.

Definition 2.1 (Group of flows).

Let $G$ be a abelian group and $n\in\mathbb{N}$ . The set of flows $\mathfrak{G}=\left\{(g_{1},\ldots,g_{n})|\sum g_{i}=0\right\}$ of length $n$ of $G$ forms a group under the componentwise group operation. It is non-canonically isomorphic to the group $G^{n-1}$ , the direct product of $n-1$ copies of $G$ .

The automorphism group of $G$ , $\textnormal{Aut}(G)\cong\mathfrak{S}_{3}$ , is the group of bijective group homomorphisms from $G$ to itself. The automorphism of $G$ specified by $\alpha\mapsto\alpha,\beta\mapsto\gamma,\gamma\mapsto\beta$ is simply denoted by $\beta\leftrightarrow\gamma$ ; similarly for all the other automorphisms of $G$ having a non-trivial fixed element.

**The toric variety $X(G,K_{1,n})$ .

**For any abelian group $G$ , the variety $X(G,K_{1,n})$ is a projective toric variety of dimension $n(|G|-1)$ living in $\mathbb{P}^{|G|^{n-1}-1}$ , where the projective coordinates are in bijection with flows [33].

Let us recall here its corresponding polytope. Let $M\cong\mathbb{Z}^{|G|}$ be the lattice whose basis corresponds to the elements of $G$ . Consider $M^{n}$ with the basis $e_{(i,g)}$ indexed by pairs $(i,g)\in[n]\times G$ . We define a map of sets from the group of flows to the lattice, $\psi:\mathfrak{G}\rightarrow M^{n}$ , by $\psi((g_{1},\dots,g_{n}))=\sum_{i=1}^{n}e_{(i,g_{i})}$ . The vertices of the polytope of $X(G,K_{1,n})$ are the images of the flows under the injective map $\psi$ .

Remark 2.2.

The family of varieties $X(G,K_{1,n})$ has a wealth of symmetries; the group $\mathfrak{S}_{n}$ , the group of flows $\mathfrak{G}$ , and the automorphism group $\textnormal{Aut}(G)$ all act on the ideals of these varieties.

**Binomials, tables, and moves. **

Ideals of toric varieties are binomial prime ideals. Thus they admit a minimal generating set of binomials. Binomials may be identified with a pair of tables of the same size, $T_{0}$ and $T_{1}$ , of elements of $G$ , regarded up to row permutation; this is another natural group in this setting which we implicitly take into account. Indeed, a binomial is a pair of monomials and the variables correspond to rows. Given the number of leaves $n$ , coordinates are in bijection with flows of length $n$ of $G$ . Hence rows are identified with flows of $n$ elements in $G$ . Columns are in bijection with the $n$ leaves. From the definition of the toric ideals $I(X(G,K_{1,n}))$ [41], it follows that a binomial belongs to $I(X(G,K_{1,n}))$ if and only if the two tables representing it are compatible, i.e., for each $i$ , the $i$ th column of $T_{0}$ and the $i$ th column of $T_{1}$ are equal as multisets. We index the columns of a given pair of tables $T_{0},T_{1}$ , with $n$ columns, by integers $1\leq i\leq n$ . We refer to the element in the $i$ th column of row $r$ as $r(i)$ .

Let $T$ be any table of elements of $G$ . The procedure consisting of selecting a subset of rows in $T$ of cardinality at most $d$ , and replacing it with a compatible set of rows is a move of degree $d$ . A binomial, represented by a pair of tables $T_{0},T_{1}$ of elements of $G$ , is generated by binomials of degree at most $d$ if and only if there exists a finite sequence of moves of degree $d$ applied to $T_{0}$ or $T_{1}$ that transform $T_{0}$ into $T_{1}$ .

Example 2.3.

Let $T$ be the table

[TABLE]

The table $T$ can be transformed by a move of degree three into the table

[TABLE]

Indeed, the set of the first three rows of $T$ is compatible with the set of the first three rows of $\tilde{T}$ . Note that if the rows in $T$ are flows, then the rows of $\tilde{T}$ are flows as well. The move described above is denoted by

[TABLE]

Remark 2.4.

In the notation for moves, we do not use the indices of the columns involved in the move. Instead, the indices are always clear from the move itself. For instance, the move in Example 2.3 is in columns $1,2$ . Also, note that, in general, the columns used for a move do not need to be consecutive.

Remark 2.5.

The groups $\mathfrak{S}_{n}$ , the group of flows $\mathfrak{G}$ , and the automorphism group $\textnormal{Aut}(G)$ act on the equations of $X(G,K_{1,n})$ , and hence on the tables. The group $\mathfrak{S}_{n}$ acts permuting the columns of the pair of tables corresponding to a binomial in the ideal of the variety. The groups $\mathfrak{G}$ and $\textnormal{Aut}(G)$ act on the entries of the tables in the natural way, i.e., by evaluation.

We now introduce one of the most important concepts for our approach. Given a pair of flows, we define a distance between them, which will enable us to use an inductive procedure on tables. The distance we consider is the classical Hamming distance between two words.

Definition 2.6 (Hamming distance).

Let $r_{0}$ and $r_{1}$ be two flows in $\mathfrak{G}$ :

[TABLE]

Let $I=\{\ell\in[n]|a_{\ell}\neq 0\}$ and $J=\{\ell\in[n]|a_{\ell}=0\}$ . The multiset $\{a_{\ell}\}_{\ell\in I}$ constitutes the * disagreement string $a_{\ell_{1}}\ldots a_{\ell_{|I|}}$ of the pair of flows $r_{0}$ and $r_{1}$ . The cardinality $|I|$ is the Hamming distance between $r_{0}$ and $r_{1}$ . The multiset $\{a_{\ell}\}_{\ell\in J}$ constitutes their agreement string. Up to the action of the group of flows $\mathfrak{G}$ on both flows, we may assume that the group elements $g_{i}=0$ for all $i$ .*

Remark 2.7 (Tables and Hamming distance).

Given a pair of tables $T_{0},T_{1}$ , we “compare” them using the notion of Hamming distance as follows. Since the tables come with undistinguishable rows, we may choose as first rows of $T_{0}$ and $T_{1}$ two rows that minimize the Hamming distance among all the pairs of rows from $T_{0}$ and $T_{1}$ . After fixing the first row in $T_{0}$ and in $T_{1}$ , as described in Section 3.1, one of the techniques adopted in Sections 3.2 and 3.3 is as follows. With moves of degree at most four, we create another pair of rows with strictly smaller Hamming distance than the initial one.

Counting functions.

We will make use of counting functions on the tables $T_{0}$ and $T_{1}$ . A counting function $f$ on the columns of $T_{0}$ has the same values as counting function on the columns of $T_{1}$ , since the pairs of tables we are interested in are compatible, i.e., columnwise they are the same as multisets. Given $x\in G$ , we denote by $x_{i_{1}\ldots i_{k}}$ the number of copies $x\in G$ appearing in the columns $i_{1},\ldots,i_{k}$ in $T_{0}$ , or in $T_{1}$ .

Example 2.8.

The function $\alpha_{12}-2\cdot 0_{3}$ counts the number of copies of $\alpha$ in columns $1$ and $2$ minus two times the number of copies of [math] in column $3$ .

From an algebraic point of view, a counting function defines a grading of the variables, that is a specialization of the multi-grading. Thus the fact that the counting function gives the same value on two tables is equivalent to the fact that the two corresponding monomials have the same degree with respect to the induced grading. Additionally, from the perspective of toric geometry, the counting function is induced by restricting the torus action to a special one-parameter subgroup.

Group homomorphisms.

We will make use of group homomorphisms in order to do counting arguments in a given pair of tables. We denote

[TABLE]

the group homomorphism given by the quotient map sending each element $x\in G$ to its class modulo the subgroup generated by the element $g\in G$ .

3 Complexity of the Kimura $3$ -parameter model

The aim of this section is to establish the phylogenetic complexity of the Kimura $3$ -parameter model. In Section 3.1, we discuss the structure of the proof, postponing the technical part of it to Sections 3.2 and 3.3.

3.1 Main result and structure of the proof

We proceed presenting our main result along with the outline of the plan of the proof strategy.

Theorem 3.1.

The phylogenetic complexity of the Kimura $3$ -parameter model $\phi(\mathbb{Z}_{2}\times\mathbb{Z}_{2})$ equals four.

The structure of the proof is presented in Figure 1. Our proof is an induction on the number of leaves $n$ , i.e., the number of columns of the tables. The base of our induction is $n=3$ . The case of $n\leq 5$ leaves has been studied computationally. More precisely, for $n=3$ the result is presented in [41] and for $n=4$ it is computed in [16]. For $n=5$ we used the program featured in [16] to produce the vertices of the polytope. The computer algebra program 4ti2 [47] specialized for toric ideals was able to compute the Markov basis using a server equipped with a CPU 4 Intel-Xeon E7-8837/32 cores/2.67GHz and a memory of 1024Gb RAM.

Proposition 3.2.

The ideal $I(X(G,K_{1,5}))$ is minimally generated by $22240$ polynomials of degree at most four: $12960$ quadrics, $2560$ cubics, and $6720$ quartics.

The case $n=6$ is treated in Section 3.3.3. Methods similar to the general case $n\geq 7$ and bounds on Castelnuovo-Mumford regularity obtained using Normaliz [7] allow us to reduce the problem to a computation handled with 4ti2. From the computational point of view, it is interesting to note that we were not able to address the case $n=6$ only with computational tools. Based on our experiments with 4ti2, we expect the computation to be not feasible: it would run for several years on a server of the same capability as the one mentioned above, and a memory of 1Tb RAM would not be sufficient to finish the computation.

For $n\geq 7$ , we have an induction on the degree $d$ of the generators, i.e., the number of rows of the table. Inside a specific degree $d$ , we have an induction on the Hamming distance $k$ of two rows of the tables. The strategy in this inner induction on the Hamming distance $k$ is the following. Suppose we have a binomial generator of degree $d\geq 5$ . Hence, we have a pair of tables consisting of $d$ rows each and with $n\geq 7$ columns. Two rows have Hamming distance $k$ and we reduce it to $k=0$ ; in other words, the given pair of tables is transformed into a pair of tables that have an identical row. This is a binomial which is a product of a binomial of degree $d-1$ and a variable. By induction on $d$ , such a binomial can be generated in degree at most $4$ .

Hence the aim of the induction on the Hamming distance $k$ is to reduce it to $k=0$ . In order to achieve this, we address the case $k\geq 3$ into two separate propositions in Section 3.2; see Proposition 3.5 and Corollary 3.6, and Proposition 3.12. This reduces the proof to $k=2$ . Recall that there do not exist flows whose Hamming distance is $k=1$ , since they cannot disagree only in one entry.

We now discuss the strategy in case $k=2$ , the technical heart of the proof, which is tackled in Section 3.3. In spite of many symmetries, discussed in Section 2, there are several cases one has to consider: We identify ten cases, indexed by roman numerals, where the first two rows of the given pair of tables $T_{0},T_{1}$ have a disagreement string of length $k=2$ . Here we provide a uniform proof for three crucial cases: Case I, II, and III. As we show them simultaneously with the very same techniques, we refer to those as the “main case”. The rest of the cases is treated by reducing them to the main case.

For the proof in the main case, we look at the second rows of each of the tables $T_{0}$ and $T_{1}$ . Let $\ell$ denote the length of the disagreement string between those two, in columns not involving the first two. By Corollary 3.6, we are able to assume $\ell\leq 3$ and, since $n\geq 7$ , the length of the agreement string between the second row of $T_{0}$ and the second row of $T_{1}$ , outside columns $1$ and $2$ , is at least $n-5\geq 2$ . Since the columns are indistinguishable up to the action of $\mathfrak{S}_{n}$ , we may assume that the columns $n-1$ and $n$ are involved in the agreement string. Now the aim is to reduce to the situation in which no row has two nonzero entries in the columns $n-1$ and $n$ : employing moves of degree at most four, we would like to eliminate all the strings which have nonzero entries on both columns $n-1$ and $n$ . We call such strings bad pairs.

Definition 3.3 (Bad pairs).

A bad pair is a string $xy$ , where the elements $x,y\in G$ are such that:

(i)

they are both nonzero; 2. (ii)

$x$ * is in column $n-1$ and $y$ is in column $n$ .*

We now show that eliminating all the bad pairs we fall back to the case of $n-1$ leaves, which allows us to conclude, by the outermost induction.

Theorem 3.4.

Suppose that a pair of compatible tables $T_{0},T_{1}$ with $n\geq 7$ columns do not contain rows with bad pairs. Then the corresponding binomial is generated in degree at most $\phi(G,K_{1,n-1})$ .

Proof.

The assumption implies that for every row $r$ of $T_{0}$ and $T_{1}$ we have either $r(n-1)=0$ or $r(n)=0$ . Summing up the columns $n-1$ and $n$ , we obtain two tables $\tilde{T}_{0}$ and $\tilde{T}_{1}$ . The crucial observation is that $\tilde{T}_{0}$ and $\tilde{T}_{1}$ are compatible tables with $n-1$ columns. Hence they correspond to a binomial in $I(X(G,K_{1,n-1}))$ . This binomial is generated in degree at most $\phi(G,K_{1,n-1})$ by definition. This implies that $\tilde{T}_{0}$ and $\tilde{T}_{1}$ can be transformed into each other by a finite sequence of moves of degree at most $\phi(G,K_{1,n-1})$ . Each of these moves lifts to the tables $T_{0}$ and $T_{1}$ , transforming all their columns accordingly, except columns $n-1$ and $n$ . Here the moves permute the pairs of elements, where each pair is formed by the two elements in columns $n-1$ and $n$ , in a fixed row. These moves transform $T_{0},T_{1}$ into $\hat{T}_{0},\hat{T}_{1}$ . The latter need not be the same though; indeed, they may differ in columns $n-1$ and $n$ . As in the proof of [35, Theorem 3.12], we make quadratic moves to adjust the elements in columns $n-1$ and $n$ . These transform $\hat{T}_{0}$ into $\hat{T}_{1}$ . Hence the tables $T_{0},T_{1}$ are generated in degree at most $\phi(G,K_{1,n-1})$ . ∎

3.2 Reduction of Hamming distance $\geq$ 3

In this section, we start our reduction of the Hamming distance. More precisely, we assume the Hamming distance to be at least three and we prove that we can reduce it to two; the latter will be discussed in Section 3.3. We proceed analyzing the cases when the disagreement string is given by at least four entries.

Proposition 3.5.

The disagreement strings (i) $\alpha\alpha\alpha\alpha$ , (ii) $\alpha\alpha\beta\beta$ , (iii) $\alpha\alpha\beta\gamma$ , and (iv) $\alpha\alpha\alpha\beta$ can be reduced.

Proof.

(i). Consider the function $0_{1234}-\alpha_{1234}$ . By the action of the group of flows $\mathfrak{G}$ , we may assume that this counting function is nonpositive on both of the tables. Since the function is stricly positive in the first row of $T_{1}$ , there exists a row $r$ in $T_{1}$ where there are strictly more copies of $\alpha$ than copies of [math] in the columns $1,2,3,4$ . On the other hand, $r$ cannot contain $\alpha\alpha$ in two of the columns $1,2,3,4$ , since we would exchange those with the corresponding entries in the first row and this would decrease the Hamming distance. Thus $r$ has one copy of $\alpha$ and no copies of [math] in columns $1,2,3,4$ . If the row $r$ has both copies of $\beta$ and $\gamma$ , we would move the string $\alpha\beta\gamma$ to the first row of $T_{1}$ , reducing the Hamming distance. Whence we may assume that $r$ contains the string $\alpha\beta\beta\beta$ in columns $1,2,3,4$ . Notice that in columns $2,3,4$ of $T_{1}$ , there are no strings of the form $\alpha\alpha$ or $\gamma\gamma$ , otherwise quadratic moves would decrease the Hamming distance. Additionally, in columns $2,3,4$ there is no string of the form $\alpha\gamma$ ; for this we can apply in $T_{1}$ the cubic move $0000+\alpha\beta\beta\beta+?\alpha\gamma=\alpha\beta\gamma 0+0\alpha 0\beta+?0\beta$ . Now, we introduce the counting function $0_{234}+\beta_{234}-\alpha_{234}-\gamma_{234}$ on $T_{1}$ . By the previous discussion about the possible strings in columns $2,3,4$ , this function is at least one in every row of $T_{1}$ . Consequently, there exists a row $r^{\prime}$ in $T_{0}$ where this function is three. As a consequence, the row $r^{\prime}$ contains either the string $\beta\beta$ or $00$ . This would decrease the Hamming distance.

(ii). Consider the counting function $0_{1234}-\alpha_{12}-\beta_{34}$ . By the action of the group of flows $\mathfrak{G}$ , we may assume it is nonpositive on both of the tables. Since this function is strictly positive on the first row of $T_{1}$ , there exists a row $r$ in $T_{1}$ where the function is strictly negative. Note that on the row $r$ , one has $\alpha_{12},\beta_{34}\leq 1$ ; otherwise we would make a quadratic move, involving $r$ and the first row of $T_{1}$ , reducing the Hamming distance.

If in the row $r$ we have $\alpha_{12}=\beta_{34}=1$ , then $0_{1234}\leq 1$ , by the value of the counting function on $r$ . Hence in the row $r$ , there exists $\gamma$ , which allows us to make a quadratic move reducing the Hamming distance. Without loss of generality, we have $\alpha_{12}=1,\beta_{34}=0$ , and $0_{1234}=0$ . Thus the row $r$ contains either the string $\gamma\alpha\gamma\gamma$ or the string $\alpha\gamma\gamma\gamma$ . In both cases, we exchange $\gamma\gamma$ with the first row of $T_{1}$ and we act with the flow $(0,0,\gamma,\gamma)$ on $T_{0}$ producing $\alpha\alpha\alpha\alpha$ , which is (i).

(iii). Consider the function $0_{1234}-\alpha_{12}-\beta_{3}-\gamma_{4}$ . By the action of the group of flows $\mathfrak{G}$ , we may assume it is nonpositive on both of the tables. Therefore there exists a row $r$ in $T_{1}$ where the function is strictly positive. Note that on the row $r$ one has $\alpha_{12}\leq 1$ .

If in the row $r$ we have $\alpha_{12}=1$ and $\beta_{3}=1$ , then we may assume $r$ contains the string $\alpha x\beta y$ in columns $1,2,3,4$ . We have $x,y\neq\alpha,\beta,\gamma$ , as otherwise in each of these circumstances we would make a quadratic move between $r$ and the first row of $T_{1}$ , reducing the Hamming distance. Then the function is zero on $r$ , which is not possible by assumption. Analogously, we may conclude when $\alpha_{12}=1$ and $\gamma_{4}=1$ .

If in the row $r$ we have $\alpha_{12}=1,\beta_{3}=0$ , and $\gamma_{4}=0$ , then $0_{1234}=0$ . In this case we have $\alpha_{34}=0$ , because of a quadratic move between $r$ and the first row of $T_{1}$ . Hence the row $r$ contains the string $\alpha\gamma\beta$ in columns $1,3,4$ , which again would reduce the Hamming distance.

If in the row $r$ we have $\alpha_{12}=0$ , then either $\beta_{3}=1$ or $\gamma_{4}=1$ . If $\beta_{3}=\gamma_{4}=1$ , then in columns $1,2$ the row $r$ contains the string $00$ ; indeed we cannot have copies of $\alpha,\beta$ or $\gamma$ by quadratic moves with the first row of $T_{1}$ . This implies that the counting function $\alpha_{12}+\beta_{3}+\gamma_{4}-0_{1234}$ is zero on the row $r$ , which is not possible by the assumption. If in the row $r$ we have $\beta_{3}=0$ and $\gamma_{4}=1$ , then $0_{1234}=0$ . In the row $r$ we can now exclude all the possible elements in each column by quadratic moves, obtaining the string $\beta\alpha\gamma$ in columns $2,3,4$ . We exchange this string with the first row of $T_{1}$ , reducing the Hamming distance. Analogously, if in the row $r$ we have $\beta_{3}=1$ and $\gamma_{4}=0$ , we obtain $\gamma\beta\alpha$ in columns $2,3,4$ , and we conclude in the same way.

(iv). Consider the counting function $0_{1234}-\alpha_{123}-\beta_{4}$ . By the action of the group of flows $\mathfrak{G}$ , we may assume it is nonpositive on the tables. Therefore there exists a row $r$ in $T_{1}$ where the function is strictly negative. Thus on the row $r$ we have $0_{1234}\leq 1$ , as $\alpha_{123}\leq 1$ .

Suppose that in the row $r$ we have $0_{1234}=1$ . Then $\alpha_{123}=1$ and $\beta_{4}=1$ , by the assumption on the value of the counting function on $r$ . In two of the columns $1,2,3$ we cannot have $\alpha$ or $\beta$ by quadratic moves, involving $r$ and the first row of $T_{1}$ . Thus we have a copy of $\gamma$ ; we now make a quadratic move between $r$ and the first row of $T_{1}$ , which decrease the Hamming distance.

Suppose that in the row $r$ we have $0_{1234}=0$ . If in the row $r$ we have $\alpha_{123}=0$ , then $\beta_{4}=1$ . In columns $1,2,3$ we cannot have $\beta$ , as otherwise we would exchange the string $\beta\beta$ with the first row of $T_{1}$ , thus reducing the Hamming distance. Whence $r$ contains the string $\gamma\gamma\gamma\beta$ in columns $1,2,3,4$ . If in the row $r$ we have $\alpha_{123}=1$ , then $\beta_{4}=0$ . In this situation, by the same argument, $r$ contains the string $\alpha\gamma\gamma\gamma$ (or $\gamma\alpha\gamma\gamma$ or $\gamma\gamma\alpha\gamma$ ). We claim that having the string $\alpha\gamma\gamma\gamma$ can be reduced to the case of having the string $\gamma\gamma\gamma\beta$ up to quadratic moves and group automorphism. Indeed, suppose we have the string $\alpha\gamma\gamma\gamma$ in the row $r$ . We exchange $\gamma\gamma$ from $r$ with $00$ from the first row of $T_{1}$ in columns $2,3$ . We act with the flow $(0,\gamma,\gamma,0)$ on both tables and we transpose column $1$ and column $4$ . Now the row $r$ contains the string $\gamma\gamma\gamma\beta$ in columns $1,2,3,4$ .

By the previous discussion, it is enough to deal only with the string $\gamma\gamma\gamma\beta$ in $r$ . Consider the counting function $\alpha_{123}+\beta_{123}-\gamma_{123}-0_{123}$ . Note that this function has only odd values. We now show that the function cannot be positive on a row of $T_{1}$ . Indeed, assume there is a row $r^{\prime}$ where the function takes a positive value. Then the row $r^{\prime}$ contains either $\alpha\alpha$ , $\beta\beta$ or $\alpha\beta$ in columns $1,2,3$ . The first two cases are not possible, because we would exchange them with the string $\gamma\gamma$ in the row $r$ ; this would produce $\alpha\alpha$ or $\beta\beta$ in the row $r$ , which we would exchange with $00$ in the first row of $T_{1}$ . We are left with the possibility of $r^{\prime}$ having $\alpha\beta$ in columns $1,2,3$ . For this we apply in $T_{1}$ the cubic move $0000+\gamma\gamma\gamma\beta+?\alpha\beta=\gamma\gamma\beta\beta+0\alpha 00+?0\gamma$ .

In conclusion, the counting function $\alpha_{123}+\beta_{123}-\gamma_{123}-0_{123}$ is strictly negative on every row of $T_{1}$ . Since the value of this function on the first row of $T_{0}$ is $3$ , there exists a row $r^{\prime\prime}$ in $T_{0}$ on which the function is $-3$ . Thus in $r^{\prime\prime}$ we have either $00$ or $\gamma\gamma$ in columns $1,2,3$ . In this case, we would exchange them with the first row of $T_{0}$ reducing the Hamming distance. ∎

Corollary 3.6.

Suppose that a table $T$ contains two rows $r$ and $r^{\prime}$ having disagreement string of cardinality four. Then, using moves if degree at most three, $T$ can be transformed in such a way that the disagreement string has cardinality at most three. Moreover, only the four columns of the disagreement string are involved in the reduction.

Proof.

Assume two rows $r$ and $r^{\prime}$ do not agree on four elements. Up to the action of the group of flows $\mathfrak{G}$ and $\mathfrak{S}_{4}$ , the elements of $r$ in the disagreement string can be set to be $0000$ ; all the possibilities for the elements of $r^{\prime}$ in the disagreement string are $\alpha\alpha\alpha\alpha$ , $\alpha\alpha\beta\beta$ , $\alpha\alpha\beta\gamma$ , and $\alpha\alpha\alpha\beta$ . By Proposition 3.5, these disagreement strings can be reduced. Hence, performing the moves in the proof of the Proposition 3.5, we transform the tables in such a way that the cardinality of the disagreement string is at most three. ∎

Now we deal with the disagreement string of length three, $\alpha\beta\gamma$ . We begin with preparatory lemmas.

Lemma 3.7.

Suppose that the disagreement string between $T_{0}$ and $T_{1}$ is $\alpha\beta\gamma$ , in columns $1,2,3$ . Then we may assume that there exists a row $r^{\prime}$ in $T_{0}$ containing the string $00$ in columns $1,2,3$ .

Proof.

We introduce the counting function $0_{123}-\alpha_{1}-\beta_{2}-\gamma_{3}$ . By the action of the group of flows $\mathfrak{G}$ , we may assume that the sum is nonnegative on $T_{0}$ . Then there exists a row $r^{\prime}$ in $T_{0}$ where the function is strictly positive.

If in the row $r^{\prime}$ we have $0_{123}=1$ , then $\alpha_{1}=\beta_{2}=\gamma_{3}=0$ , by the assumption on the counting function evaluated at $r^{\prime}$ . By the action of the group of flows $\mathfrak{G}$ , we may assume without loss of generality that $r^{\prime}$ contains the string $0xy$ in columns $1,2,3$ . Then $x\neq 0,\beta$ by assumption. Also, $x\neq\gamma$ , as otherwise we would exchange the string $0\gamma$ with $\alpha\beta$ in the first row of $T_{0}$ , reducing the Hamming distance between $T_{0}$ and $T_{1}$ . Hence $x=\alpha$ . Similarly, $y\neq 0,\gamma$ and $y\neq\beta$ , as otherwise we exchange $0\beta$ with $\alpha\gamma$ in the first row of $T_{0}$ . Hence $r^{\prime}$ contains the string $0\alpha\alpha$ in columns $1,2,3$ , which we exchange with the first row of $T_{0}$ . ∎

Lemma 3.8.

We may assume that the row $r^{\prime}$ of Lemma 3.7 in $T_{0}$ contains the string $00\gamma$ in columns $1,2,3$ . More generally, for every row $r^{\prime\prime}$ containing the string $00$ in columns $1,2,3$ , the nonzero element of $r^{\prime\prime}$ in columns $1,2,3$ coincides with the corresponding entry of the first row of $T_{0}$ .

Proof.

The row $r^{\prime}$ contains a string with exactly two elements equal to [math] in the columns $1,2$ and $3$ . By the action of $\mathfrak{S}_{n}$ , we may assume that $r^{\prime}$ contains the string $00x$ in columns $1,2,3$ . Note that $x\neq\alpha,\beta$ , as in both cases we make a quadratic move between $r^{\prime}$ and the first row of $T_{0}$ , reducing the Hamming distance between $T_{0}$ and $T_{1}$ . Thus $x=\gamma$ . By the action of the group of flows $\mathfrak{G}$ , in every row $r^{\prime\prime}$ containing the string $00$ in columns $1,2,3$ , the nonzero entry coincides with the corresponding entry of the first row of $T_{0}$ . ∎

Lemma 3.9.

Suppose that in $T_{0}$ we have a row $r^{\prime}$ containing $00\gamma$ . Then this is the only string that a row with $00$ in columns $1,2,3$ may contain.

Proof.

Since the row $r^{\prime}$ in $T_{0}$ contains $00\gamma$ , then it cannot contain another copy of $\gamma$ , as we would exchange with the first row of $T_{0}$ , thus reducing the Hamming distance. Hence $r^{\prime}$ contains $00\gamma\alpha\beta$ , since it is a flow. Assume there exists another row $r^{\prime\prime}$ containing a string with $00$ , different from $00\gamma$ . By Lemma 3.8, the unique nonzero entry in columns $1,2,3$ of $r^{\prime\prime}$ agrees with the corresponding entry of the first row of $T_{0}$ . Assume that $r^{\prime\prime}$ contains $0\beta 0$ in columns $1,2,3$ . Then we apply the cubic move $\alpha\beta\gamma 00+00\gamma\alpha\beta+0\beta 0=0\beta 00\beta+0\beta\gamma\alpha 0+\alpha 0\gamma$ , reducing the Hamming distance. For a row containing $\alpha 00$ we conclude in the same way. ∎

Lemma 3.10.

As in the proof of Lemma 3.9, we assume that $r^{\prime}$ contains $00\gamma\alpha\beta$ in columns $1,2,3,4,5$ . There exists a row $r^{\prime\prime}$ in $T_{0}$ such that $r^{\prime\prime}(3)=0$ and, moreover, $r^{\prime\prime}$ contains the string $\alpha\beta 0\alpha\beta$ in columns $1,2,3,4,5$ .

Proof.

Such a row $r^{\prime\prime}$ exists in $T_{0}$ by the compatibility of the two tables. The structure of $T_{0}$ is:

[TABLE]

By Lemma 3.9, we have $x,y\neq 0$ . Analogously, we have $z,w\neq 0$ by applying Lemma 3.9, upon exchanging the string $\alpha\beta\gamma 00$ in the first row with $00\gamma\alpha\beta$ in the second row.

Note that $x\neq\beta$ and $y\neq\alpha$ , as otherwise, exchanging with the first row, in the first case with $\alpha\gamma$ and in the second with $\beta\gamma$ , we would reduce the Hamming distance; analogously, $z\neq\beta$ and $w\neq\alpha$ . Furthermore, by Lemma 3.8, we have $x,y\neq\gamma$ as otherwise we would create the string $\gamma 00$ and $0\gamma 0$ respectively. Analogously $z,w\neq\gamma$ . Hence the only remaining possibility is $xy=\alpha\beta$ and $zw=\alpha\beta$ . ∎

Lemma 3.11.

The counting function $0_{12345}-\alpha_{14}-\beta_{25}-\gamma_{3}$ is at most $-1$ on every row of $T_{0}$ .

Proof.

For the sake of contradiction, suppose there exists a row $r$ in $T_{0}$ , where the counting function is nonnegative. In $T_{0}$ , there exists a row $r^{\prime\prime}$ with $r^{\prime\prime}(3)=0$ . By Lemma 3.10, the row $r^{\prime\prime}$ contains the string $\alpha\beta 0\alpha\beta$ .

If in the row $r$ we have $0_{12345}\geq 3$ , then $r(3)\neq 0$ , again, by Lemma 3.10. Hence we have at least two differences with $r^{\prime\prime}$ and we can make a quadratic move between $r$ and $r^{\prime\prime}$ . This reduces the Hamming distance. Thus on the row $r$ one has $0_{12345}\leq 2$ .

If $0_{12345}=2$ on $r$ , we have the following possibilities:

(i)

$r$ contains $00\gamma xy$ ; 2. (ii)

$r$ contains $xyz00$ ; 3. (iii)

$r$ contains $x0yz0$ ; 4. (iv)

$r$ contains $x0y0z$ .

In case (i), we have $x,y\neq 0$ by the assumption on the value of the counting function. Additionally, $x,y\neq\gamma$ , as we would exchange the string $00\gamma\gamma$ with the first row in $T_{0}$ . Consider the differences between $r$ and $r^{\prime\prime}$ . If $xy\neq\alpha\beta$ , then we can make a move involving column $3$ , at most one of columns $1,2$ and either column $4$ or $5$ between $r$ and $r^{\prime\prime}$ . This allows us to exchange $\gamma$ in $r$ with [math] in $r^{\prime\prime}$ ; this contradicts Lemma 3.10. Hence $xy=\alpha\beta$ , which on the other hand contradicts the nonnegativity of the counting function. Exchanging $r^{\prime}$ , the row appearing in Lemma 3.10 containing $00\gamma\alpha\beta$ , with the first row of $T_{0}$ , case (ii) is the same as case (i).

In case (iii), $x\neq 0$ by the assumption on the value of the counting function. Moreover, $x\neq\gamma$ since we would exchange $\gamma 0$ in $r$ with the string $\alpha\beta$ in $r^{\prime\prime}$ in columns $1,2$ , contradicting Lemma 3.8. We also have $x\neq\beta$ , because we could make a quadratic move in columns $1,5$ between $\beta 0$ in $r$ with $0\beta$ in $r^{\prime}$ , obtaining the string $\beta 0\gamma\alpha 0$ in $r^{\prime}$ . Now, we exchange in columns $1,3$ , the string $\beta\gamma$ in $r^{\prime}$ with $\alpha 0$ in $r^{\prime\prime}$ , which produces the string $\alpha 00\alpha 0$ ; this reduces the Hamming distance. Finally, if $x=\alpha$ , we exchange in columns $1,2,5$ , the string $\alpha 00$ in $r$ with $\alpha\beta\beta$ in $r^{\prime\prime}$ , obtaining $\alpha 00\alpha 0$ , which again reduces the Hamming distance.

In case (iv), $x\neq 0$ by the assumption on the value of the counting function. Additionally, $x\neq\gamma$ , because otherwise we would exchange in columns $1,2$ the string $\gamma 0$ in $r$ with $\alpha\beta$ in $r^{\prime\prime}$ , thus contradicting Lemma 3.10. Also, $x\neq\beta$ , as we would make a quadratic move on columns $1,2,4$ between $r$ and $r^{\prime\prime}$ , contradicting again Lemma 3.10. Analogously, $z\neq 0,\alpha,\gamma$ . Hence $r$ contains the string $\alpha 0y0\beta$ . We exchange in columns $1,2,4,5$ the string $\alpha 00\beta$ in $r$ with $00\alpha\beta$ in $r^{\prime}$ , which produces $00y$ in $r$ , which in turn implies $y=\gamma$ by Lemma 3.8. This contradicts the nonnegativity of the counting function.

If $0_{12345}=1$ , by symmetry, we may assume $r(2)=0$ or $r(3)=0$ . If $r(3)=0$ , then by Lemma 3.10, $r$ contains $\alpha\beta 0\alpha\beta$ , which contradicts the nonnegativity of the counting function. If $r(2)=0$ , then $r$ contains $x0yzt$ . Then $z\neq 0$ by the assumption. Moreover, $z\neq\gamma$ , as we would exchange $r$ with $r^{\prime\prime}$ in columns $2,4$ , contradicting Lemma 3.10.

If $z=\alpha$ , we now consider the value of $x$ . We have $x\neq 0$ by assumption. We have $x\neq\alpha$ by assumption on the nonnegativity of the counting function. Moreover, $x\neq\gamma$ , since otherwise we would exchange in columns $1,2$ the string $\gamma 0$ in $r$ with $\alpha\beta$ in $r^{\prime\prime}$ contradicting Lemma 3.10. Hence $x=\beta$ , i.e., $r$ contains the string $\beta 0y\alpha t$ . Now, $t\neq 0$ , by the assumption on the value of $0_{12345}$ . Moreover $t\neq\beta$ , by the assumption on the value of the counting function on $r$ . Also notice that $t\neq\gamma$ , as otherwise we exchange in columns $1,5$ the string $\beta\gamma$ of $r$ with $\alpha 0$ of the first row of $T_{0}$ , and then we exchange $\beta\beta$ from the first row with $00$ in $r^{\prime}$ reducing the Hamming distance. Therefore $r$ contains the string $\beta 0y\alpha\alpha$ , which we exchange with the string $\alpha\beta 0\alpha\beta$ in $r^{\prime\prime}$ in columns $1$ and $5$ , contradicting Lemma 3.10.

If $z=\beta$ , then $r$ contains $x0y\beta t$ . Furthermore, $t\neq 0$ by assumption on the value of $0_{12345}$ . Moreover, $t\neq\alpha$ exchanging in columns $4,5$ the string $\beta\alpha$ of $r$ with $\alpha\beta$ of $r^{\prime\prime}$ , contradicting Lemma 3.10. Analogously, we would contradict Lemma 3.10 for $t=\gamma$ , exchanging in columns $2,4,5$ , the string $0\beta\gamma$ in $r$ with $\beta\alpha\beta$ in $r^{\prime\prime}$ . Hence $r$ contains the string $x0y\beta\beta$ . Here $x\neq 0$ , by assumption. Moreover, $x\neq\alpha$ , because of the nonnegativity of the counting function. Also, $x\neq\gamma$ , because we would contradict Lemma 3.10, exchanging $\gamma 0$ of $r$ with $\alpha\beta$ of $r^{\prime\prime}$ . Therefore $r$ contains $\beta 0y\beta\beta$ , but we exchange it with $\alpha\beta 0\alpha\beta$ in columns $1,4$ contradicting Lemma 3.10.

If $0_{12345}=0$ , then $\alpha_{14}=\beta_{25}=\gamma_{3}=0$ , by the assumption on the nonnegativity of the function on $r$ . Thus $r$ contains $xyztw$ different from $\alpha\beta 0\alpha\beta$ in columns $1,2,4,5$ . Hence we have two identical differences between $r$ and $r^{\prime\prime}$ , which allow to make a quadratic move, contradicting Lemma 3.10. ∎

Proposition 3.12.

The disagreement string $\alpha\beta\gamma$ can be reduced.

Proof.

By Lemma 3.11, the counting function $0_{12345}-\alpha_{14}-\beta_{25}-\gamma_{3}$ is at most $-1$ on every row of $T_{0}$ . As a consequence, there exists a row $r$ in $T_{1}$ , where the function is at most $-2$ . By the value of the counting function on the row $r$ , the entries in $r$ must agree in two, three, four or five entries with $\alpha\beta\gamma\alpha\beta$ .

If $r$ agrees in five entries, it contains $\alpha\beta\gamma\alpha\beta$ . We exchange $\alpha\beta\gamma$ with $000$ in the first row of $T_{1}$ , which reduces the Hamming distance between $T_{0}$ and $T_{1}$ . If $r$ agrees in four entries, we denote by $x$ the element where $r$ does not agree with $\alpha\beta\gamma\alpha\beta$ . If $x\neq r(3)$ , then we would have either the string $\alpha\beta\gamma$ or $\gamma\alpha\beta$ , which is also in table $T_{0}$ ; this reduces the Hamming distance. Suppose $r$ contains $\alpha\beta x\alpha\beta$ . If $x=0$ , the table $T_{0}$ contains the same flow. If $x=\alpha$ or $\beta$ , we exchange $\alpha\alpha$ or $\beta\beta$ with $00$ in the first row of $T_{1}$ .

If $r$ agrees with $\alpha\beta\gamma\alpha\beta$ in three entries, we denote by $xy$ the remaining two. First, note that if $xy$ are in columns $1,2$ or in columns $4,5$ , we exchange $\alpha\beta\gamma$ or $\gamma\alpha\beta$ with $000$ in the first row of $T_{1}$ ; this decreases the Hamming distance.

Assume that both of $x$ and $y$ are in columns $1,2,3$ . If $r$ contains $x\beta y\alpha\beta$ , then $x\neq\alpha,\beta$ , because otherwise we would exchange the string $\alpha\alpha$ or $\beta\beta$ with the first row of $T_{1}$ reducing the Hamming distance. Whence $x=0,\gamma$ . Moreover $y\neq\gamma$ , by definition. Additionally, $y\neq\beta$ , because we would move $\beta\beta$ to the first row of $T_{1}$ , reducing the Hamming distance. It follows that $y=0,\alpha$ . On the other hand, $xy\neq 00$ , since the counting function $0_{12345}-\alpha_{14}-\beta_{25}-\gamma_{3}$ is at most $-2$ on $r$ . Furthermore, $x+y\neq\beta$ , as otherwise we would exchange $x\beta y$ with $000$ in the first row of $T_{1}$ , reducing the Hamming distance between $T_{0}$ and $T_{1}$ . Hence $r$ contains either $\gamma\beta 0\alpha\beta$ or $0\beta\alpha\alpha\beta$ . For the first, we exchange in columns $2,3,5$ , the string $\beta 0\beta$ with $000$ in the first row of $T_{1}$ , and we exchange $\alpha\beta 0\alpha\beta$ in $T_{0}$ with the first row of $T_{0}$ . For the second, we exchange $0\beta\alpha\alpha\beta$ with the first row of $T_{1}$ and $\alpha\beta 0\alpha\beta$ in $T_{0}$ with the first row of $T_{0}$ , which reduces the Hamming distance.

If $r$ contains $\alpha xy\alpha\beta$ , then applying the automorphism $\alpha\leftrightarrow\beta$ and a transposition between columns $1$ and $2$ , we are in the case when the row $r$ contains $x\beta y\alpha\beta$ .

If $x,y$ are both in columns $3,4,5$ , we apply analogous moves as the ones featured above. Then we may assume that $x$ is either in column $1$ or $2$ , and $y$ is either in column $4$ or $5$ . In all these cases, we have $x=0$ and $y=0$ , as all the other possibilities are excluded by exchanging with the first row of $T_{1}$ . The fact that $x=y=0$ contradicts the value of the counting function on $r$ .

If $r$ agrees with $\alpha\beta\gamma\alpha\beta$ in two entries, we have $0_{12345}=0$ on $r$ , since the value of the counting function $0_{12345}-\alpha_{14}-\beta_{25}-\gamma_{3}$ on $r$ is at most $-2$ . In columns $1,2,3$ , there is at least one entry $x$ which does not agree with the corresponding entry in $\alpha\beta\gamma$ , because otherwise we would move $\alpha\beta\gamma$ to the first row of $T_{1}$ , reducing the Hamming distance. Denoting the elements where they do not agree by $x,y,z$ , the strings that $r$ may contain are: $\alpha xy\alpha z$ , $\alpha\beta xyz$ , and $\alpha xyz\beta$ . Note that these are all the possible, as the remaining ones are resolved in the same way upon exchanging the string $\alpha\beta\gamma 00$ in the first row with $00\gamma\alpha\beta$ in the second row of $T_{0}$ . If $r$ contains $\alpha xy\alpha z$ , then we exchange the string $\alpha\alpha$ of $r$ in columns $1,4$ with $00$ in $T_{1}$ . We now exchange the string $\alpha\beta 0\alpha\beta$ of $r^{\prime\prime}$ with the first row in $T_{0}$ ; these two rows have lower Hamming distance. If $r$ contains $\alpha\beta x$ in columns $1,2,3$ , then $x\neq\gamma$ , by the counting function. Moreover, $x\neq 0$ since $0_{12345}=0$ . Hence $x=\alpha$ or $\beta$ . Now we exchange $\alpha\alpha$ or $\beta\beta$ with $00$ in the first row of $T_{1}$ reducing the Hamming distance. If $r$ contains $\alpha xyz\beta$ , by definition or by quadratic moves we can exclude the cases $x=\alpha,\beta,0$ , and $y=\alpha,\gamma,0$ . Hence $r$ contains $\alpha\gamma\beta$ , which we exchange with the first row of $T_{1}$ , decreasing the Hamming distance. ∎

The preceding results of this section show the following corollary.

Corollary 3.13.

The Hamming distance of two flows can be reduced to at most two.

3.3 The disagreement string $\alpha\alpha$

In this section, we proceed in the case of the disagreement string $\alpha\alpha$ .

[TABLE]

Let us denote the row in $T_{0}$ starting with the string $0x$ by $r_{0x}$ and the row in $T_{1}$ starting with the string $\alpha z$ by $r_{\alpha z}$ . After fixing the first rows and the first two columns, we make moves of degree at most four on the rest of tables in such a way that the number of agreements in $r_{0x}$ and $r_{\alpha z}$ is maximized.

Remark 3.14.

Corollary 3.6 ensures that, after possibly making moves of degree at most four, the rows $r_{0x}$ and $r_{\alpha z}$ in $T_{0}$ and $T_{1}$ respectively, agree in at least $n-5$ entries. Up to the action of $\mathfrak{S}_{n}$ on the $n$ leaves, and hence on the columns, these are the last $n-5$ columns.

Definition 3.15.

The string in the last $n-5$ columns of the rows $r_{0x}$ and $r_{\alpha z}$ is the the agreement string between $r_{0x}$ and $r_{\alpha z}$ . Up to the action of the group of flows $\mathfrak{G}$ , these entries are zeros.

Our aim is to prove the following three crucial cases, which we refer to as the main case:

[TABLE]

In Section 3.3.1, we reduce any other possible case to one of the above.

3.3.1 Reduction to the main case

Up to the action of the group of flows $\mathfrak{G}$ , there are at least as many copies of [math] as copies of $\alpha$ in the first two columns of $T_{0}$ . Up to the action of $\textnormal{Aut}(G)$ , we may assume $x=\beta$ . We will show that all cases can be resolved, by reducing to the main case ( $\star$ ‣ 3.3).

We first collect a useful lemma which we will use to resolve easily some of the cases.

Lemma 3.16.

If in table $T_{1}$ in (1) we have $\left\{z,w\right\}=\left\{\beta,\gamma\right\}$ , then the corresponding cases can be reduced. If in table $T_{0}$ in (1) we have $\left\{x,y\right\}=\left\{\beta,\gamma\right\}$ , then the corresponding cases can be reduced.

Proof.

If $\left\{z,w\right\}=\left\{\beta,\gamma\right\}$ , then in $T_{1}$ we have either the cubic move $00+\alpha\beta+\gamma\alpha=\alpha\alpha+0\beta+\gamma 0$ or $00+\alpha\gamma+\beta\alpha=\alpha\alpha+\beta 0+\gamma 0$ . The second sentence is the symmetric version of the first: acting with the flow $(\alpha,\alpha,0,\ldots,0)\in\mathfrak{G}$ on the tables, we produce the same tables as in the first statement. ∎

We now analyze all the possible cases. We refer to the tables $T_{0}$ and $T_{1}$ in (1).

Case $y=\alpha$ . In this case, the table $T_{0}$ has the form:

[TABLE]

We may have $z=0,\beta,\gamma$ .

$z=\beta$ .

Here, $w=\gamma$ is reduced by Lemma 3.16. Hence we have $w=0$ (Case I) or $w=\beta$ (Case II).

$z=0$ .

Here, $w=0$ (Case X), $w=\beta$ (Case VII), $w=\gamma$ (Case VI).

$z=\gamma$ .

Here, $w=0$ (Case IV), $w=\gamma$ (Case V), $w=\beta$ is resolved by Lemma 3.16.

Case $y=\beta$ . In this case, the table $T_{0}$ has the form:

[TABLE]

We may have $z=\beta,0,\gamma$ .

$z=\beta$ .

Here, $w=0$ (which is Case II by acting with the flow $(\alpha,\alpha,0,\ldots,0)\in\mathfrak{G}$ and $\gamma\leftrightarrow\beta$ ), $w=\beta$ (Case III), $w=\gamma$ resolved by Lemma 3.16.

$z=0$ .

Here, $w=0$ (Case IX), $w=\beta$ (which is Case II by acting with the flow $(\alpha,\alpha,0,\ldots,0)\in\mathfrak{G}$ , transposing and $\gamma\leftrightarrow\beta$ ), $w=\gamma$ (which is Case V by acting the flow $(\alpha,\alpha,0,\ldots,0)\in\mathfrak{G}$ and transposition).

$z=\gamma$ .

Here, $w=0$ (which is Case V by acting with the flow $(\alpha,\alpha,0,\ldots,0)\in\mathfrak{G}$ ), $w=\gamma$ (Case VIII).

We now reduce all the cases to the main case ( $\star$ ‣ 3.3), postponing its proof for the moment, as this requires more technical results.

Cases IV and V.

In this case we have:

[TABLE]

We may assume we do not have strings $\gamma\gamma,00,\gamma 0,0\gamma$ in columns $1,2$ of $T_{0}$ ; this is shown by the same arguments in the proof of Lemma 3.22. Hence the counting function $\alpha_{12}+\beta_{12}-0_{12}-\gamma_{12}$ is nonnegative on every row of $T_{0}$ . On the other hand, in the table $T_{1}$ , in columns $1,2$ we do not have the string $\alpha\beta$ , as we would reduce this case with a cubic move. In the same columns of $T_{1}$ , the string $\alpha\alpha$ would decrease the Hamming distance. Moreover, the string $\beta\beta$ is reduced by the cubic move $\alpha\gamma+0\alpha+\beta\beta=0\beta+\beta\gamma+\alpha\alpha$ , and $\beta\alpha$ is reduced by the cubic move $00+\alpha\gamma+\beta\alpha=\alpha\alpha+\beta 0+0\gamma$ . This is a contradiction and thus it shows the reduction.

**Case VI.

**In this case we have:

[TABLE]

In columns $1,2$ in $T_{1}$ , the string $\alpha\beta$ is resolved by Lemma 3.16. The string $\alpha\gamma$ in columns $1,2$ of $T_{1}$ is Case V. Since we cannot have the string $\alpha\alpha$ in columns $1,2$ of $T_{1}$ , the counting function $\alpha_{1}-0_{2}$ is nonpositive in every row of $T_{1}$ . Thus there exists a row $r$ in $T_{0}$ with $r(2)=0$ and $r(1)\neq\alpha$ . Hence $r(1)=\beta$ . Acting by the flow $(\alpha,\alpha,0,\ldots,0)$ and transposition we reduce to Case V.

**Case VII.

**In this case we have:

[TABLE]

We exclude the string $\alpha\beta$ in columns $1,2$ in $T_{1}$ , since it is Case II. We also exclude $\alpha\gamma$ by Lemma 3.16. As in Case VI, there exists a row $r$ in $T_{0}$ such that $r(1)=\beta$ and $r(2)=0$ . Now, by acting with the flow $(\alpha,\alpha,0,\ldots,0)\in\mathfrak{G}$ , making a transposition and applying the group automorphism $\gamma\leftrightarrow\beta$ , we reduce to Case II.

Case VIII.

In this case we have:

[TABLE]

We may exclude in columns $1,2$ in $T_{0}$ the string $00$ . Also, we exclude the string $\gamma\gamma$ by the quartic move $\alpha\alpha+0\beta+\beta 0+\gamma\gamma=00+\alpha\gamma+\gamma\alpha+\beta\beta$ . Moreover, in columns $1,2$ in $T_{0}$ , notice that we can exclude the strings $0\gamma$ and $\gamma 0$ by Lemma 3.16. Hence the counting function $\alpha_{12}+\beta_{12}-0_{12}-\gamma_{12}$ is nonnegative on every row of $T_{0}$ . On the other hand, in $T_{1}$ we may reduce the string $\alpha\alpha$ , $\alpha\beta$ and $\beta\alpha$ by Lemma 3.16. Finally, we are able to reduce the string $\beta\beta$ by the quartic move $00+\alpha\gamma+\gamma\alpha+\beta\beta=\gamma\gamma+\beta 0+0\beta+\alpha\alpha$ . This is a contradiction and thus it shows the reduction.

Case IX.

In this case we have:

[TABLE]

Analogously to the proof of Lemma 3.22, we exclude $\gamma\gamma,0\gamma,\gamma 0,00$ in columns $1,2$ of $T_{0}$ . So the counting function $\alpha_{12}+\beta_{12}-0_{12}-\gamma_{12}$ is nonnegative on every row of $T_{0}$ . On the other hand, in columns $1,2$ of $T_{1}$ , the strings $\alpha\beta,\beta\alpha$ correspond to the case for $z=\beta$ and $w=0$ in tables (1), which were previously done. Thus there exists a row $r$ such that $r(1)=\beta$ and $r(2)=\beta$ by the positivity of the counting function in $T_{0}$ and $T_{1}$ . Exchanging the string $00$ in the first row with the string $\beta\beta$ in $r$ , acting by $\alpha\alpha$ on both $T_{0}$ and $T_{1}$ , applying the automorphisms $\gamma\leftrightarrow\alpha$ and $\gamma\leftrightarrow\beta$ we obtain Case III.

Case X.

In this case we have:

[TABLE]

In $T_{1}$ , in columns $1,2$ we can exclude $\alpha\beta$ , because it is Case I. The string $\alpha\gamma$ reduces to Case IV. As usual, the string $\alpha\alpha$ is excluded. Hence the counting function $\alpha_{1}-0_{2}$ is nonpositive in every row of $T_{1}$ . Hence there exists a row $r$ in $T_{0}$ such that $r(1)\neq\alpha$ and $r(2)=0$ . The possible values of $r(1)$ are either $\gamma$ or $\beta$ , since for $r(1)=0$ we have an immediate reduction. For $r(1)=\gamma$ we apply Lemma 3.16 and $r(1)=\beta$ is Case IX.

3.3.2 Preliminary Lemmas

We are now ready to present our preliminary lemmas, that are devised to tackle the main case ( $\star$ ‣ 3.3). As they will be used very often, we give them specific reference names in order to facilitate the reading.

Lemma 3.17 (Difference Lemma).

Suppose we have the table $T$ whose first three rows are $r_{1},r_{2},r_{3}$ :

[TABLE]

where $q,x,y,z\in G$ and $x\neq y,z$ . If one of the following holds:

(i)

$z\neq y$ * and $r_{2}(i)-r_{3}(i)$ is $x-y$ or $x-z$ for some $i>2$ ; or* 2. (ii)

$z=y$ , $q\neq y$ and $r_{2}(i)-r_{3}(i)$ is $x-y$ or $x-q$ for some $i>2$ ,

then we can transform the row $r_{1}$ to a row starting with the string $xx$ .

Proof.

When the difference $r_{2}(i)-r_{3}(i)=x-y$ in both (i) and (ii), we make the quadratic move $xyw+zx(w+x-y)=xx(w+x-y)+zyw$ , which exchanges the corresponding entries in rows $r_{2}$ and $r_{3}$ , thus creating a row starting with the string $xx$ . Analogously for the case (i), when the difference is $r_{2}(i)-r_{3}(i)=x-z$ . In (ii), when the difference is $r_{2}(i)-r_{3}(i)=x-q$ , we make the cubic move $qq+xyw+yx(w+x-q)=xx+qy(w+x-q)+yqw$ . ∎

Remark 3.18.

Note that the Difference Lemma 3.17 distinguishes one group element in each table in each of the crucial cases Case I, Case II, and Case III. In all the cases, these are $\gamma$ in $T_{0}$ and $\beta$ in $T_{1}$ . In particular, if the second and third row differ on some index $i>2$ , then their difference must be equal to the distinguished element.

Although basic, the Difference Lemma 3.17 will be used very frequently. We apply it following the observation above. Indeed, our aim will be often to produce a row starting with a string of type $xx$ and conclude by induction. To this end, after identifying the situation described in Lemma 3.17, if $r_{2}(i)\neq r_{3}(i)$ , then we will be able to immediately infer what can be the element $r_{2}(i)-r_{3}(i)\in G$ ; to exclude all the other possible values we apply the Difference Lemma 3.17, obtaining a row starting with the string $xx$ . This will be useful to decrease the given Hamming distance and conclude by induction on the degree.

Lemma 3.19 (Standard Lemma).

Let $T$ be a table and suppose there is an element $y\in G$ in some row $r$ with $r(n-1)=0$ and $r(n)=0$ . Suppose there is a row $r^{\prime}$ of $T$ with $r^{\prime}(n-1)=x$ and $r^{\prime}(n)=x$ , where $0\neq x\in G$ , and a row $r^{\prime\prime}$ with the element $y+x$ in the same column as $y$ . Then we can exchange $y$ and $y+x$ (and appropriate entries in columns $n-1$ and $n$ ). The same statement holds when $y$ is a string of elements of $G$ .

Proof.

Let us consider the entries $r^{\prime\prime}(n-1)=u$ and $r^{\prime\prime}(n)=v$ . If $u=x$ or $v=x$ , then we make a quadratic move putting $y+x$ and $x$ in the row $r$ . If $u=0$ or $v=0$ , then we move the string $xx$ to the row $r$ , and finally we exchange $x$ with [math] and $y$ with $y+x$ . If $u=v$ are equal, then we move the string $xx$ in the row $r^{\prime\prime}$ exchanging it with the string $uv$ , thus we exchange $y$ with $y+x$ and [math] with $x$ . Hence, we may assume that $u\neq v$ and they are both different from [math] and $x$ . Hence, the sum $u+v+x=0$ . Thus, we may exchange $y$ with $y+x$ and $00$ with $uv$ . The last statement is shown using the same arguments. This completes the proof. ∎

We now record some technical results on the main case ( $\star$ ‣ 3.3). Note that in the main case we have $x=z=\beta$ .

Lemma 3.20.

If $r_{0\beta}(i)=r_{\alpha\beta}(i)$ for some $i>2$ then we may assume that both are equal to $0\in G$ . In particular, both rows have [math] on the agreement string.

Proof.

Without loss of generality, let us assume $i=3$ . If $r_{0\beta}(i)=r_{\alpha\beta}(i)=\beta$ , then a quadratic move allows us to produce the string $0\beta\beta 0$ in both tables. If $r_{0\beta}(i)=r_{\alpha\beta}(i)=\gamma$ , then in both tables we obtain the string $\alpha\beta\gamma 0$ by quadratic moves again. If $r_{0\beta}(i)=r_{\alpha\beta}(i)=\alpha$ , then in both tables we obtain $\alpha 0\alpha 0$ . The last string is obtained in $T_{1}$ by quadratic moves, and in $T_{0}$ by the following moves:

(i)

in Case I and II, by the cubic move $\alpha\alpha 0+0\beta\alpha+\alpha 0=\alpha 0\alpha+\alpha\beta 0+0\alpha$ ; 2. (ii)

in Case III, by two quadratic moves, upon exchanging $0\beta$ with $\beta 0$ .

∎

Remark 3.21.

We observe that in Case I and Case III, the tables $T_{0}$ and $T_{1}$ are in “symmetry”. More precisely, the fixed entries in table $T_{1}$ can be obtained from the ones in $T_{0}$ , by acting with the flow $(\alpha,\alpha,0,\dots,0)\in\mathfrak{G}$ and applying the automorphism $\beta\leftrightarrow\gamma$ of $G$ , that exchanges $\beta$ and $\gamma$ . In particular, if we can prove a statement for $T_{0}$ then a “symmetric” statement holds for $T_{1}$ .

Lemma 3.22.

We may assume that no row in $T_{0}$ contains in columns $1,2$ any string of the form $\gamma\gamma,0\gamma,\gamma 0,00$ . Analogously, no row in $T_{1}$ contains in columns $1,2$ any of the strings of the form $\gamma\gamma,\alpha\gamma,\gamma\alpha,\alpha\alpha$ .

Proof.

In all the cases, one can obtain either $00$ in $T_{0}$ or $\alpha\alpha$ in $T_{1}$ . For $T_{0}$ , these are: $\alpha\alpha+0\beta+\alpha 0+0\gamma=00+\alpha\gamma+0\alpha+\alpha\beta$ , $\alpha\alpha+0\beta+\gamma 0=00+\gamma\alpha+\alpha\beta$ , $\alpha\alpha+0\beta+\beta 0+\gamma\gamma=00+\gamma\alpha+\alpha\gamma+\beta\beta$ . The statement for $T_{1}$ readily follows by Remark 3.21. ∎

Lemma 3.23.

For any row $r_{xy}$ in $T_{0}$ differing from $r_{0\beta}$ on some column index $i>2$ not by $\gamma$ , we may assume $xy=\alpha\alpha,\beta\beta,\alpha\beta$ or $\beta\alpha$ . Analogously, in $T_{1}$ , if $r_{xy}$ differs from $r_{\alpha\beta}$ on some column index $i>2$ not by $\beta$ , then $xy=00,\beta\beta,\beta 0$ or $0\beta$ .

Proof.

In Case I and Case II, by the Difference Lemma 3.17, and a quadratic move with $r_{0\beta}$ or with $r_{\alpha 0}$ in $T_{0}$ , we may assume $x+y=0$ or $x+y=\gamma$ . The result follows by Lemma 3.22.

In Case III we exclude $x+y=\alpha$ . Indeed, if $xy=\alpha 0$ or $xy=0\alpha$ , then we are in Case II (more precisely, for $0\alpha$ we also need to exchange the two columns to reduce to Case II). If $xy=\beta\gamma$ or $\gamma\beta$ , by the quadratic moves $0\beta w+\beta\gamma(w+\alpha)=0\gamma(w+\alpha)+\beta\beta w$ or $0\beta w+\gamma\beta(w+\beta)=\beta\beta(w+\beta)+\gamma 0w$ we produce $0\gamma$ or $\gamma 0$ and apply Lemma 3.16. Remark 3.21 gives the symmetric statement for $T_{1}$ . ∎

Lemma 3.24.

If there exists and index $j$ such that $r_{0\beta}(j)=\beta$ in $T_{0}$ , then we may assume that $r_{\alpha\beta}(j)=0$ in $T_{1}$ . Analogously, if there exists an index $j$ such that $r_{\alpha\beta}(j)=\gamma$ in $T_{1}$ , then we may assume that $r_{0\beta}(j)=0$ .

Proof.

Assume $r_{0\beta}(3)=\beta$ . Suppose $r_{\alpha\beta}(3)=\alpha$ or $\gamma$ in $T_{1}$ . Then there exists a row $r$ in $T_{1}$ with $r(3)=\beta$ . The row $r$ contains the string $xy\beta$ in columns $1,2,3$ for some $x,y\in G$ . Let us determine the possible values of $r(2)=y$ . If $y=\beta$ we would have the string $0\beta\beta$ in both of the tables. By Lemma 3.23, $y=0$ . Whence the counting function $\beta_{3}-0_{2}$ is nonpositive on every row of $T_{1}$ . It follows that in $T_{0}$ there exists a row $r^{\prime}$ with $r^{\prime}(2)=0$ and $r^{\prime}(3)\neq\beta$ . By Lemma 3.23, we have $r^{\prime}(3)=\alpha$ . For $r_{\alpha\beta}(3)=\alpha$ , by quadratic moves, we obtain $\alpha 0\alpha 0$ in both tables. Now consider the case $r_{\alpha\beta}(3)=\gamma$ . In $T_{1}$ for every row $r$ with $r(2)=0$ and $r(3)=\beta$ (likewise the $r$ above), we have $r(1)=\beta$ . Indeed $r^{\prime\prime}(1)$ is either [math] or $\beta$ by Lemma 3.23. On the other hand, $r^{\prime\prime}(1)\neq 0$ because otherwise we would produce the string $0\beta\beta$ in $T_{1}$ , which is also in $T_{0}$ . Hence $r^{\prime\prime}(1)=\beta$ . Since in $T_{0}$ we have the row $r^{\prime}$ with $r^{\prime}(3)=\alpha$ , there exists a row $r^{\prime\prime}$ in $T_{1}$ with $r^{\prime\prime}(3)=\alpha$ . If $r^{\prime\prime}(2)=0$ we are done, as we produce $\alpha 0\alpha$ in both tables. For $r^{\prime\prime}(2)=\alpha$ we have the cubic move in $T_{1}$ , $\alpha\beta\gamma+\beta 0\beta+\alpha\alpha=\alpha 0\alpha+\beta\alpha\gamma+\beta\beta$ . For $r^{\prime\prime}(2)=\beta$ , we have the quartic move in $T_{1}$ , $000+\alpha\beta\gamma+\beta 0\beta=\alpha 0\alpha+\beta\beta 0+0\beta\beta+0\gamma$ . For $r^{\prime\prime}(2)=\gamma$ , we have the quartic move in $T_{1}$ , $000+\alpha\beta\gamma+\beta 0\beta+\gamma\alpha=\alpha 0\alpha+\beta\beta 0+0\gamma\gamma+0\beta$ . ∎

3.3.3 The case of $n=6$ leaves

After having set up the cornerstone of our approach, we are ready to first establish the case of $n=6$ leaves. Let $P$ be the lattice polytope of the Kimura $3$ -parameter model for $n=6$ leaves. Here we are in the setting of polytopes. To be consistent with standard terminology, binomials in the ideal of the Kimura $3$ -parameter model are identified with relations among lattice points, which in turn are naturally identified with variables. The minimal generating relations among the vertices of the polytope $P$ constitute a Markov basis. The degree of an element of a Markov basis is the total degree of the corresponding binomial in the standard grading. The degree of the corresponding table is the number of rows. Only in this section, given a Markov basis element $B$ , which we think of as a binomial, we introduce the notation $\deg(B)$ to denote its degree.

As recalled in Section 2, the polytope $P$ is $18$ dimensional. Following the notation of Section 2, a generating set of the full lattice $M^{6}$ is $e_{(i,g)}\in[6]\times G$ . However, our lattice is a sublattice of $M^{6}$ . Since we have the six linear relations $e_{(i,0)}^{*}+e_{(i,\alpha)}^{*}+e_{(i,\beta)}^{*}+e_{(i,\gamma)}^{*}=1$ for $1\leq i\leq 6$ satisfied by the vertices of the polytope, we can choose the elements $e_{(i,\alpha)},e_{(i,\beta)},e_{(i,\gamma)}$ for $1\leq i\leq 6$ to serve as a basis of the $18$ -dimensional lattice of interest.

Proposition 3.25.

The polytope $P$ defines an $18$ dimensional projectively normal (in particular, Cohen-Macaulay) toric variety in $\mathbb{P}^{1023}$ . Its Hilbert series is $Hs(t)=\frac{N(t)}{(1-t)^{19}}$ , where

[TABLE]

Its Hilbert polynomial is

[TABLE]

*In particular, the Markov basis has elements of degree at most $16$ .

Let us consider the following two codimension two faces of $P$ :*

(i)

$\tilde{P}$ * contains points corresponding to flows that have [math] or $\alpha$ on the sixth leaf. This is the intersection of $P$ with the linear subspace $e_{(6,\beta)}^{*}=e_{(6,\gamma)}^{*}=0$ .* 2. (ii)

$\tilde{P}^{\prime}$ * contains points corresponding to flows that do not have $\gamma$ on the sixth leaf and on the fifth leaf. This is the intersection of $P$ with the linear subspace $e_{(5,\gamma)}^{*}=e_{(6,\gamma)}^{*}=0$ .*

The Hilbert series of (i) is $Hs(t)=\frac{\tilde{N}(t)}{(1-t)^{17}}$ , where

[TABLE]

The Hilbert series of (ii) is $Hs(t)=\frac{\tilde{N}^{\prime}(t)}{(1-t)^{17}}$ , where

[TABLE]

In particular, the Markov basis in both cases has elements of degree at most $14$ .

Proof.

The computation of Hilbert series and verification of normality were obtained using Normaliz [7]. The statements about the degree of Markov basis are a consequence of well-known theorems on regularity of normal toric varieties, see Appendix 4. ∎

Lemma 3.26.

The following three codimension three faces $P_{1},P_{2},P_{3}$ of $P$ have Markov basis with elements of degree at most four:

(i)

$P_{1}$ * contains points corresponding to flows that have [math] on the sixth leaf. This is the intersection of $P$ with the linear subspace $e_{(6,\alpha)}^{*}=e_{(6,\beta)}^{*}=e_{(6,\gamma)}^{*}=0$ and is isomorphic to the Kimura $3$ -parameter model polytope for five leaves.* 2. (ii)

$P_{2}$ * contains points corresponding to flows that do not have $\beta$ or $\gamma$ on the sixth leaf and do not have $\gamma$ on the fifth leaf. This is the intersection of $P$ with the linear subspace $e_{(5,\gamma)}^{*}=e_{(6,\beta)}^{*}=e_{(6,\gamma)}^{*}=0$ .* 3. (iii)

$P_{3}$ * contains points corresponding to flows that do not have $\gamma$ on the fourth, the fifth and the sixth leaf. This is the intersection of $P$ with the linear subspace $e_{(4,\gamma)}^{*}=e_{(5,\gamma)}^{*}=e_{(6,\gamma)}^{*}=0$ .*

Proof.

We employed 4ti2 [47] to compute explicitly the Markov basis in all three cases. More specifically, for $P_{2}$ we obtained $47112$ relations: $36840$ quadrics, $2304$ cubics, and $7968$ quartics. For $P_{3}$ , we obtained $57058$ relations: $48600$ quadrics, $2176$ cubics, and $6282$ quartics. ∎

Remark 3.27.

The polytopes $P_{1},P_{2}$ and $P_{3}$ are not isomorphic, although they have the same dimension. One can easily see that $P_{1},P_{2},P_{3}$ have $256,384,432$ vertices respectively. Similarly, $\tilde{P}$ and $\tilde{P}^{\prime}$ have $512$ and $576$ vertices respectively.

Let us consider a Markov basis element $B$ of $P$ . We show that one of the following holds:

(i)

$B$ has either degree less than or equal to four; 2. (ii)

$B$ has $\deg(B)>16$ , which is not possible by Proposition 3.25; 3. (iii)

$B$ is a Markov basis element of $\tilde{P}$ or $\tilde{P}^{\prime}$ of degree at least $15$ , which is not possible by Proposition 3.25; 4. (iv)

$B$ is a Markov basis element for a polytope isomorphic to $P_{1},P_{2}$ or $P_{3}$ (in this case, it has degree at most four by Lemma 3.26).

Proposition 3.28.

Any Markov basis element $B$ for $P$ has degree at most four.

Proof.

It is enough to restrict to the main case ( $\star$ ‣ 3.3). We first prove two claims, Claim (i) and (ii).

Claim (i): For any row $r$ of $T_{0}$ distinct from the first one, for any pair of indices $2<i<j\leq 6$ , we have that either $\phi_{\gamma}(r_{0\beta}(i))=\phi_{\gamma}(r(i))$ or $\phi_{\gamma}(r_{0\beta}(j))=\phi_{\gamma}(r(j))$ . The analogous statement holds for $T_{1}$ , with the group homomorphism $\phi_{\gamma}$ replaced by $\phi_{\beta}$ .

Proof of Claim (i).

Suppose the statement is not true for some pair of indices $i,j$ . If $r_{0\beta}(i)-r(i)=r_{0\beta}(j)-r(j)$ , then we can make a quadratic move on $i,j$ , and conclude using the Difference Lemma 3.17. Thus, without loss of generality, we may assume $r_{0\beta}(i)-r(i)=\alpha$ and $r_{0\beta}(j)-r(j)=\beta$ . If there exists another index $2<k\leq 6$ such that $r_{0\beta}(k)-r(k)\neq 0$ , then we can make a move on a subset of $\{i,j,k\}$ and, again, conclude by the means of the Difference Lemma 3.17. In conclusion, $\sum_{l=3}^{6}r_{0\beta}(l)=\alpha+\beta+\sum_{l=3}^{6}r(l)$ . As $r$ and $r_{0\beta}$ are flows and $r_{0\beta}(1)+r_{0\beta}(2)=\beta$ , this contradicts Lemma 3.23, which prescribes the first two columns of a row differing not by $\gamma$ with $r_{0\beta}$ . ∎

By Proposition 3.5 and Lemma 3.20, we may assume that $r_{0\beta}(6)=r_{\alpha\beta}(6)=0$ , as the disagreement string between the two rows has length at most three, outside the first two columns.

Claim (ii): There exists at most one index $i>2$ such that $\phi_{\gamma}(r_{0\beta}(i))\neq 0$ .

Proof of Claim (ii).

As the number of such indices must be odd it is enough to prove that not all $r_{0\beta}(3),r_{0\beta}(4),r_{0\beta}(5)$ are equal to $\alpha$ or $\beta$ . Not all can be equal to $\beta$ since, by by Lemma 3.24, that would contradict the fact that $r_{\alpha\beta}$ is a flow. Say $r_{0\beta}(3)=\beta$ and $r_{0\beta}(4)=r_{0\beta}(5)=\alpha$ . Then we have $r_{\alpha\beta}(3)=0$ by Lemma 3.24 and thus $r_{\alpha\beta}(4)+r_{\alpha\beta}(5)=\gamma$ . However, we may exclude $\{r_{\alpha\beta}(4),r_{\alpha\beta}(5)\}=\{\alpha,\beta\}$ by Lemma 3.20 and we may exclude $\{r_{\alpha\beta}(4),r_{\alpha\beta}(5)\}=\{0,\gamma\}$ by Lemma 3.24. ∎

To continue our proof, we need to introduce some terminology, which we will use only here. A column index $1\leq i\leq 6$ is of type:

(a)

if all elements of $G$ appear in the corresponding $i$ th column of $T_{0}$ (and of $T_{1}$ ); 2. (b)

if exactly three elements of $G$ appear in the $i$ th column; 3. (c)

if exactly two elements of $G$ appear in the $i$ th column; 4. (d)

if exactly one element of $G$ appears in the $i$ th column.

Step 0: We suppose that all columns are of type $(a)$ .

By Claim (ii), there exists one index $j>2$ such that $\phi_{\gamma}(r_{0\beta}(j))\neq 0$ . For $i>2$ , $i\neq j$ there must exist at least two rows $r_{i,1},r_{i,2}$ such that $r_{i,1}(i)=r_{0\beta}(i)+\alpha$ and $r_{i,2}(i)=r_{0\beta}(i)+\beta$ . Note that $r_{i,1}$ , $r_{i,2}$ are not the first row. Further, for the index $j$ there must exist one row $r_{j,1}$ different from the first one such that $r_{j,1}(j)=r_{0\beta}(j)+\alpha$ or $r_{j,1}(j)=r_{0\beta}(j)+\beta$ . All these rows are distinct by Claim (i). Hence, we obtain seven rows; we call them difference rows for $T_{0}$ . Note that the difference rows for $T_{0}$ may only have $\alpha$ and $\beta$ in columns $1,2$ by Lemma 3.23. Analogously, we obtain at least seven difference rows in $T_{1}$ , with copies of [math] or $\beta$ in columns $1$ and $2$ .

If there exist difference rows in $T_{0}$ and $T_{1}$ with $\beta\beta$ in the first two columns, then we obtain the string $\beta\beta 0$ in both tables and we conclude by induction on the degree of $B$ .

Thus suppose that there is no string $\beta\beta$ in columns $1,2$ of $T_{1}$ . It follows that there must be at least seven copies of [math] in columns $1,2$ in the difference rows of $T_{1}$ . Consequently, there are at least nine copies of [math] in columns $1,2$ in $T_{1}$ . By Lemma 3.22, there is no string $00$ in columns $1,2$ in $T_{0}$ , and the difference rows for $T_{0}$ do not have copies of [math] in columns $1,2$ . In conclusion, we have at least this amount of distinct rows in $T_{0}$ :

(i)

three, that are the first ones; 2. (ii)

seven, that are the difference rows; 3. (iii)

seven, that contain copies of [math] in column $1$ or $2$ ; 4. (iv)

two, that have $\gamma$ in column $1$ or $2$ .

Then, we have $\deg(B)>18$ . This is impossible for a Markov basis element by Proposition 3.25.

Step 1: We suppose that there exists exactly one column of type $(b)$ and all others are of type $(a)$ . We may proceed as before, however we obtain only six difference rows in the case when the column of type $(b)$ has column index $3\leq i\leq 6$ . In the case when the column index of the column of type $(b)$ is either $1$ or $2$ , we obtain seven difference rows, but we cannot assume that there exists an additional row with $\gamma$ in the same column index of the column of type $(b)$ . In either of these cases, we have $\deg(B)\geq 3+2\times 6+2=17$ , that contradicts Proposition 3.25.

Step 2: We suppose that there exist exactly two columns of type $(b)$ (resp. one column of type $(c)$ ). Here, we obtain five difference rows. However, $B$ represents a Markov element for $\tilde{P}^{\prime}$ (resp. $\tilde{P}$ ), whose ideals have regularity $14$ ; see Appendix 4 for the definition of the associated ideal. We obtain the bound $\deg B\geq 3+2\times 5+2=15>14$ which contradicts Proposition 3.25.

Step 3: We suppose there exist either:

(i)

three columns of type $(b)$ ), or 2. (ii)

one column of type $(b)$ or $(c)$ and one column of type $(c)$ , or 3. (iii)

one column of type $(d)$ .

In such cases we conclude by Lemma 3.26. ∎

3.3.4 Proof of the main case

In this last part, we finish our proof dealing with the main case ( $\star$ ‣ 3.3). This will be done uniformly, i.e., with the same arguments in all the three instances of the main case and only technical details differ. Here the number of leaves is $n\geq 7$ . The outline is as follows:

(i)

We show that, if $r_{0\beta}(i)=r_{\alpha\beta}(i)$ , then we have $r_{0\beta}(i)=r_{\alpha\beta}(i)=0$ ; 2. (ii)

Among the pairs of tables we consider (tables where we have fixed the first two entries of the rows $r_{0\beta}$ and $r_{\alpha\beta}$ and performed moves of degree at most four so that $r_{0\beta}$ and $r_{\alpha\beta}$ have the agreement string as large as possible) using at most moves of degree four, we attain the situation where the number of bad pairs, i.e., strings $xy$ , with $x,y\neq 0$ , in columns $n-1$ and $n$ is as small as possible; 3. (iii)

We show that we can kill all the bad pairs, i.e., we can make moves of degree at most four killing all of them. Summing up the two columns indexed by $n-1$ and $n$ allows us to conclude by induction on the number of leaves $n$ ; see Theorem 3.4.

We are now ready to establish the main case in the following lemmas.

Lemma 3.29.

We may assume that no rows in $T_{0}$ has the string $\alpha\alpha$ or $\beta\beta$ in columns $n-1$ and $n$ . Analogously, no row in $T_{1}$ has the string $\alpha\alpha$ or $\gamma\gamma$ in columns $n-1$ and $n$ .

Proof.

In such a case we make a quadratic move in columns $n-1$ and $n$ and we conclude by applying the Difference Lemma 3.17. ∎

Lemma 3.30.

We may assume that no row in $T_{0}$ has the string $\alpha\beta$ or $\beta\alpha$ in columns $n-1$ and $n$ . Analogously, no row in $T_{1}$ has the string $\alpha\gamma$ or $\gamma\alpha$ in columns $n-1$ and $n$ .

Proof.

Let $r$ be such a row with such a string in columns $n-1$ and $n$ of $T_{0}$ . If for some other column index $i>2$ , we have $r(i)\neq r_{0\beta}(i)$ then we may exchange $i$ and a nonempty subset of elements under the agreement string. Then we conclude by applying the Difference Lemma 3.17. As $r$ is a flow, we have $r(1)+r(2)=\alpha$ . This contradicts Lemma 3.23. ∎

Lemma 3.31.

We may assume that under the agreement string no row in $T_{0}$ has $\gamma\gamma$ . Analogously, no row in $T_{1}$ has $\beta\beta$ .

Proof.

Let $r$ be a row in $T_{0}$ with $\gamma\gamma$ under the agreement string. We first claim we may assume that $r_{\alpha\beta}$ does not have $\gamma$ in any column. For the sake of contradiction, suppose $r_{\alpha\beta}(i)=\gamma$ for some column index $i$ . Whence, by Lemma 3.24, we have $r_{0\beta}(i)=0$ . By compatibility of the tables $T_{0}$ and $T_{1}$ , there exists a row $r^{\prime}$ in $T_{0}$ with $r^{\prime}(i)=\gamma$ . By the Standard Lemma 3.19, we can make a move to obtain $r_{0\beta}(i)=\gamma$ and conclude by applying Lemma 3.20.

We divide the rest of the proof into two steps according to whether or not there exists $\beta$ in $r_{0\beta}$ .

Step 1: Suppose there exists another $\beta$ in $r_{0\beta}$ . The tables $T_{0}$ and $T_{1}$ are the following:

[TABLE]

By Lemma 3.23, the counting function $\alpha_{3}+\beta_{3}-0_{12}-\gamma_{12}$ is nonnegative on $T_{0}$ . Let $r$ be a row in $T_{1}$ , where the function is strictly positive. We now exclude the case $r(3)=\beta$ . Indeed, in this case, if $r(2)=\beta$ , then we obtain $0\beta\beta$ in both tables. If $r(2)=\alpha$ , by the positivity of the counting function on $r$ , we have $r(1)=\beta$ and we may perform a quadratic move to obtain the string $0\beta\beta$ . Whence $r(3)=\alpha$ .

Let $r^{\prime}$ be a row in $T_{0}$ with $r^{\prime}(3)=\alpha$ . By the Standard Lemma 3.19, we can make a move between $r^{\prime}$ and $r_{0\beta}$ involving this entry. In particular, if $r(1)+r(2)=\gamma$ , then we make a quadratic move between $r$ and $r_{\alpha\beta}$ on first two entries and conclude by Lemma 3.20. Thus $r(1)=r(2)=\beta$ . Let $r^{\prime\prime}$ be a row in $T_{1}$ with $r^{\prime\prime}(3)=\beta$ . We finish the proof of Step 1 by proving that we can always obtain $0\beta\beta$ in $T_{1}$ . First, suppose $r^{\prime\prime}(j)=\alpha$ for $j=1$ or $2$ . Then we may exchange $r^{\prime\prime}$ with $r$ on column indices $j$ and $3$ , obtaining a row $\tilde{r}$ such that $\tilde{r}(1)+\tilde{r}(2)=\gamma$ and $\tilde{r}(3)=\beta$ . Then we can make a quadratic move between $\tilde{r}$ and $r_{\alpha\beta}$ to obtain $0\beta\beta$ in both tables. Also, notice that $r^{\prime\prime}(2)\neq\beta$ as this immediately leads to $0\beta\beta$ in both tables. If $r^{\prime\prime}(1)=\gamma$ we may exchange $r^{\prime\prime}$ and $r_{\alpha\beta}$ on column indices $1$ and $3$ , obtaining $0\beta\beta$ in both tables. If $r^{\prime\prime}(1)=r^{\prime\prime}(2)=0$ , we can make a quadratic move between $r^{\prime\prime}$ and $r$ . Similarly, if $r^{\prime\prime}(1)=0$ and $r^{\prime\prime}(2)=\gamma$ we can make an exchange with $r_{\alpha\beta}$ . If $r^{\prime\prime}(1)=\beta$ and $r^{\prime\prime}(2)=0$ we first exchange it with $r_{\alpha\beta}$ on column indices $2,3$ , then we apply $\alpha 0\beta+\beta\beta\alpha=\beta 0\alpha+\alpha\beta\beta$ . Finally, if $r^{\prime\prime}(1)=\beta$ and $r^{\prime\prime}(2)=\gamma$ , we apply the cubic move

[TABLE]

Step 2: Suppose there is no $\beta$ in $r_{0\beta}$ ; without loss of generality we may assume we have $\alpha$ and $\gamma$ in columns $3,4$ . In column $3$ , in row $r_{\alpha\beta}$ of $T_{1}$ we cannot have $\alpha$ by Lemma 3.20; moreover, we cannot have $\beta$ by the Standard Lemma 3.19 applied to table $T_{0}$ , as we would produce $\beta$ in the row $r_{0\beta}$ , contradicting Lemma 3.20. Thus we have [math] in column $3$ in the row $r_{\alpha\beta}$ , since $\gamma$ is excluded in row $r_{\alpha\beta}$ by the claim in the very first part of the proof. Since the disagreement string has length at most three by Corollary 3.6, we have the following tables $T_{0}$ and $T_{1}$ :

[TABLE]

Furthermore $\{y,z\}=\{\alpha,\beta\}$ . By the disagreement string length, we have $x=0$ . Consider the group morphism $\phi_{\gamma}:G\rightarrow\mathbb{Z}_{2}$ and apply it to columns $3,4,5$ . Note that the evaluation of $r_{0\beta}$ under $\phi_{\gamma}$ in column indices $3,4,5$ is the $0/1$ vector $(1,0,0)$ . We claim that no row of $T_{0}$ can differ by more than one element with respect to $r_{0\beta}$ in column indices $3,4,5$ . Indeed, suppose a row $r$ in $T_{1}$ differs on $i,j\in\{3,4,5\}$ . Then $r(i)+r(j)-r_{0\beta}(i)-r_{0\beta}(j)\in\{0,\gamma\}$ . Thus, by the Standard Lemma 3.19, we can make a quadratic move on $i,j$ and conclude by Difference Lemma 3.17. By double counting, there must exist a row $r^{\prime}$ in $T_{1}$ such that $r^{\prime}(3)\in\{\alpha,\beta\}$ and $r^{\prime}(4),r^{\prime}(5)\in\{0,\gamma\}$ . By a quadratic move and the claim at the very first part of the proof, we may assume $r^{\prime}(4)=r^{\prime}(5)$ . Now we can make a quadratic move between $r^{\prime}$ and $r_{\alpha\beta}$ involving the entry in column $3$ and the entry in either column $4$ or $5$ . However, we may conclude as in the first part of Step 2. ∎

Lemma 3.32.

We may assume that under the agreement string no row in $T_{0}$ has $\alpha\gamma$ or $\gamma\alpha$ (resp. $\beta\gamma$ or $\gamma\beta$ ).

Proof.

Step 0: Assume that there exists $\beta$ in $r_{0\beta}$ and $\gamma$ in $r_{\alpha\beta}$ ; without loss of generality we may assume that they are in columns $3,4$ . In this case the tables are:

[TABLE]

Let $r$ be the row in $T_{0}$ that contains the string $\alpha\gamma$ (resp. $\beta\gamma$ ) under the agreement string. By Lemma 3.20 and Lemma 3.23, we see that $r(4)=0$ . Let $r^{\prime}$ be a row in $T_{0}$ such that $r^{\prime}(4)=\gamma$ . By Lemma 3.20, we can exclude $\gamma$ under the agreement string. Furthermore, performing a quadratic move, we notice that if $r^{\prime}$ has $00$ under the agreements string, we could reduce $\alpha\gamma$ (resp. $\beta\gamma$ ) to $\alpha 0$ (resp. $\beta 0$ ), contradicting the minimality of the number of bad pairs. Also, $r^{\prime}$ cannot have $0\beta$ or $\beta 0$ (resp. $0\gamma$ or $\gamma 0$ ) under the agreement string, as we could exchange it with $\alpha\gamma$ (resp. $\beta\gamma$ ) and conclude as before. Thus, under the agreement string, $r^{\prime}$ has either the string $0\alpha$ or $\alpha 0$ (resp. $0\beta$ or $\beta 0$ ). Now, Lemma 3.20 and Lemma 3.23 allow us to conclude that $r^{\prime}(3)=\beta$ . Hence, the counting function $\beta_{3}-\gamma_{4}$ is strictly positive in $T_{0}$ . Let $r^{\prime\prime}$ be a row in $T_{1}$ such that $r^{\prime\prime}(3)=\beta$ and $r^{\prime\prime}(4)\neq\gamma$ . By Lemma 3.20, we may exclude $\alpha$ in column $4$ in $r^{\prime\prime}$ . Consequently, by Lemma 3.23, $r^{\prime\prime}$ has either [math] or $\beta$ in column $2$ . If $r^{\prime\prime}(2)=\beta$ , we obtain the same string $0\beta\beta 0$ in both tables. If $r^{\prime\prime}(2)=0$ , we obtain the string $\alpha 0\beta\gamma 0$ in $T_{1}$ ; we now show we may also obtain it in $T_{0}$ . We discuss this according to the three crucial cases:

(i)

Case I and II: We apply the move $\alpha\alpha 00+0\beta\beta 000+\alpha 0+??\beta\gamma xy=\alpha 0\beta\gamma+\alpha\beta\beta 0xy+0\alpha+??0000$ , where $xy$ is under the agreement string and $x+y=\alpha$ . (resp. We consider the first two entries of $r^{\prime}$ , which by Lemma 3.23 could be: $\alpha\alpha$ , $\alpha\beta$ , $\beta\alpha$ , $\beta\beta$ . The last three allow to obtain $\alpha\beta 0\gamma$ in both tables. As $r^{\prime}$ must agree on all nonspecified entries with $r_{0\beta}$ this contradicts the fact that $r^{\prime}$ is a flow.); 2. (ii)

Case III: We apply the move $\alpha\alpha 00+0\beta\beta 000+\beta 0+??\beta\gamma xy=\alpha 0\beta\gamma+\beta\alpha\beta 0xy+0\beta+??0000$ where $x+y=\alpha$ . (resp. We proceed as before, noting that we do not use the third row, except for $\beta\alpha$ , in which case we obtain $\beta\alpha 0\gamma$ in both tables).

Step 1: Assume there exists $\beta$ in $r_{0\beta}$ and no $\gamma$ in $r_{\alpha\beta}$ . The tables are:

[TABLE]

As the disagreement string is of length at most three, we must have $x=y$ . Further, by Lemma 3.20 $x=y=0$ or $x=y=\gamma$ . Consider the group morphism $\phi_{\gamma}:G\rightarrow\mathbb{Z}_{2}$ . We claim that after applying $\phi_{\gamma}$ to column indices $3,4,5$ , no row can differ on more than one index from $\phi_{\gamma}((\beta,x,x))=(1,0,0)$ . Indeed, if a row $\tilde{r}$ differs on two indices $i,j$ , then, by the Difference Lemma 3.17, we may assume $r_{0\beta}(i)=\tilde{r}(i)+\alpha$ and $r_{0\beta}(j)=\tilde{r}(j)+\beta$ . The rows $r$ and $\tilde{r}$ must differ by $\alpha$ either in column index $i$ or $j$ , and by $\beta$ on the other. In particular, by reducing the number of bad pairs $\alpha\gamma$ (resp. $\beta\gamma$ ) under the agreement string, we exclude the situation when $\tilde{r}$ has $00$ under the agreement string. By the Difference Lemma 3.17, we also know that $\gamma$ does not appear in $\tilde{r}$ under the agreement string. In the same way, if $\alpha$ or $\beta$ appears under the agreement string, we may exchange it along with the index $i$ or $j$ , again contradicting Difference Lemma 3.17. By double counting, there exists a row $\tilde{r}^{\prime}$ in $T_{1}$ such that $\phi_{\gamma}((\tilde{r}^{\prime}(3,4,5)))=(1,0,0)$ . In particular, there exist two indices such that we can make a quadratic move between $\tilde{r}^{\prime}$ and $r_{\alpha\beta}$ . This either contradicts Lemma 3.24 or one decreases the Hamming distance.

Step 2: Assume there is no $\beta$ in $r_{0\beta}$ and there exists $\gamma$ in $r_{\alpha\beta}$ . The tables are:

[TABLE]

As before $x=y$ equals $\beta$ or [math]. Further $r(5)=0$ . We apply $\phi_{\beta}$ to column indices $3,4,5$ .

We claim we may assume that no row $\tilde{r}$ in $T_{0}$ differs from $\phi_{\beta}((\alpha,\gamma,0))=(1,1,0)$ on more than one index. For the sake of the contradiction, suppose there exists $\tilde{r}$ in $T_{0}$ differing on $i$ and $j$ . If $r_{0\beta}(i)-\tilde{r}(i)=r_{0\beta}(j)-\tilde{r}(j)$ , then we make a quadratic move between $r_{0\beta}$ and $\tilde{r}$ on $i,j$ . If the difference equals $\alpha$ , we conclude by the Difference Lemma 3.17. Thus we assume the difference equals $\gamma$ . If $5\in\{i,j\}$ we conclude by Lemma 3.20. Hence, $\{i,j\}=\{3,4\}$ ; on the other hand, this reduces the Hamming distance. Consequently we have $r_{0\beta}(i)-\tilde{r}(i)=\alpha$ and $r_{0\beta}(j)-\tilde{r}(j)=\gamma$ . Notice that we cannot have $r(i)-\tilde{r}(i)=r(j)-\tilde{r}(j)=\alpha$ , thus at least one difference must be equal to $\gamma$ . Hence, we exclude $00$ in $\tilde{r}$ under the agreement string, as then we could reduce the number of $\alpha\gamma$ (resp. $\beta\gamma$ ) under the agreement string. Further, $\alpha$ and $\beta$ also cannot appear under the agreement string, as otherwise we may conclude by the Difference Lemma 3.17. Whence $\tilde{r}$ has $0\gamma$ or $\gamma 0$ under the agreement string. By Lemma 3.20, we have $j\neq 5$ . Let $\tilde{r}^{\prime}$ be a row of $T_{0}$ with $\tilde{r}^{\prime}(5)=\gamma$ . As before, we conclude that $\tilde{r}^{\prime}$ has $\alpha 0$ or $0\alpha$ under the agreement string (resp. $0\beta$ or $\beta 0$ ), and $\tilde{r}^{\prime}(3)=\alpha$ , $\tilde{r}^{\prime}(4)=\gamma$ . We now exclude the case $i=5$ , i.e., $\tilde{r}(5)=\alpha$ . In such a case, we could exchange $r$ and $\tilde{r}$ on column $5$ and under the agreement string; then with $\tilde{r}^{\prime}$ on column indices $5$ and $j$ ; finally with $r_{0\beta}$ on column indices $5$ and the last entry to conclude by Lemma 3.20. (Resp. We apply the relation on $5$ and the agreement string $000+\alpha 0\gamma+\gamma 0\beta=\gamma 0\gamma+00\beta+\alpha 00$ .)

In conclusion, our discussion leads to $\{i,j\}=\{3,4\}$ and $\tilde{r}(5)=0$ . However, we may exchange $\tilde{r}$ with $\tilde{r}^{\prime}$ on $5$ and $j$ . Consequently we exchange with $r_{0\beta}$ on $5$ and under the agreement string to conclude by Lemma 3.20. This concludes the verification of our claim.

By the claim, there must exist a row $r^{\prime\prime}$ in $T_{1}$ , such that $\phi_{\beta}(r^{\prime\prime}((3,4,5)))=(1,1,0)$ . On two of these indices, $r^{\prime\prime}$ differs from $r_{\alpha\beta}$ by the same element: either $\alpha$ or $\gamma$ . We can make a quadratic move on these two column indices and conclude by Difference Lemma 3.17.

Step 3: Assume there is no $\beta$ in $r_{0\beta}$ and no $\gamma$ in $r_{\alpha\beta}$ . The tables are:

[TABLE]

Suppose $x=\beta$ . Let $\tilde{r}$ be a row of $T_{0}$ with $\tilde{r}(3)=\beta$ . As in the previous steps, we may assume that $\tilde{r}$ has $0\alpha$ or $\alpha 0$ (resp. $0\beta$ or $\beta 0$ ) under the agreement string and $\tilde{r}(i)=r_{0\beta}(i)$ for $4\leq i\leq n-3$ . By Lemma 3.23, we have $\tilde{r}(1,2)=\alpha\alpha$ or $\beta\beta$ (resp. $\tilde{r}(1,2)=\alpha\beta$ or $\beta\alpha$ ; we may obtain $0\beta\beta$ in both tables by the move: $\alpha\alpha 0+0\beta\alpha\gamma 00+(\alpha\beta/\beta\alpha)\beta\gamma 0\beta=0\beta\beta+(\alpha\beta/\beta\alpha)0\gamma 00+\alpha\alpha\alpha\gamma 0\beta$ ). However, $\beta\beta$ easily leads to $0\beta\beta$ in both tables by the cubic move $\alpha\alpha 0+0\beta\alpha+\beta\beta\beta=0\beta\beta+\beta\alpha 0+\alpha\beta\alpha$ in $T_{0}$ . Furthermore, we may assume that $\beta\beta$ does not appear on column indices $1,2$ in any row in $T_{0}$ , otherwise we would exchange with $\tilde{r}$ obtaining $\beta\beta\beta$ in columns $1,2,3$ . It follows that $\alpha_{12}-0_{3}-\beta_{3}$ is positive on $T_{0}$ . However, a positive row in $T_{1}$ contradicts Lemma 3.23.

Thus we may assume $x=0$ . Without loss of generality $\{y,z\}=\{\alpha,\beta\}$ . We apply the homomorphism $\phi_{\gamma}$ to column indices $3,4,5$ . We prove that no row may differ on two indices from $\phi_{\gamma}(r_{0\beta}(3,4,5))=(1,0,0)$ in $T_{0}$ . This is analogous to Step 1. Whence there exists a row $\tilde{r}$ in $T_{1}$ , such that $\phi_{\gamma}(\tilde{r}((3,4,5)))=(1,0,0)$ . We may assume $\tilde{r}(4)=\tilde{r}(5)$ , as otherwise we can make a quadratic move on column indices $4,5$ and conclude by previous steps. However, in such a case we may exchange $\tilde{r}$ with $r_{0\beta}$ (on column index $3$ and on column index either $4$ or $5$ ), conclude by Lemma 3.20 or reduce to the first part of this step, where we assume $x=\beta$ . ∎

Lemma 3.33.

We may assume that under the agreement string no row in $T_{1}$ has $\alpha\beta$ or $\beta\alpha$ (resp. $\beta\gamma$ or $\gamma\beta$ ).

Proof.

Let us act on tables $T_{0}$ , $T_{1}$ by the flow $(\alpha,\alpha,0,\dots,0)\in\mathfrak{G}$ and then apply the group automorphism $\beta\leftrightarrow\gamma$ . This translates Case I and Case III to Case III and Case I of Lemma 3.32 respectively; cf. Remark 3.21. However, Case II is not transformed to the previous cases, due to the rows $r_{\alpha 0}$ in $T_{0}$ and $r_{\beta\alpha}$ in $T_{1}$ . We note that in Steps 1, 2, and 3 of Lemma 3.32 we are only using the rows $r_{0\beta}$ in $T_{0}$ and $r_{\alpha\beta}$ in $T_{1}$ that still appear after translating Case II.

Thus, we only need to conclude in Case II and Step 0, i.e., there exists $\beta$ in $r_{0\beta}$ and $\gamma$ in $r_{\alpha\beta}$ . Without loss of generality, we may assume that they are in columns $3,4$ . The tables are:

[TABLE]

Let $r$ be the row in $T_{1}$ with a bad pair of the form $\alpha\gamma$ (resp. $\beta\gamma$ ). First we exclude $r(3)=\alpha,\beta,\gamma$ by quadratic exchange with $r_{\alpha\beta}$ , and the Difference Lemma 3.17 and Lemma 3.20. Let $\tilde{r}$ be the row in $T_{1}$ such that $\tilde{r}(3)=\beta$ . We note that if $\tilde{r}(n-1)=\tilde{r}(n)=0$ then, exchanging with $r$ we could reduce the number of bad pairs. Moreover, by Lemma 3.20 we know that $\tilde{r}(n-1),\tilde{r}(n)\neq\beta$ . Furthermore, as we already know that $r(3)$ must be equal to zero, we have $\tilde{r}(n-1)+\tilde{r}(n)\neq r(n-1)+r(n)$ . Thus, we must have $\{\tilde{r}(n-1),\tilde{r}(n)\}=\{0,\alpha\}$ (resp. $\{\tilde{r}(n-1),\tilde{r}(n)\}=\{0,\gamma\}$ ). Note that $\tilde{r}(2)=\beta$ gives $0\beta\beta$ in both tables, thus we may assume $\tilde{r}(2)=0$ , by Lemma 3.23. Moreover, we have $\tilde{r}(4)=\gamma$ and hence the counting function $\beta_{3}-\gamma_{4}$ is negative on $T_{1}$ . Let $r^{\prime}$ be the row in $T_{0}$ on which the function is negative, i.e., $r^{\prime}(4)=\gamma$ and $r^{\prime}(3)\neq\beta$ . Now, $r^{\prime}(3)\neq\alpha$ , as otherwise we exchange $r^{\prime}$ and $r_{0\beta}$ and conclude by Lemma 3.20. Thus, by Lemma 3.23, we have $r^{\prime}(1),r^{\prime}(2)\in\{\alpha,\beta\}$ . If $r^{\prime}(2)=\beta$ then we obtain $\alpha\beta 0\gamma$ in both tables, thus we may assume $r^{\prime}(2)=\alpha$ . We may obtain the flow $0\alpha\beta\gamma$ in $T_{0}$ , by exchanging $r^{\prime}$ and $r_{0\beta}$ , and $\alpha 0\beta\gamma$ in $T_{0}$ , by exchanging with $r_{\alpha 0}$ . We finish the proof by showing that we may obtain the latter in $T_{1}$ , by the quadratic move $\alpha\beta 0\gamma+?0\beta=\alpha 0\beta\gamma+?\beta 0$ . ∎

4 Appendix

We present known algebraic results for algebras over monoids that are cones over normal lattice polytopes. Much more information can be found in [5, 22, 36, 43, 46].

Let $M$ be a lattice and $P\subset\{1\}\times M\subset\mathbb{Z}\times M$ be a normal lattice polytope generating the ambient lattice. Let $C(P)\subset\mathbb{Z}\times M$ be the cone over $P$ . The cone $C(P)$ , equipped with addition, has a natural structure of a graded monoid, with the grading induced by the first coordinate. The algebraic properties of the graded algebra $\mathbb{C}[C(P)]$ are strongly related to combinatorial properties of $P$ .

Proposition 4.1.

The function $H_{P}:\mathbb{Z}_{\geq 0}\rightarrow\mathbb{Z}$ defined by $H_{P}(n)=|nP\cap\{n\}\times M|$ is a polynomial known as Ehrhart polynomial. For all $n\geq 0$ , it coincides with the Hilbert function (and hence with the Hilbert polynomial) of the algebra $\mathbb{C}[C(P)]$ . Moreover, it satisfies the Ehrhart reciprocity, i.e. $|H_{P}(-n)|=|\textnormal{int}(nP)\cap\{n\}\times M|$ for $n>0$ , where int denotes the interior points of the polytope.

We immediately see that the polynomial $H_{P}(n)$ may agree with the Hilbert function even for negative $n$ . This happens if and only if $H_{P}(n)=0$ , as the algebra is positively graded.

Definition 4.2 ( $a$ -invariant, Hilbert regularity).

The $a$ -invariant $a(A)$ of an algebra $A$ is the largest integer $a$ such that the Hilbert function differs from the Hilbert polynomial. Hilbert regularity equals the $a$ -invariant plus one.

Corollary 4.3.

The $a$ -invariant of $\mathbb{C}[C(P)]$ is always negative. It equals $-n$ for the smallest $n\in\mathbb{Z}_{>0}$ such that $nP$ contains an interior point.

Proposition 4.4.

If $\dim P=d$ , then $\sum_{j=0}^{\infty}H_{P}(j)t^{j}=h(t)/(1-t)^{d+1}$ for some polynomial $h$ . The $a$ -invariant of $\mathbb{C}[C(P)]$ equals $\deg h-d-1$ .

We note that $d+1-\deg h$ is the smallest dilation of $P$ that contains an interior lattice point.

Proposition 4.5 (Hochster’s Theorem).

The algebra $\mathbb{C}[C(P)]$ is Cohen-Macaulay.

Throughout the article we were interested in generators of the ideal $I$ such that $\mathbb{C}[C(P)]=\mathbb{C}[x_{p}:p\in P\cap M]/I=S/I$ . These are usually very hard to understand even for specific instances. However, there is an algebraic invariant that bounds their degree, known as Castelnuovo-Mumford regularity, or simply, the regularity.

Definition 4.6 (Castelnuovo-Mumford regularity).

For an $S$ -module $M$ its regularity $\textnormal{reg}(M)$ is defined as

[TABLE]

where

[TABLE]

is the minimal free resolution of $M$ .

As $I$ is an $S$ module, its regularity in particular bounds the degree of generators; this is the case $i=0$ in the definition. It can be seen that $\textnormal{reg}(\mathbb{C}[C(P)])$ is the maximal degree of standard monomials under rev-lex in generic coordinates. Hence $\textnormal{reg}(I)$ bounds the degree of such a Gröbner basis, as $\textnormal{reg}(S/I)+1=\textnormal{reg}(I)$ . The following proposition relates both notions of regularity introduced above.

Proposition 4.7.

$a(M)\leq\textnormal{reg}(M)-\textnormal{depth}(M)$ * and equality holds if $M$ is Cohen-Macaulay. In particular, $\textnormal{reg}(\mathbb{C}[C(P)])=\deg h$ and $I$ is generated in degree at most $1+\deg h$ .*

**Acknowledgements.

**Mateusz Michałek was supported by Polish National Science Centre grant no. 2015/19/D/ST1/01180, the Foundation for Polish Science (FNP) and is a member of AGATES group. The authors acknowledge the kind hospitality of UC Berkeley and FU Berlin, where this research was in part conducted.

Bibliography47

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Elizabeth S. Allman, Open Problem: Determine the Ideal Defining σ 4 ( ℙ 3 × ℙ 3 × ℙ 3 ) subscript 𝜎 4 superscript ℙ 3 superscript ℙ 3 superscript ℙ 3 \sigma_{4}(\mathbb{P}^{3}\times\mathbb{P}^{3}\times\mathbb{P}^{3}) , Available on-line (http://www.dms.uaf.edu/ ∼ similar-to \sim eallman/Papers/ salmon Prize.pdf), 2010.
2[2] Quentin Atkinson and Russell D. Gray, Curious Parallels and Curious Connections–Phylogenetic Thinking in Biology and Historical Linguistics , Systematic biology 54 (4) (2005): 513–526.
3[3] Adrian C. Barbrook et al., The Phylogeny of the Canterbury Tales , Nature 394 (6696)(1998), 839.
4[4] Louis J. Billera, Susan P. Holmes, and Karen Vogtmann, Geometry of the space of phylogenetic trees . Adv. in Appl. Math., 27 (4):733–767, 2001.
5[5] Winfried Bruns and Joseph Gubeladze, Polytopes, Rings, and K-Theory , Springer Monographs in Mathematics, Springer, 2009.
6[6] Weronika Buczyńska, Maria Donten-Bury, and Jarosław A. Wiśniewski, Isotropic models of evolution with symmetries , Contemporary Mathematics 496 (2009), 111–132.
7[7] Winfried Bruns, Richard Sieg, Tim Römer, and Christof Söger, Normaliz , http://www.home.uni-osnabrueck.de/wbruns/normaliz/ (2001).
8[8] Weronika Buczyńska and Jarosław A. Wiśniewski, On geometry of binary symmetric models of phylogenetic trees , J. Eur. Math. Soc. 9(3) (2007), 609–635.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Phylogenetic complexity of the Kimura 333-parameter model

Abstract

1 Introduction

Definition 1.1** (Phylogenetic complexity [41]).**

Main Theorem**.**

2 Preliminaries and notation

Definition 2.1** **(Group of flows).

Remark 2.2**.**

Example 2.3**.**

Remark 2.4**.**

Remark 2.5**.**

Definition 2.6** **(Hamming distance).

Remark 2.7** **(Tables and Hamming distance).

Example 2.8**.**

3 Complexity of the Kimura 333-parameter model

3.1 Main result and structure of the proof

Theorem 3.1**.**

Proposition 3.2**.**

Definition 3.3** **(Bad pairs).

Theorem 3.4**.**

Proof.

3.2 Reduction of Hamming distance ≥\geq≥ 3

Proposition 3.5**.**

Proof.

Corollary 3.6**.**

Proof.

Lemma 3.7**.**

Proof.

Lemma 3.8**.**

Proof.

Lemma 3.9**.**

Proof.

Lemma 3.10**.**

Proof.

Lemma 3.11**.**

Proof.

Proposition 3.12**.**

Proof.

Corollary 3.13**.**

3.3 The disagreement string αα\alpha\alphaαα

Remark 3.14**.**

Definition 3.15**.**

3.3.1 Reduction to the main case

Lemma 3.16**.**

Proof.

3.3.2 Preliminary Lemmas

Lemma 3.17** **(Difference Lemma).

Proof.

Remark 3.18**.**

Lemma 3.19** **(Standard Lemma).

Proof.

Lemma 3.20**.**

Proof.

Remark 3.21**.**

Lemma 3.22**.**

Proof.

Lemma 3.23**.**

Proof.

Lemma 3.24**.**

Proof.

3.3.3 The case of n=6n=6n=6 leaves

Proposition 3.25**.**

Proof.

Lemma 3.26**.**

Proof.

Remark 3.27**.**

Proposition 3.28**.**

Proof.

Proof of Claim (i).

Proof of Claim (ii).

3.3.4 Proof of the main case

Lemma 3.29**.**

Proof.

Lemma 3.30**.**

Phylogenetic complexity of the Kimura $3$ -parameter model

Definition 1.1 (Phylogenetic complexity [41]).

Main Theorem.

Definition 2.1 (Group of flows).

Remark 2.2.

Example 2.3.

Remark 2.4.

Remark 2.5.

Definition 2.6 (Hamming distance).

Remark 2.7 (Tables and Hamming distance).

Example 2.8.

3 Complexity of the Kimura $3$ -parameter model

Theorem 3.1.

Proposition 3.2.

Definition 3.3 (Bad pairs).

Theorem 3.4.

3.2 Reduction of Hamming distance $\geq$ 3

Proposition 3.5.

Corollary 3.6.

Lemma 3.7.

Lemma 3.8.

Lemma 3.9.

Lemma 3.10.

Lemma 3.11.

Proposition 3.12.

Corollary 3.13.

3.3 The disagreement string $\alpha\alpha$

Remark 3.14.

Definition 3.15.

Lemma 3.16.

Lemma 3.17 (Difference Lemma).

Remark 3.18.

Lemma 3.19 (Standard Lemma).

Lemma 3.20.

Remark 3.21.

Lemma 3.22.

Lemma 3.23.

Lemma 3.24.

3.3.3 The case of $n=6$ leaves

Proposition 3.25.

Lemma 3.26.

Remark 3.27.

Proposition 3.28.

Lemma 3.29.

Lemma 3.30.

Lemma 3.31.

Lemma 3.32.

Lemma 3.33.

Proposition 4.1.

Definition 4.2 ( $a$ -invariant, Hilbert regularity).

Corollary 4.3.

Proposition 4.4.

Proposition 4.5 (Hochster’s Theorem).

Definition 4.6 (Castelnuovo-Mumford regularity).

Proposition 4.7.