Algorithmic Search in Group Theory

Robert H. Gilman

arXiv:1812.08116·math.GR·December 20, 2018

Algorithmic Search in Group Theory

Robert H. Gilman

PDF

TL;DR

This paper introduces a Kolmogorov complexity-based random search method for group theory problems, demonstrating its theoretical effectiveness and practical heuristic approximations with experimental support.

Contribution

It proposes a novel search approach using Kolmogorov complexity for group theory, combining theoretical guarantees with heuristic methods and experimental validation.

Findings

01

The method is provably effective in theory.

02

Heuristic approximations perform well in practice.

03

Experimental evidence supports the approach's viability.

Abstract

A method of random search based on Kolmogorov complexity is proposed and applied to two search problems in group theory. The method is provably effective but not practical, so the applications involve heuristic approximations. Perhaps surprisingly, these approximations seem to work. Some experimental evidence is presented.

Equations18

l i m_{n \to \infty} \frac{∣ X \cap T _{n} ∣}{∣ T _{n} ∣} = 0

l i m_{n \to \infty} \frac{∣ X \cap T _{n} ∣}{∣ T _{n} ∣} = 0

n \to \infty lim inf \frac{∣ X \cap C _{n} ∣}{∣ C _{n} ∣} > 0.

n \to \infty lim inf \frac{∣ X \cap C _{n} ∣}{∣ C _{n} ∣} > 0.

\frac{∣ X \cap C _{n + c + c_{f}} ∣}{∣ C _{n + c + c_{f}} ∣} \geq \frac{∣Σ ∣ ^{n}}{∣Σ ∣ ^{n + c + c_{f} + 1}} = \frac{1}{∣Σ ∣ ^{c + c_{f} + 1}}

\frac{∣ X \cap C _{n + c + c_{f}} ∣}{∣ C _{n + c + c_{f}} ∣} \geq \frac{∣Σ ∣ ^{n}}{∣Σ ∣ ^{n + c + c_{f} + 1}} = \frac{1}{∣Σ ∣ ^{c + c_{f} + 1}}

p v = f ab, b B, a B, AA, g bb, B A, B B, A B, ab AA

p v = f ab, b B, a B, AA, g bb, B A, B B, A B, ab AA

\begin{array}[]{r|ccccccccc}\Sigma^{*}&\varepsilon,&0&1&00&01&10&11&000&\cdots\\ \hline\cr N&1&2&3&4&5&6&7&8&\cdots\end{array}

\begin{array}[]{r|ccccccccc}\Sigma^{*}&\varepsilon,&0&1&00&01&10&11&000&\cdots\\ \hline\cr N&1&2&3&4&5&6&7&8&\cdots\end{array}

\begin{array}[]{r|ccccccccc}N&1&2&3&4&5&6&7&8&\cdots\\ \hline\cr N\times N&(1,1)&(1,2)&(2,1)&(1,3)&(2,2)&(3,1)&(1,4)&(2,3)&\cdots\end{array}

\begin{array}[]{r|ccccccccc}N&1&2&3&4&5&6&7&8&\cdots\\ \hline\cr N\times N&(1,1)&(1,2)&(2,1)&(1,3)&(2,2)&(3,1)&(1,4)&(2,3)&\cdots\end{array}

p 8, 2, 3, 1; q 6, 7, 4, 2; 15

p 8, 2, 3, 1; q 6, 7, 4, 2; 15

(8 x^{3} + 2 x^{2} + 3 x + 1) \circ (6 x^{3} + 7 x^{2} + 4 x + 2) (15) = 83879080636024

(8 x^{3} + 2 x^{2} + 3 x + 1) \circ (6 x^{3} + 7 x^{2} + 4 x + 2) (15) = 83879080636024

(1, 7, 8, 11, 10, 6, 2, 3, 4, 9, 5) (1, 2, 6, 5, 7, 3, 4, 10, 11, 9, 8) .

(1, 7, 8, 11, 10, 6, 2, 3, 4, 9, 5) (1, 2, 6, 5, 7, 3, 4, 10, 11, 9, 8) .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Algorithmic Search in Group Theory

Dedicated to the memory of Charles Sims

Robert Gilman

Robert Gilman The author thanks the Hausdorff Institute of Mathematics, the University of Newcastle and the University of Warwick for their hospitality while this paper was being written.

Abstract

A method of random search based on Kolmogorov complexity is proposed and applied to two search problems in group theory. The method is provably effective but not practical, so the applications involve heuristic approximations. Perhaps surprisingly, these approximations seem to work. Some experimental evidence is presented.

1 Introduction

One of Charlie Sims’ substantial contributions to mathematics is his invention of a base and strong generating set for finite permutation groups. This invention played a crucial role in proving the existence of several sporadic finite simple groups and is the foundation of most permutation group algorithms in use today, including those used here.

The origin of this paper is a conversation some years ago between Colva Roney-Dougal and the author about the difficulty of generating random permutation groups with which to test the efficacy of various permutation group algorithms. A common way of sampling random subgroups is to choose generators at random from the ambient group. This approach fails for permutation groups because, as is well known, a random pair of permutations from the symmetric group of degree $n$ generates the symmetric or alternating group of degree $n$ with probability about $1-1/n$ . We propose a different method of search which seems to do better.

We recast the above search problem in the following general form. Given an infinite decidable subset $Y$ of a computably enumerable set $T$ , search $T$ for multiple instances of $Y$ . A common strategy is to decompose $T$ in some convenient way as a union of finite subsets $T=\cup T_{n}$ , choose $n$ large, and test random elements of $T_{n}$ for membership in $Y$ . However the search will be hard if instances of $X$ are rare, in particular if $X$ has asymptotic density 0 with respect to the decomposition of $T$ :

[TABLE]

where $|\cdot|$ denotes cardinality.

This obstacle can be avoided by choosing the decomposition $\{T_{n}\}$ in a special way. Let $\Sigma^{*}$ be the set of all words over a finite alphabet, $\Sigma$ , with at least two letters; and define $C_{n}$ to be the finite set of all words $w$ whose Kolmogorov complexity, $C(w)$ , is at most $n$ .

Theorem 1.

If $X\subset\Sigma^{*}$ is an infinite decidable subset, then

[TABLE]

in other words $X$ has positive lower asymptotic density with respect to the decomposition $\Sigma^{*}=\cup C_{n}$ .

Theorem 1 is proved in Section 2.2.

Corollary 2.

Let $\iota:\Sigma^{*}\to T$ be a computable bijection. If $Y\subset T$ is an infinite decidable subset, then $Y$ has positive lower asymptotic density with respect to the decomposition $T=\cup\iota(C_{n})$ .

Corollary 2, which follows from Theorem 1 with $X=\iota^{-1}(Y)$ , shows that searching for elements of $X$ by choosing words $w$ uniformly at random from $C_{n}$ and testing $\iota(w)$ for membership in $X$ succeeds with probability bounded away from [math] for large enough $n$ . We call this method algorithmic search.

Unfortunately the sets $C_{n}$ are intractable. If we could decide membership in $C_{n}$ , then we could decide membership in $C_{n}-C_{n-1}$ and thereby compute $C(w)$ , which is uncomputable [6, Theorem 2.3.2]. Even more to the point there is no computable upper bound on the size of the largest word in $C_{n}$ as a function of $n$ [6, Theorem 2.3.1]. Restricting ourselves to resource bounded complexity [6, Chapter 7] resolves the computability issues, but does not help much when it comes to practical computation. Instead in Section 3 we use Theorem 1 as a pattern for heuristic searches.

2 Algorithmic search

We require a few elementary results from the theory of Kolmogorov complexity. Since applications of Kolmogorov compexity to group theory are rare (we know of only [2, 4, 8]), we sketch proofs. For a more complete introduction to the theory, the reader is referred to [6] and [10].

2.1 Kolmogorov complexity

As before fix an alphabet $\Sigma$ with $|\Sigma|\geq 2$ , and let $\Sigma^{*}$ denote the free monoid of all words over $\Sigma$ . It is customary to use $\Sigma=\{0,1\}$ , but larger alphabets are convenient when working with finitely generated groups. $\Sigma^{n}$ is the set of words of length $n$ , and $\Sigma^{\leq n}$ is the set of words of length at most $n$ . $\varepsilon$ is the empty word.

The Kolmogorov complexity of a word $w\in\Sigma^{*}$ is the length of the shortest description of $w$ . Since there relatively few short descriptions, most words are in incompressible; that is, their complexity is not much less than their length. Incompressibility can be taken as a definition of randomness.

Descriptions of words are constructed from computer programs. Let $\mathcal{L}$ be a Turing complete programming language over an alphabet $\Delta$ . We want to use programs in $\mathcal{L}$ to compute functions $\Sigma^{*}\to\Sigma^{*}$ , so it is natural to assume $\Sigma\subset\Delta$ . Programs in $\mathcal{L}$ can be coded as words over $\Sigma$ in the following way: code the letters of $\Delta$ as words of some fixed length $\ell$ over $\Sigma$ , and choose $\ell$ large enough to allow an extra reserved word of length $\ell$ which signals the end of a program. Programs become $\ell$ times larger than before, but that does not bother us. The important point is that they are words over $\Sigma$ with the reserved word as a suffix, and they can be easily decoded into their original form.

Definition 3.

A description of $w\in\Sigma^{*}$ is a word in $\Sigma^{*}$ of the form $pv$ where $p$ is a program, $v$ is a word, and $p$ with input $v$ computes $w$ . The length of a shortest description of $w$ is $C(w)$ , the Kolmogorov complexity of $w$ .

Our conditions imply that an arbitrary word can be written as a description $pv$ in at most one way.

Theorem 4.

The following conditions hold.

$C(w)\leq|w|+c$ * for some constant $c$ .* 2. 2.

There are at most $|\Sigma|^{n}$ words $w$ with $C(w)=n$ and at most $|\Sigma|^{n+1}$ words with $C(w)\leq n$ . 3. 3.

If $f:\Sigma^{*}\to\Sigma^{*}$ is a computable function, then for any word $w$ , $C(f(w))\leq C(w)+c_{f}$ where $c_{f}$ is a constant depending on $f$ but not on $w$ .

Proof.

Let $p$ be a program which outputs its input and halts. For every word $w$ , $pw$ is a description of $w$ . Thus the first assertion holds with $c=|p|$ . For the second part observe that since descriptions are words over $\Sigma$ and each word is a description in at most one way, there are at most $|\Sigma|^{n}$ descriptions of length $n$ . Finally let $q$ be a program which computes $f$ , and let $pv$ be a shortest description of $w$ . Programs $p$ and $q$ can be combined with some overhead into a program of length at most $|p|+c_{f}$ which computes $f(w)$ and halts. ∎

2.2 Proof of Theorem 1

Proof.

By hypothesis there is a computable bijection $f:\Sigma^{*}\to X$ . It follows from Theorem 4 that $\Sigma^{n}\subset C_{n+c}$ . Likewise $f(C_{n+c})\subset X\cap C_{n+c+c_{f}}$ and $|C_{n+c+c_{f}}|\leq|\Sigma|^{n+c+c_{f}+1}$ . Consequently

[TABLE]

∎

3 Implementation

The algorithmic search method described in Section 2 is impractical because, as we know from Section 1, the sets $C_{n}$ are intractable. This section is devoted to two heuristic variations, which we also call algorithmic search. Instead of choosing random words in $C_{n}$ , we choose random short descriptions, and to facilitate this choice we restrict the programs allowed in descriptions.

Our heuristic variations are preliminary; ease of programming was a primary consideration. Nevertheless the results seem encouraging. Computations were done with the Magma system [1]. Figure 2 required 40 hours of CPU time on a decent laptop.

3.1 Finitely generated groups

Let $\Sigma^{*}\to G$ be a choice of semigroup generators for the infinite group $G$ , and suppose we wish to choose random elements of $G$ . In the case of finite groups the product replacement algorithm effectively approximates the uniform distribution on the group [9], but there is no uniform distribution on an infinite group. In practice it seems reasonable that $\overline{w}$ , the image in $G$ of $w\in\Sigma^{*}$ , is close to random if for some large $n$ , $w$ is chosen at uniformly at random from $\Sigma^{\leq n}$ . But then sets $X\subset\Sigma^{*}$ of asymptotic density [math] with respect to the decomposition $\Sigma^{*}=\cup\Sigma^{\leq n}$ are invisible. In particular $\overline{w}$ is never equal to $1$ in $G$ [3, Theorem 5.7]. The disadvantage for, say, debugging and testing algorithms is obvious. Algorithmic search seems to do better.

For the algorithmic search used to produce Figure 1, descriptions have the form $pv$ as before, but $p$ is defined to be a sequence of monoid homomorphisms. Each homomorphism is given by listing the images of the letters in $\Sigma$ under that homomorphism. The word described by $pv$ is the image of $v$ under the composition of the homomorphisms. Our semigroup generators are $\Sigma=\{a,A,b,B\}$ where were are writing $A,B$ in place of the customary formal inverses $a^{-1},b^{-1}$ . For example the the description

[TABLE]

describes the word $w=f\circ g(abAA)=aBaBaBaBaBbBaBaB$ . Here we are not adhering strictly to the format from Section 2.1.

$C^{\prime}_{d,c,M}$ denotes the set of all descriptions $pv$ with $d$ homomorphisms, each specified by a tuple of words of length $c$ , and with $|v|\leq M$ . Algorithmic search is performed by choosing random descriptions from $C^{\prime}_{d,c,M}$ for various choices of the parameters, computing the words described, and testing them to see if they define the identity in the two groups from Figure 1.

3.2 Permutation groups

As mentioned above, pairs of permutations chosen at random from the symmetric group $S_{n}$ are unlikely to generate anything but $S_{n}$ or $A_{n}$ . More precisely we have the following theorem.

Theorem 5 ([7]).

Two random permutations in $S_{n}$ generate a subgroup other than $S_{n}$ or $A_{n}$ with probability at most $\frac{1}{n}+\frac{8.8}{n^{2}}$ .

In order to apply algorithmic search to the permutation group search problem from Section 1 we reformulate that problem slightly. $S_{\omega}$ is the group of all permutations of $N=\{1,2,\ldots\}$ with finite support. $S_{n}$ acts on $\{1,\ldots,n\}$ in the usual way, and fixes all other elements of $N$ . $\Sigma=\{0,1\}$ and $\iota:\Sigma^{*}\to S_{\omega}\times S_{\omega}$ is a computable bijection as in Corollary 2. $Y\subset S_{\omega}\times S_{\omega}$ is the collections of all of pairs of permutations which do not generate any $S_{n}$ listed above or its alternating group.

In the original formulation of the search problem, a pair of permutations from $S_{n}$ is ruled out if it generates $S_{n}$ or $A_{n}$ . In the revision a pair from $S_{\omega}$ is ruled out if it generates any $S_{n}$ listed above or its alternating group. The bound of Theorem 5 still applies.

The map $\iota:\Sigma^{*}\to S_{\omega}\times S_{\omega}$ is a composition of bijections $\Sigma^{*}\to N\to N\times N\to S_{\omega}\times S_{\omega}$ . $\Sigma^{*}\to N$ is the correspondence

[TABLE]

while $N\to N\times N$ is a well known way of enumerating $N\times N$

[TABLE]

and $N\times N\to S_{\omega}\times S_{\omega}$ , is constructed from a standard enumeration of all permutations [5, Section 7.2.1.2].

Algorithmic search in this case resembles that of the preceding section except that instead of descriptions based on monoid homomorphisms from $\Sigma^{*}$ to $\Sigma^{*}$ we employ descriptions based on polynomial functions from $N$ to $N$ . The polynomials have non-negative integer coefficients. For example the description

[TABLE]

with 2 degree 3 polynomials describes the integer

[TABLE]

which gets mapped to the pair of permutations

[TABLE]

Figure 2 shows results obtained by selecting 1,000,000 descriptions uniformly at random from the set of all descriptions with 7 degree 2 polynomials, $ax^{2}+bx+c$ , satisfying $1\leq a\leq 20$ , $0\leq b,c\leq 20$ , and with $|v|\leq 1000$ . It appears that $S_{n}$ and $An$ are avoided about 10% of the time and solvable permutation groups are obtained about $.1\%$ of the time. Whether or not results like these are useful for the permutation group search problem is not clear. In any case the method can probably be refined.

Bibliography10

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Wieb Bosma, John Cannon, and Catherine Playoust. The Magma algebra system. I. The user language. J. Symbolic Comput. , 24(3-4):235–265, 1997. Computational algebra and number theory (London, 1993).
2[2] R. I. Grigorchuk. A relationship between algorithmic problems and entropy characteristics of groups. Dokl. Akad. Nauk SSSR , 284(1):24–29, 1985.
3[3] Ilya Kapovich, Alexei Myasnikov, Paul Schupp, and Vladimir Shpilrain. Generic-case complexity, decision problems in group theory, and random walks. J. Algebra , 264(2):665–694, 2003.
4[4] Ilya Kapovich and Paul Schupp. Delzant’s T 𝑇 T -invariant, Kolmogorov complexity and one-relator groups. Comment. Math. Helv. , 80(4):911–933, 2005.
5[5] Donald E. Knuth. The art of computer programming. Vol. 4A. Combinatorial algorithms. Part 1 . Addison-Wesley, Upper Saddle River, NJ, 2011.
6[6] Ming Li and Paul Vitányi. An introduction to Kolmogorov complexity and its applications . Texts in Computer Science. Springer, New York, third edition, 2008.
7[7] Luke Morgan and Colva M. Roney-Dougal. A note on the probability of generating alternating or symmetric groups. Arch. Math. (Basel) , 105(3):201–204, 2015.
8[8] André Nies and Katrin Tent. Describing finite groups by short first-order sentences. Israel J. Math. , 221(1):85–115, 2017.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Algorithmic Search in Group Theory

Abstract

1 Introduction

Theorem 1**.**

Corollary 2**.**

2 Algorithmic search

2.1 Kolmogorov complexity

Definition 3**.**

Theorem 4**.**

Proof.

2.2 Proof of Theorem 1

Proof.

3 Implementation

3.1 Finitely generated groups

3.2 Permutation groups

Theorem 5** ([7]).**

Theorem 1.

Corollary 2.

Definition 3.

Theorem 4.

Theorem 5 ([7]).