
TL;DR
This paper introduces a Kolmogorov complexity-based random search method for group theory problems, demonstrating its theoretical effectiveness and practical heuristic approximations with experimental support.
Contribution
It proposes a novel search approach using Kolmogorov complexity for group theory, combining theoretical guarantees with heuristic methods and experimental validation.
Findings
The method is provably effective in theory.
Heuristic approximations perform well in practice.
Experimental evidence supports the approach's viability.
Abstract
A method of random search based on Kolmogorov complexity is proposed and applied to two search problems in group theory. The method is provably effective but not practical, so the applications involve heuristic approximations. Perhaps surprisingly, these approximations seem to work. Some experimental evidence is presented.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Algorithmic Search in Group Theory
Dedicated to the memory of Charles Sims
Robert Gilman
Robert Gilman The author thanks the Hausdorff Institute of Mathematics, the University of Newcastle and the University of Warwick for their hospitality while this paper was being written.
Abstract
A method of random search based on Kolmogorov complexity is proposed and applied to two search problems in group theory. The method is provably effective but not practical, so the applications involve heuristic approximations. Perhaps surprisingly, these approximations seem to work. Some experimental evidence is presented.
1 Introduction
One of Charlie Sims’ substantial contributions to mathematics is his invention of a base and strong generating set for finite permutation groups. This invention played a crucial role in proving the existence of several sporadic finite simple groups and is the foundation of most permutation group algorithms in use today, including those used here.
The origin of this paper is a conversation some years ago between Colva Roney-Dougal and the author about the difficulty of generating random permutation groups with which to test the efficacy of various permutation group algorithms. A common way of sampling random subgroups is to choose generators at random from the ambient group. This approach fails for permutation groups because, as is well known, a random pair of permutations from the symmetric group of degree generates the symmetric or alternating group of degree with probability about . We propose a different method of search which seems to do better.
We recast the above search problem in the following general form. Given an infinite decidable subset of a computably enumerable set , search for multiple instances of . A common strategy is to decompose in some convenient way as a union of finite subsets , choose large, and test random elements of for membership in . However the search will be hard if instances of are rare, in particular if has asymptotic density 0 with respect to the decomposition of :
[TABLE]
where denotes cardinality.
This obstacle can be avoided by choosing the decomposition in a special way. Let be the set of all words over a finite alphabet, , with at least two letters; and define to be the finite set of all words whose Kolmogorov complexity, , is at most .
Theorem 1**.**
If is an infinite decidable subset, then
[TABLE]
in other words has positive lower asymptotic density with respect to the decomposition .
Theorem 1 is proved in Section 2.2.
Corollary 2**.**
Let be a computable bijection. If is an infinite decidable subset, then has positive lower asymptotic density with respect to the decomposition .
Corollary 2, which follows from Theorem 1 with , shows that searching for elements of by choosing words uniformly at random from and testing for membership in succeeds with probability bounded away from [math] for large enough . We call this method algorithmic search.
Unfortunately the sets are intractable. If we could decide membership in , then we could decide membership in and thereby compute , which is uncomputable [6, Theorem 2.3.2]. Even more to the point there is no computable upper bound on the size of the largest word in as a function of [6, Theorem 2.3.1]. Restricting ourselves to resource bounded complexity [6, Chapter 7] resolves the computability issues, but does not help much when it comes to practical computation. Instead in Section 3 we use Theorem 1 as a pattern for heuristic searches.
2 Algorithmic search
We require a few elementary results from the theory of Kolmogorov complexity. Since applications of Kolmogorov compexity to group theory are rare (we know of only [2, 4, 8]), we sketch proofs. For a more complete introduction to the theory, the reader is referred to [6] and [10].
2.1 Kolmogorov complexity
As before fix an alphabet with , and let denote the free monoid of all words over . It is customary to use , but larger alphabets are convenient when working with finitely generated groups. is the set of words of length , and is the set of words of length at most . is the empty word.
The Kolmogorov complexity of a word is the length of the shortest description of . Since there relatively few short descriptions, most words are in incompressible; that is, their complexity is not much less than their length. Incompressibility can be taken as a definition of randomness.
Descriptions of words are constructed from computer programs. Let be a Turing complete programming language over an alphabet . We want to use programs in to compute functions , so it is natural to assume . Programs in can be coded as words over in the following way: code the letters of as words of some fixed length over , and choose large enough to allow an extra reserved word of length which signals the end of a program. Programs become times larger than before, but that does not bother us. The important point is that they are words over with the reserved word as a suffix, and they can be easily decoded into their original form.
Definition 3**.**
A description of is a word in of the form where is a program, is a word, and with input computes . The length of a shortest description of is , the Kolmogorov complexity of .
Our conditions imply that an arbitrary word can be written as a description in at most one way.
Theorem 4**.**
The following conditions hold.
* for some constant .* 2. 2.
There are at most words with and at most words with . 3. 3.
If is a computable function, then for any word , where is a constant depending on but not on .
Proof.
Let be a program which outputs its input and halts. For every word , is a description of . Thus the first assertion holds with . For the second part observe that since descriptions are words over and each word is a description in at most one way, there are at most descriptions of length . Finally let be a program which computes , and let be a shortest description of . Programs and can be combined with some overhead into a program of length at most which computes and halts. ∎
2.2 Proof of Theorem 1
Proof.
By hypothesis there is a computable bijection . It follows from Theorem 4 that . Likewise and . Consequently
[TABLE]
∎
3 Implementation
The algorithmic search method described in Section 2 is impractical because, as we know from Section 1, the sets are intractable. This section is devoted to two heuristic variations, which we also call algorithmic search. Instead of choosing random words in , we choose random short descriptions, and to facilitate this choice we restrict the programs allowed in descriptions.
Our heuristic variations are preliminary; ease of programming was a primary consideration. Nevertheless the results seem encouraging. Computations were done with the Magma system [1]. Figure 2 required 40 hours of CPU time on a decent laptop.
3.1 Finitely generated groups
Let be a choice of semigroup generators for the infinite group , and suppose we wish to choose random elements of . In the case of finite groups the product replacement algorithm effectively approximates the uniform distribution on the group [9], but there is no uniform distribution on an infinite group. In practice it seems reasonable that , the image in of , is close to random if for some large , is chosen at uniformly at random from . But then sets of asymptotic density [math] with respect to the decomposition are invisible. In particular is never equal to in [3, Theorem 5.7]. The disadvantage for, say, debugging and testing algorithms is obvious. Algorithmic search seems to do better.
For the algorithmic search used to produce Figure 1, descriptions have the form as before, but is defined to be a sequence of monoid homomorphisms. Each homomorphism is given by listing the images of the letters in under that homomorphism. The word described by is the image of under the composition of the homomorphisms. Our semigroup generators are where were are writing in place of the customary formal inverses . For example the the description
[TABLE]
describes the word . Here we are not adhering strictly to the format from Section 2.1.
denotes the set of all descriptions with homomorphisms, each specified by a tuple of words of length , and with . Algorithmic search is performed by choosing random descriptions from for various choices of the parameters, computing the words described, and testing them to see if they define the identity in the two groups from Figure 1.
3.2 Permutation groups
As mentioned above, pairs of permutations chosen at random from the symmetric group are unlikely to generate anything but or . More precisely we have the following theorem.
Theorem 5** ([7]).**
Two random permutations in generate a subgroup other than or with probability at most .
In order to apply algorithmic search to the permutation group search problem from Section 1 we reformulate that problem slightly. is the group of all permutations of with finite support. acts on in the usual way, and fixes all other elements of . and is a computable bijection as in Corollary 2. is the collections of all of pairs of permutations which do not generate any listed above or its alternating group.
In the original formulation of the search problem, a pair of permutations from is ruled out if it generates or . In the revision a pair from is ruled out if it generates any listed above or its alternating group. The bound of Theorem 5 still applies.
The map is a composition of bijections . is the correspondence
[TABLE]
while is a well known way of enumerating
[TABLE]
and , is constructed from a standard enumeration of all permutations [5, Section 7.2.1.2].
Algorithmic search in this case resembles that of the preceding section except that instead of descriptions based on monoid homomorphisms from to we employ descriptions based on polynomial functions from to . The polynomials have non-negative integer coefficients. For example the description
[TABLE]
with 2 degree 3 polynomials describes the integer
[TABLE]
which gets mapped to the pair of permutations
[TABLE]
Figure 2 shows results obtained by selecting 1,000,000 descriptions uniformly at random from the set of all descriptions with 7 degree 2 polynomials, , satisfying , , and with . It appears that and are avoided about 10% of the time and solvable permutation groups are obtained about of the time. Whether or not results like these are useful for the permutation group search problem is not clear. In any case the method can probably be refined.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Wieb Bosma, John Cannon, and Catherine Playoust. The Magma algebra system. I. The user language. J. Symbolic Comput. , 24(3-4):235–265, 1997. Computational algebra and number theory (London, 1993).
- 2[2] R. I. Grigorchuk. A relationship between algorithmic problems and entropy characteristics of groups. Dokl. Akad. Nauk SSSR , 284(1):24–29, 1985.
- 3[3] Ilya Kapovich, Alexei Myasnikov, Paul Schupp, and Vladimir Shpilrain. Generic-case complexity, decision problems in group theory, and random walks. J. Algebra , 264(2):665–694, 2003.
- 4[4] Ilya Kapovich and Paul Schupp. Delzant’s T 𝑇 T -invariant, Kolmogorov complexity and one-relator groups. Comment. Math. Helv. , 80(4):911–933, 2005.
- 5[5] Donald E. Knuth. The art of computer programming. Vol. 4A. Combinatorial algorithms. Part 1 . Addison-Wesley, Upper Saddle River, NJ, 2011.
- 6[6] Ming Li and Paul Vitányi. An introduction to Kolmogorov complexity and its applications . Texts in Computer Science. Springer, New York, third edition, 2008.
- 7[7] Luke Morgan and Colva M. Roney-Dougal. A note on the probability of generating alternating or symmetric groups. Arch. Math. (Basel) , 105(3):201–204, 2015.
- 8[8] André Nies and Katrin Tent. Describing finite groups by short first-order sentences. Israel J. Math. , 221(1):85–115, 2017.
