Learning to Unknot
Sergei Gukov, James Halverson, Fabian Ruehle, Piotr Su{\l}kowski

TL;DR
This paper applies NLP techniques and deep learning models to knot theory, specifically for the UNKNOT problem, and introduces reinforcement learning to find unknotting sequences, revealing new insights into knot simplification.
Contribution
It introduces a novel approach combining NLP, deep learning, and reinforcement learning to study and solve the UNKNOT problem using braid representations.
Findings
Transformer architectures outperform fully-connected networks.
Accuracy increases with braid word length.
Reinforcement learning effectively finds unknotting sequences.
Abstract
We introduce natural language processing into the study of knot theory, as made natural by the braid word representation of knots. We study the UNKNOT problem of determining whether or not a given knot is the unknot. After describing an algorithm to randomly generate -crossing braids and their knot closures and discussing the induced prior on the distribution of knots, we apply binary classification to the UNKNOT decision problem. We find that the Reformer and shared-QK Transformer network architectures outperform fully-connected networks, though all perform well. Perhaps surprisingly, we find that accuracy increases with the length of the braid word, and that the networks learn a direct correlation between the confidence of their predictions and the degree of the Jones polynomial. Finally, we utilize reinforcement learning (RL) to find sequences of Markov moves and braid relations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometric and Algebraic Topology · Biochemical and Structural Characterization · Artificial Intelligence in Games
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · 1x1 Convolution · Convolution · Multi-Head Attention · Attention Is All You Need · Layer Normalization · Adafactor · Locality Sensitive Hashing Attention
