Hashing Modulo Context-Sensitive $\alpha$-Equivalence
Lasse Blaauwbroek, Miroslav Ol\v{s}\'ak, Herman Geuvers

TL;DR
This paper introduces a new context-sensitive notion of $oldsymbol{ extalpha}$-equivalence for $oldsymbol{ extlambda}$-terms, along with an efficient hashing scheme that improves subterm identification and sharing, with applications in large-scale mathematical knowledge graphs.
Contribution
It formalizes context-sensitive $oldsymbol{ extalpha}$-equivalence, proves its equivalence to bisimulation, and develops an $O(n extlog n)$ hashing algorithm that outperforms previous methods.
Findings
The hashing scheme correctly identifies $oldsymbol{ extlambda}$-terms modulo context-sensitive $oldsymbol{ extalpha}$-equivalence.
The algorithm operates in $O(n extlog n)$ time, improving upon previous bounds.
Application to Coq proof assistant data demonstrates practical utility in large-scale knowledge graph construction.
Abstract
The notion of -equivalence between -terms is commonly used to identify terms that are considered equal. However, due to the primitive treatment of free variables, this notion falls short when comparing subterms occurring within a larger context. Depending on the usage of the Barendregt convention (choosing different variable names for all involved binders), it will equate either too few or too many subterms. We introduce a formal notion of context-sensitive -equivalence, where two open terms can be compared within a context that resolves their free variables. We show that this equivalence coincides exactly with the notion of bisimulation equivalence. Furthermore, we present an efficient runtime hashing scheme that identifies -terms modulo context-sensitive -equivalence, generalizing over traditional bisimulation partitioning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · semigroups and automata theory · Algorithms and Data Compression
