Instruction set for the representation of graphs
Ezequiel Lopez-Rubio, Mario Pascual-Gonzalez

TL;DR
The paper introduces IsalGraph, a novel compact string encoding for finite graphs that allows efficient, valid, and isomorphism-invariant representation, enabling applications in graph similarity, generation, and language modeling.
Contribution
It presents a new graph encoding method using a small instruction set and a virtual machine, with algorithms for encoding and canonicalization that are efficient and robust.
Findings
Strong correlation between string Levenshtein distance and graph edit distance
Encoding is valid for all strings over the alphabet, ensuring no invalid states
Evaluation on real-world datasets demonstrates practical effectiveness
Abstract
We present IsalGraph, a method for representing the structure of any finite, simple graph as a compact string over a nine-character instruction alphabet. The encoding is executed by a small virtual machine comprising a sparse graph, a circular doubly-linked list (CDLL) of graph-node references, and two traversal pointers. Instructions either move a pointer through the CDLL or insert a node or edge into the graph. A key design property is that every string over the alphabet decodes to a valid graph, with no invalid states reachable. A greedy \emph{GraphToString} algorithm encodes any connected graph into a string in time polynomial in the number of nodes; an exhaustive-backtracking variant produces a canonical string by selecting the lexicographically smallest shortest string across all starting nodes and all valid traversal orders. We evaluate the representation on five real-world graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks · Natural Language Processing Techniques
