The Labeled Direct Product Optimally Solves String Problems on Graphs
Nicola Rizzo, Alexandru I. Tomescu, Alberto Policriti

TL;DR
This paper introduces the labeled direct product of graphs as a universal tool to develop optimal, efficient algorithms for string problems on labeled graphs, including some previously open problems.
Contribution
The paper presents the labeled direct product as a general framework for solving string problems on labeled graphs, simplifying algorithms and extending solutions to new cases.
Findings
Algorithms for string matching and longest common substring are linear in the size of the labeled product graph.
The approach solves the open problem of LCSP on graphs with cycles.
The algorithms are proven optimal under the Orthogonal Vectors Hypothesis.
Abstract
Suffix trees are an important data structure at the core of optimal solutions to many fundamental string problems, such as exact pattern matching, longest common substring, matching statistics, and longest repeated substring. Recent lines of research focused on extending some of these problems to vertex-labeled graphs, although using ad-hoc approaches which in some cases do not generalize to all input graphs. In the absence of a ubiquitous tool like the suffix tree for labeled graphs, we introduce the labeled direct product of two graphs as a general tool for obtaining optimal algorithms: we obtain conceptually simpler algorithms for the quadratic problems of string matching (SMLG) and longest common substring (LCSP) in labeled graphs. Our algorithms are also more efficient, since they run in time linear in the size of the labeled product graph, which may be smaller than quadratic for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAlgorithms and Data Compression · DNA and Biological Computing · Network Packet Processing and Optimization
