Minimizing speculation overhead in a parallel recognizer for regular texts
Angelo Borsotti, Luca Breveglieri, Stefano Crespi Reghizzi, Angelo, Morzenti

TL;DR
This paper introduces RI-DFA, a new automaton type that reduces speculation overhead in parallel regular text recognition, leading to faster multi-core processing with manageable construction costs.
Contribution
The paper presents RI-DFA, a novel automaton that minimizes speculation overhead in parallel recognition without increasing nondeterminism or automaton size, improving efficiency over existing methods.
Findings
RI-DFA reduces the number of starting states compared to traditional DFA.
RI-DFA achieves faster recognition times than NFA-based methods on multi-core architectures.
Construction cost of RI-DFA is moderate and practical.
Abstract
Speculative data-parallel algorithms for language recognition have been widely experimented for various types of finite-state automata (FA), deterministic (DFA) and nondeterministic (NFA), often derived from regular expressions (RE). Such an algorithm cuts the input string into chunks, independently recognizes each chunk in parallel by means of identical FAs, and at last joins the chunk results and checks overall consistency. In chunk recognition, it is necessary to speculatively start the FAs in any state, thus causing an overhead that reduces the speedup compared to a serial algorithm. Existing data-parallel DFA-based recognizers suffer from the excessive number of starting states, and the NFA-based ones suffer from the number of nondeterministic transitions. Our data-parallel algorithm is based on the new FA type called reduced interface DFA (RI-DFA), which minimizes the speculation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Biological Computing · Network Packet Processing and Optimization · Algorithms and Data Compression
