Language Edit Distance & Scored Parsing: Faster Algorithms & Connection to Fundamental Graph Problems
Tomasz Kociumaka, Barna Saha

TL;DR
This paper introduces the first subcubic time approximation algorithms for language edit distance, significantly improving efficiency over traditional methods and connecting the problem to fundamental graph algorithms.
Contribution
It presents the first subcubic time algorithms for approximating language edit distance, establishing a link to the all-pairs shortest paths problem.
Findings
Approximate language edit distance in .491 time for any psilon > 0
Achieve psilon multiplicative approximation within .491 time
Exact insertion-only language edit distance computation implies a subcubic all-pairs shortest paths algorithm
Abstract
Given a context free language over alphabet and a string , {\em the language edit distance} problem seeks the minimum number of edits (insertions, deletions and substitutions) required to convert into a valid member of . The well-known dynamic programming algorithm solves this problem in time (ignoring grammar size) where is the string length [Aho, Peterson 1972, Myers 1985]. Despite its numerous applications, to date there exists no algorithm that computes exact or approximate language edit distance problem in true subcubic time. In this paper we give the first such algorithm that approximates language edit distance in subcubic time. For any arbitrary , our algorithm runs in time and returns an estimate within a multiplicative approximation factor of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Algorithms and Data Compression · Topic Modeling
