LL(1) Parsing with Derivatives and Zippers

Romain Edelmann; Jad Hamza; Viktor Kun\v{c}ak

arXiv:1911.12737·cs.FL·January 22, 2021·5 cites

LL(1) Parsing with Derivatives and Zippers

Romain Edelmann, Jad Hamza, Viktor Kun\v{c}ak

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new, efficient, formally verified LL(1) parser based on derivatives and zippers, improving performance and correctness for parsing context-free expressions.

Contribution

It develops a linear-time, formally verified parsing algorithm for LL(1) expressions using derivatives and zippers, with a practical Scala implementation.

Findings

01

Linear-time parsing algorithm for LL(1) expressions

02

Formal verification of the parser correctness in Coq

03

Implementation as a Scala parser combinator framework

Abstract

In this paper, we present an efficient, functional, and formally verified parsing algorithm for LL(1) context-free expressions based on the concept of derivatives of formal languages. Parsing with derivatives is an elegant parsing technique, which, in the general case, suffers from cubic worst-case time complexity and slow performance in practice. We specialise the parsing with derivatives algorithm to LL(1) context-free expressions, where alternatives can be chosen given a single token of lookahead. We formalise the notion of LL(1) expressions and show how to efficiently check the LL(1) property. Next, we present a novel linear-time parsing with derivatives algorithm for LL(1) expressions operating on a zipper-inspired data structure. We prove the algorithm correct in Coq and present an implementation as a parser combinators framework in Scala, with enumeration and pretty printing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

epfl-lara/scallion-proofs
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLogic, programming, and type systems · Formal Methods in Verification · semigroups and automata theory