TL;DR
This paper introduces a standardized benchmark for table programming languages, focusing on expressive power and diagnostic quality of rich types, to improve comparability and evaluation of different systems.
Contribution
It creates a standard library of table operations with detailed type specifications and a suite of erroneous programs for benchmarking expressiveness and error feedback quality.
Findings
Established a common benchmark for table language expressiveness.
Developed a suite of erroneous programs to evaluate error diagnostics.
Proposed a datasheet format for systematic comparison of implementations.
Abstract
Context: Tables are ubiquitous formats for data. Therefore, techniques for writing correct programs over tables, and debugging incorrect ones, are vital. Our specific focus in this paper is on rich types that articulate the properties of tabular operations. We wish to study both their expressive power and _diagnostic quality_. Inquiry: There is no "standard library" of table operations. As a result, every paper (and project) is free to use its own (sub)set of operations. This makes artifacts very difficult to compare, and it can be hard to tell whether omitted operations were left out by oversight or because they cannot actually be expressed. Furthermore, virtually no papers discuss the quality of type error feedback. Approach: We combed through several existing languages and libraries to create a "standard library" of table operations. Each entry is accompanied by a detailed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
