Learning Semantic String Transformations from Examples
Rishabh Singh, Sumit Gulwani

TL;DR
This paper introduces a novel programming-by-example approach for automating semantic string transformations in spreadsheets, leveraging a new transformation language and synthesis algorithm to learn from user-provided examples.
Contribution
It presents an expressive transformation language combining table lookups and syntactic manipulations, along with a synthesis algorithm that learns transformations from examples, implemented as an Excel add-in.
Findings
Successfully implemented as an Excel add-in
Effective on benchmarks from Excel help-forums
Able to learn complex semantic transformations
Abstract
We address the problem of performing semantic transformations on strings, which may represent a variety of data types (or their combination) such as a column in a relational table, time, date, currency, etc. Unlike syntactic transformations, which are based on regular expressions and which interpret a string as a sequence of characters, semantic transformations additionally require exploiting the semantics of the data type represented by the string, which may be encoded as a database of relational tables. Manually performing such transformations on a large collection of strings is error prone and cumbersome, while programmatic solutions are beyond the skill-set of end-users. We present a programming by example technology that allows end-users to automate such repetitive tasks. We describe an expressive transformation language for semantic manipulation that combines table lookup…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Quality and Management · Advanced Database Systems and Queries · Software Engineering Research
