HYSYNTH: Context-Free LLM Approximation for Guiding Program Synthesis

Shraddha Barke; Emmanuel Anaya Gonzalez; Saketh Ram Kasibatla; Taylor; Berg-Kirkpatrick; Nadia Polikarpova

arXiv:2405.15880·cs.PL·November 4, 2024·5 cites

HYSYNTH: Context-Free LLM Approximation for Guiding Program Synthesis

Shraddha Barke, Emmanuel Anaya Gonzalez, Saketh Ram Kasibatla, Taylor, Berg-Kirkpatrick, Nadia Polikarpova

PDF

Open Access 1 Video

TL;DR

This paper introduces HYSYNTH, a hybrid method combining large language models and symbolic search to improve program synthesis accuracy across multiple domains.

Contribution

It presents a novel context-free surrogate model learned from LLM completions to guide program synthesis, outperforming existing methods.

Findings

01

Outperforms unguided search and LLM sampling

02

Effective across three different domains

03

Improves synthesis accuracy significantly

Abstract

Many structured prediction and reasoning tasks can be framed as program synthesis problems, where the goal is to generate a program in a domain-specific language (DSL) that transforms input data into the desired output. Unfortunately, purely neural approaches, such as large language models (LLMs), often fail to produce fully correct programs in unfamiliar DSLs, while purely symbolic methods based on combinatorial search scale poorly to complex problems. Motivated by these limitations, we introduce a hybrid approach, where LLM completions for a given task are used to learn a task-specific, context-free surrogate model, which is then used to guide program synthesis. We evaluate this hybrid approach on three domains, and show that it outperforms both unguided search and direct sampling from LLMs, as well as existing program synthesizers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

HYSYNTH: Context-Free LLM Approximation for Guiding Program Synthesis· slideslive

Taxonomy

TopicsParallel Computing and Optimization Techniques · Software Testing and Debugging Techniques · Embedded Systems Design Techniques