PAC Prediction Sets for Large Language Models of Code

Adam Khakhar; Stephen Mell; Osbert Bastani

arXiv:2302.08703·cs.LG·June 22, 2023

PAC Prediction Sets for Large Language Models of Code

Adam Khakhar, Stephen Mell, Osbert Bastani

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel method for generating PAC prediction sets for large language models of code, providing high-confidence partial program outputs with theoretical guarantees, addressing the challenge of exponential label spaces in structured prediction.

Contribution

It proposes a new algorithm leveraging abstract syntax trees to produce compact, high-confidence partial programs for code generation models, a first in this domain.

Findings

01

Generates compact PAC prediction sets for code models

02

Demonstrates effectiveness on SQL and multiple programming languages

03

Provides theoretical guarantees for partial program predictions

Abstract

Prediction sets have recently been shown to be a promising strategy for quantifying the uncertainty of deep neural networks in a way that provides theoretical guarantees. However, existing techniques have largely targeted settings where the space of labels is simple, so prediction sets can be arbitrary subsets of labels. For structured prediction problems where the space of labels is exponential in size, even prediction sets containing a small fraction of all labels can be exponentially large. In the context of code generation, we propose a solution that considers a restricted set of prediction sets that can compactly be represented as partial programs, which are programs with portions replaced with holes. Given a trained code generation model, our algorithm leverages a programming language's abstract syntax tree to generate a set of programs such that the correct program is in the set…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

adamkhakhar/python-pac-code-prediction-set
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Machine Learning and Algorithms · Adversarial Robustness in Machine Learning

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Adam · SentencePiece · Adafactor · Gated Linear Unit · Inverse Square Root Schedule · Softmax