AuPair: Golden Example Pairs for Code Repair
Aditi Mavalankar, Hassan Mansoor, Zita Marinho, Masha Samsikova, Tom, Schaul

TL;DR
This paper introduces AuPair, a method for improving code repair by synthesizing and selecting optimal example pairs to guide LLMs in self-repair, significantly boosting performance across multiple models and datasets.
Contribution
We propose a novel algorithm to synthesize and select golden example pairs (AuPairs) for in-context learning, enhancing LLM self-repair capabilities in coding tasks.
Findings
Significant performance improvements over baseline methods.
Strong generalization across datasets and models.
Better scaling with increased inference compute budget.
Abstract
Scaling up inference-time compute has proven to be a valuable strategy in improving the performance of Large Language Models (LLMs) without fine-tuning. An important task that can benefit from additional inference-time compute is self-repair; given an initial flawed response, or guess, the LLM corrects its own mistake and produces an improved response, or fix. We leverage the in-context learning ability of LLMs to perform self-repair in the coding domain. The key contribution of our paper is an approach that synthesises and selects an ordered set of golden example pairs, or AuPairs, of these initial guesses and subsequent fixes for the corresponding problems. Each such AuPair is provided as a single in-context example at inference time to generate a repaired solution. For an inference-time compute budget of LLM calls per problem, AuPairs are used to generate repaired…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Machine Learning in Materials Science · Machine Learning and Algorithms
MethodsSparse Evolutionary Training
