Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking

Zhi-Cun Lyu; Xin-Ye Li; Zheng Xie; Ming Li

arXiv:2408.05715·cs.AI·August 13, 2024

Top Pass: Improve Code Generation by Pass@k-Maximized Code Ranking

Zhi-Cun Lyu, Xin-Ye Li, Zheng Xie, Ming Li

PDF

Open Access

TL;DR

Top Pass is a novel code ranking method that improves the likelihood of finding correct solutions quickly by directly optimizing the pass@k metric, significantly enhancing code generation usability.

Contribution

It introduces a new ranking approach that directly optimizes pass@k, leading to better top candidate selection in code generation tasks.

Findings

01

32.9% relative improvement in pass@1 on CodeContests

02

Enhanced ranking quality for code generation models

03

Better usability by reducing candidate review effort

Abstract

Code generation has been greatly enhanced by the profound advancements in Large Language Models (LLMs) recently. Nevertheless, such LLM-based code generation approaches still struggle to generate error-free code in a few tries when faced with complex problems. To address this, the prevailing strategy is to sample a huge number of candidate programs, with the hope of any one in them could work. However, users of code generation systems usually expect to find a correct program by reviewing or testing only a small number of code candidates. Otherwise, the system would be unhelpful. In this paper, we propose Top Pass, a code ranking approach that identifies potential correct solutions from a large number of candidates. Top Pass directly optimizes the pass@k loss function, enhancing the quality at the top of the candidate list. This enables the user to find the correct solution within as few…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Formal Methods in Verification · Natural Language Processing Techniques