Assessing the Answerability of Queries in Retrieval-Augmented Code Generation

Geonmin Kim; Jaeyeon Kim; Hancheol Park; Wooksu Shin; and Tae-Ho Kim

arXiv:2411.05547 · cs.CL · November 26, 2024

TL;DR

This paper introduces a task and a benchmark dataset, RaCGEval, for evaluating answerability in retrieval-augmented code generation: whether a valid answer can be generated from a user's query and the retrieved APIs. Baseline results show that the task is difficult.

Contribution

The paper proposes the answerability evaluation task and builds the RaCGEval benchmark dataset to measure how well models perform it.

Findings

01. Baseline models achieve only 46.7% performance on answerability.

02. Answerability remains a very challenging task for current models.

03. The paper discusses potential methods to significantly improve model performance.

Abstract

Thanks to the unprecedented language understanding and generation capabilities of large language models (LLMs), Retrieval-augmented Code Generation (RaCG) has recently been widely utilized among software developers. While this has increased productivity, there are still frequent instances of incorrect code being provided. In particular, there are cases where plausible yet incorrect code is generated for queries from users that cannot be answered with the given queries and API descriptions. This study proposes a task for evaluating answerability, which assesses whether valid answers can be generated based on users' queries and retrieved APIs in RaCG. Additionally, we build a benchmark dataset called Retrieval-augmented Code Generability Evaluation (RaCGEval) to evaluate the performance of models performing this task. Experimental results show that this task remains at a very challenging…
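The answerability task described in the abstract can be read as a classification problem over (query, retrieved APIs) pairs. The following is a minimal sketch of that framing, not the RaCGEval schema or the authors' method: the field names, the two label strings, the toy examples, the word-overlap baseline, and the use of plain accuracy as the score are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Callable, List

# Hypothetical label names; the benchmark's actual label set may differ.
ANSWERABLE = "answerable"
UNANSWERABLE = "unanswerable"


@dataclass
class AnswerabilityExample:
    """One (query, retrieved APIs) pair with a gold label (assumed layout)."""
    query: str
    retrieved_apis: List[str]  # API descriptions returned by a retriever
    label: str


def overlap_baseline(query: str, retrieved_apis: List[str]) -> str:
    """Toy baseline (an assumption, not from the paper): predict 'answerable'
    if any query word longer than 3 characters appears in the API text."""
    query_words = {w for w in query.lower().split() if len(w) > 3}
    api_words = set(" ".join(retrieved_apis).lower().split())
    return ANSWERABLE if query_words & api_words else UNANSWERABLE


def accuracy(
    examples: List[AnswerabilityExample],
    predict: Callable[[str, List[str]], str],
) -> float:
    """Fraction of examples whose predicted label matches the gold label."""
    if not examples:
        raise ValueError("examples must be non-empty")
    correct = sum(
        predict(ex.query, ex.retrieved_apis) == ex.label for ex in examples
    )
    return correct / len(examples)


# Invented toy data: a single retrieved API and three user queries.
RESIZE_API = "resize(image, size): resize an image to the given size"
EXAMPLES = [
    AnswerabilityExample("resize an image to 224x224", [RESIZE_API], ANSWERABLE),
    AnswerabilityExample("export the model to ONNX", [RESIZE_API], UNANSWERABLE),
    AnswerabilityExample("rotate an image by 90 degrees", [RESIZE_API], UNANSWERABLE),
]

if __name__ == "__main__":
    print(f"accuracy = {accuracy(EXAMPLES, overlap_baseline):.3f}")
```

On this toy data the baseline gets two of the three examples right. It misses the third because the word "image" appears in the resize description even though no retrieved API supports rotation, which is the kind of plausible-but-wrong match the abstract warns about.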

Taxonomy

Topics: Natural Language Processing Techniques · Semantic Web and Ontologies · Intelligent Tutoring Systems and Adaptive Learning