Generating Clarifying Questions for Query Refinement in Source Code   Search

Zachary Eberhart; Collin McMillan

arXiv:2201.09974·cs.SE·January 26, 2022

Generating Clarifying Questions for Query Refinement in Source Code Search

Zachary Eberhart, Collin McMillan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method for generating natural clarifying questions to improve source code search, reducing search time and enhancing query refinement by mimicking human question-asking behavior.

Contribution

It proposes a novel approach for automatic clarifying question generation in code search, leveraging function names and comments, and demonstrates its effectiveness through synthetic and human studies.

Findings

01

Outperformed keyword-based methods in synthetic tests.

02

Reduced search duration in human studies.

03

Enhanced query refinement process in code search.

Abstract

In source code search, a common information-seeking strategy involves providing a short initial query with a broad meaning, and then iteratively refining the query using terms gleaned from the results of subsequent searches. This strategy requires programmers to spend time reading search results that are irrelevant to their development needs. In contrast, when programmers seek information from other humans, they typically refine queries by asking and answering clarifying questions. Clarifying questions have been shown to benefit general-purpose search engines, but have not been examined in the context of code search. We present a method for generating natural-sounding clarifying questions using information extracted from function names and comments. Our method outperformed a keyword-based method for single-turn refinement in synthetic studies, and was associated with shorter search…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zeberhart/zacq
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Engineering Research · Open Source Software Innovations · Topic Modeling