Improved Query Reformulation for Concept Location using CodeRank and Document Structures
Mohammad Masudur Rahman, Chanchal K. Roy

TL;DR
This paper introduces ACER, a new method for improving software concept location by reformulating queries using CodeRank, document structures, and machine learning, significantly enhancing search effectiveness during software maintenance.
Contribution
The paper presents ACER, a novel query reformulation technique that combines CodeRank, document structure analysis, and machine learning for better concept location in software maintenance.
Findings
ACER improves 71% of baseline queries.
Outperforms five existing query reformulation techniques.
Validated on eight software systems with 1,675 queries.
Abstract
During software maintenance, developers usually deal with a significant number of software change requests. As a part of this, they often formulate an initial query from the request texts, and then attempt to map the concepts discussed in the request to relevant source code locations in the software system (a.k.a., concept location). Unfortunately, studies suggest that they often perform poorly in choosing the right search terms for a change task. In this paper, we propose a novel technique --ACER-- that takes an initial query, identifies appropriate search terms from the source code using a novel term weight --CodeRank, and then suggests effective reformulation to the initial query by exploiting the source document structures, query quality analysis and machine learning. Experiments with 1,675 baseline queries from eight subject systems report that our technique can improve 71% of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Web Data Mining and Analysis · Software System Performance and Reliability
