A Needle in a Haystack: Intent-driven Reusable Artifacts Recommendation with LLMs
Dongming Jin, Zhi Jin, Xiaohong Chen, Zheng Fang, Linyu Li, Yuanpeng He, Jia Li, Yirang Zhang, Yingtao Fang

TL;DR
This paper evaluates the effectiveness of Large Language Models in recommending reusable software artifacts based on developer intent, introduces a benchmark for comparison, and proposes a hierarchical framework to improve precision and efficiency.
Contribution
It constructs IntentRecBench for evaluating LLMs in artifact recommendation and proposes TreeRec, a semantic tree-guided framework that enhances LLM performance in this task.
Findings
LLMs outperform traditional methods in artifact recommendation.
TreeRec improves precision and reduces inference time.
The framework demonstrates generalizability across ecosystems.
Abstract
In open source software development, the reuse of existing artifacts has been widely adopted to avoid redundant implementation work. Reusable artifacts are considered more efficient and reliable than developing software components from scratch. However, when faced with a large number of reusable artifacts, developers often struggle to find artifacts that can meet their expected needs. To reduce this burden, retrieval-based and learning-based techniques have been proposed to automate artifact recommendations. Recently, Large Language Models (LLMs) have shown the potential to understand intentions, perform semantic alignment, and recommend usable artifacts. Nevertheless, their effectiveness has not been thoroughly explored. To fill this gap, we construct an intent-driven artifact recommendation benchmark named IntentRecBench, covering three representative open source ecosystems. Using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Software Engineering Techniques and Practices · Scientific Computing and Data Management
