Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs
Claudio Di Sipio, Juri Di Rocco, Davide Di Ruscio, and Vladyslav, Bulhakov

TL;DR
This paper investigates whether large language models can mitigate popularity bias in third-party library recommendations, finding that current LLMs are ineffective despite some improvements in recommendation diversity.
Contribution
The study evaluates state-of-the-art LLM techniques for reducing popularity bias in software library recommenders, highlighting their limitations and suggesting directions for future research.
Findings
LLMs do not effectively address popularity bias in TPL recommenders.
Fine-tuning and penalty mechanisms increase recommendation diversity.
Current LLMs have limitations in mitigating popularity bias.
Abstract
Recommender systems for software engineering (RSSE) play a crucial role in automating development tasks by providing relevant suggestions according to the developer's context. However, they suffer from the so-called popularity bias, i.e., the phenomenon of recommending popular items that might be irrelevant to the current task. In particular, the long-tail effect can hamper the system's performance in terms of accuracy, thus leading to false positives in the provided recommendations. Foundation models are the most advanced generative AI-based models that achieve relevant results in several SE tasks. This paper aims to investigate the capability of large language models (LLMs) to address the popularity bias in recommender systems of third-party libraries (TPLs). We conduct an ablation study experimenting with state-of-the-art techniques to mitigate the popularity bias, including…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Spam and Phishing Detection
