Addressing Popularity Bias in Third-Party Library Recommendations Using LLMs

Claudio Di Sipio, Juri Di Rocco, Davide Di Ruscio, and Vladyslav Bulhakov

arXiv:2501.10313·cs.SE·January 20, 2025

TL;DR

This paper investigates whether large language models can mitigate popularity bias in third-party library recommendations, finding that current LLMs are ineffective despite some improvements in recommendation diversity.

Contribution

The study evaluates state-of-the-art LLM techniques for reducing popularity bias in software library recommenders, highlighting their limitations and suggesting directions for future research.

Findings

01. LLMs do not effectively address popularity bias in TPL recommenders.

02. Fine-tuning and penalty mechanisms increase recommendation diversity.

03. Current LLMs have limitations in mitigating popularity bias.
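
To make the ideas of a popularity penalty and recommendation diversity more concrete, here is a minimal, generic sketch. It is not the paper's LLM-based mechanism: the function names `penalized_rerank` and `catalog_coverage`, the logarithmic penalty, the `alpha` weight, and the toy library scores are all assumptions introduced purely for illustration. The first function lowers each candidate's relevance score in proportion to how often the library is already used; the second reports the fraction of the catalog that appears in at least one recommendation list, a common proxy for diversity.

```python
import math


def penalized_rerank(scores, popularity, alpha=0.5, k=3):
    """Return the top-k libraries after a popularity penalty (hypothetical helper).

    scores:     dict mapping library name -> relevance score from some recommender
    popularity: dict mapping library name -> usage count in the training data
    alpha:      penalty weight (assumed knob); 0 disables the penalty
    """
    adjusted = {
        lib: score - alpha * math.log1p(popularity.get(lib, 0))
        for lib, score in scores.items()
    }
    # Sort by adjusted score (descending); break ties by name for determinism.
    return sorted(adjusted, key=lambda lib: (-adjusted[lib], lib))[:k]


def catalog_coverage(recommendation_lists, catalog):
    """Fraction of catalog items that appear in at least one recommendation list."""
    recommended = set()
    for rec_list in recommendation_lists:
        recommended.update(rec_list)
    return len(recommended & set(catalog)) / len(catalog)


if __name__ == "__main__":
    # Toy data, invented for this example only.
    scores = {"junit": 0.90, "guava": 0.85, "niche-lib": 0.80}
    popularity = {"junit": 1000, "guava": 500, "niche-lib": 5}

    print(penalized_rerank(scores, popularity, alpha=0.0, k=1))
    print(penalized_rerank(scores, popularity, alpha=0.1, k=1))
    print(catalog_coverage([["junit", "guava"], ["junit"]], list(scores)))
```

With `alpha=0.0` the ranking follows the raw scores, so the most popular library stays on top; a small positive `alpha` lets the rarely used library overtake it. Whether such a penalty also preserves accuracy is a separate question, and the findings above suggest that gains in diversity do not by themselves resolve popularity bias.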

Abstract

Recommender systems for software engineering (RSSE) play a crucial role in automating development tasks by providing relevant suggestions according to the developer's context. However, they suffer from the so-called popularity bias, i.e., the phenomenon of recommending popular items that might be irrelevant to the current task. In particular, the long-tail effect can hamper the system's performance in terms of accuracy, thus leading to false positives in the provided recommendations. Foundation models are the most advanced generative AI-based models that achieve relevant results in several SE tasks. This paper aims to investigate the capability of large language models (LLMs) to address the popularity bias in recommender systems of third-party libraries (TPLs). We conduct an ablation study experimenting with state-of-the-art techniques to mitigate the popularity bias, including…

Taxonomy

Topics: Recommender Systems and Techniques · Topic Modeling · Spam and Phishing Detection