Improving LLM-based Recommendation with Self-Hard Negatives from Intermediate Layers
Bingqian Li, Bowen Zheng, Xiaolei Wang, Long Zhang, Jinpeng Wang, Sheng Chen, Wayne Xin Zhao, Ji-rong Wen

TL;DR
This paper introduces ILRec, a novel fine-tuning framework for LLM-based recommender systems that leverages self-hard negative signals from intermediate layers to improve preference learning and recommendation accuracy.
Contribution
ILRec is the first to extract and utilize self-hard negative tokens from intermediate layers for more effective preference learning in LLM-based recommendation.
Findings
ILRec significantly outperforms existing methods on three datasets.
Self-hard negatives improve the discriminative power of preference models.
The framework effectively balances negative signal quality with false negative mitigation.
Abstract
Large language models (LLMs) have shown great promise in recommender systems, where supervised fine-tuning (SFT) is commonly used for adaptation. Subsequent studies further introduce preference learning to incorporate negative samples into the training process. However, existing methods rely on sequence-level, offline-generated negatives, making them less discriminative and informative when adapting LLMs to recommendation tasks with large negative item spaces. To address these challenges, we propose ILRec, a novel preference fine-tuning framework for LLM-based recommendation, leveraging self-hard negative signals extracted from intermediate layers to improve preference learning. Specifically, we identify self-hard negative tokens from intermediate layers as fine-grained negative supervision that dynamically reflects the model's preference learning process. To effectively integrate these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Explainable Artificial Intelligence (XAI) · Sentiment Analysis and Opinion Mining
