Finetuning Large Language Model for Personalized Ranking
Zhuoxi Bai, Ning Wu, Fengyu Cai, Xinyi Zhu, Yun Xiong

TL;DR
This paper introduces DMPO, a new framework that improves large language models' ability to perform personalized recommendations by optimizing preference alignment, showing significant performance gains in real-world datasets and cross-domain scenarios.
Contribution
The paper proposes DMPO, a novel training framework that enhances LLMs for recommendation tasks by optimizing preference probabilities, addressing data disparity issues.
Findings
DMPO outperforms traditional recommendation methods in few-shot scenarios.
DMPO demonstrates superior cross-domain recommendation capabilities.
The case study highlights DMPO's potential as an explainable recommendation system.
Abstract
Large Language Models (LLMs) have demonstrated remarkable performance across various domains, motivating researchers to investigate their potential use in recommendation systems. However, directly applying LLMs to recommendation tasks has proven challenging due to the significant disparity between the data used for pre-training LLMs and the specific requirements of recommendation tasks. In this study, we introduce Direct Multi-Preference Optimization (DMPO), a streamlined framework designed to bridge the gap and enhance the alignment of LLMs for recommendation tasks. DMPO enhances the performance of LLM-based recommenders by simultaneously maximizing the probability of positive samples and minimizing the probability of multiple negative samples. We conducted experimental evaluations to compare DMPO against traditional recommendation methods and other LLM-based recommendation approaches.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques
