Pre-train and Fine-tune: Recommenders as Large Models

Zhenhao Jiang; Chenghao Chen; Hao Feng; Yu Yang; Jin Liu; Jie Zhang,; Jia Jia; Ning Hu

arXiv:2501.14268·cs.IR·January 27, 2025

Pre-train and Fine-tune: Recommenders as Large Models

Zhenhao Jiang, Chenghao Chen, Hao Feng, Yu Yang, Jin Liu, Jie Zhang,, Jia Jia, Ning Hu

PDF

Open Access

TL;DR

This paper proposes a novel fine-tuning approach for large pre-trained recommenders using an information-aware adaptive kernel, improving adaptability to user interest changes with successful large-scale deployment.

Contribution

It introduces an information-theoretic framework for fine-tuning recommenders and designs the IAK technique tailored for recommendation systems, with practical deployment insights.

Findings

01

IAK outperforms traditional fine-tuning methods in experiments.

02

The approach is interpretable and effective in large-scale online deployment.

03

Deployment on a billion-scale platform yielded significant business profits.

Abstract

In reality, users have different interests in different periods, regions, scenes, etc. Such changes in interest are so drastic that they are difficult to be captured by recommenders. Existing multi-domain learning can alleviate this problem. However, the structure of the industrial recommendation system is complex, the amount of data is huge, and the training cost is extremely high, so it is difficult to modify the structure of the industrial recommender and re-train it. To fill this gap, we consider recommenders as large pre-trained models and fine-tune them. We first propose the theory of the information bottleneck for fine-tuning and present an explanation for the fine-tuning technique in recommenders. To tailor for recommendation, we design an information-aware adaptive kernel (IAK) technique to fine-tune the pre-trained recommender. Specifically, we define fine-tuning as two…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques