Tiny-NewsRec: Effective and Efficient PLM-based News Recommendation
Yang Yu, Fangzhao Wu, Chuhan Wu, Jingwei Yi, Qi Liu

TL;DR
Tiny-NewsRec enhances news recommendation by adapting pre-trained language models to the news domain and distilling knowledge to create a more efficient and effective model suitable for real-time applications.
Contribution
The paper introduces a domain-specific post-training and a two-stage knowledge distillation approach for PLM-based news recommendation.
Findings
Improves recommendation accuracy in real-world datasets.
Reduces model size and computational overhead.
Maintains high performance after distillation.
Abstract
News recommendation is a widely adopted technique to provide personalized news feeds for the user. Recently, pre-trained language models (PLMs) have demonstrated the great capability of natural language understanding and benefited news recommendation via improving news modeling. However, most existing works simply finetune the PLM with the news recommendation task, which may suffer from the known domain shift problem between the pre-training corpus and downstream news texts. Moreover, PLMs usually contain a large volume of parameters and have high computational overhead, which imposes a great burden on low-latency online services. In this paper, we propose Tiny-NewsRec, which can improve both the effectiveness and the efficiency of PLM-based news recommendation. We first design a self-supervised domain-specific post-training method to better adapt the general PLM to the news domain with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications
MethodsKnowledge Distillation
