Tiny-NewsRec: Effective and Efficient PLM-based News Recommendation

Yang Yu; Fangzhao Wu; Chuhan Wu; Jingwei Yi; Qi Liu

arXiv:2112.00944 · cs.IR · December 13, 2022 · 1 citation

TL;DR

Tiny-NewsRec enhances news recommendation by adapting pre-trained language models to the news domain and distilling knowledge to create a more efficient and effective model suitable for real-time applications.

Contribution

The paper introduces a domain-specific post-training method and a two-stage knowledge distillation method for PLM-based news recommendation.
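To make the distillation idea concrete, here is a minimal sketch of a generic soft-label knowledge distillation loss, in which a small student model is trained to match the temperature-softened output distribution of a larger teacher. This is the standard textbook formulation, not the paper's two-stage procedure, whose details are not given on this page. The function names, the example logits, and the default temperature are all assumptions chosen for illustration.

```python
import math


def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of raw scores."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions.

    Generic soft-label distillation (an assumption for illustration),
    not the specific objective used by Tiny-NewsRec.
    """
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    # The T^2 factor is the usual rescaling that keeps gradient
    # magnitudes comparable across temperatures.
    return kl * temperature ** 2


# Hypothetical click scores for three candidate news articles.
teacher_scores = [2.0, 0.5, -1.0]
student_scores = [1.2, 0.4, -0.3]
loss = distillation_loss(teacher_scores, student_scores)
```

The loss is zero when the student reproduces the teacher's scores exactly and grows as the two distributions diverge. In practice it is typically combined with the ordinary recommendation loss on the ground-truth click labels.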

Findings

01. Improves recommendation accuracy on real-world datasets.

02. Reduces model size and computational overhead.

03. Maintains high performance after distillation.

Abstract

News recommendation is a widely adopted technique to provide personalized news feeds for the user. Recently, pre-trained language models (PLMs) have demonstrated the great capability of natural language understanding and benefited news recommendation via improving news modeling. However, most existing works simply finetune the PLM with the news recommendation task, which may suffer from the known domain shift problem between the pre-training corpus and downstream news texts. Moreover, PLMs usually contain a large volume of parameters and have high computational overhead, which imposes a great burden on low-latency online services. In this paper, we propose Tiny-NewsRec, which can improve both the effectiveness and the efficiency of PLM-based news recommendation. We first design a self-supervised domain-specific post-training method to better adapt the general PLM to the news domain with…

Code & Models

Repositories

yflyl613/tiny-newsrec (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications

Methods: Knowledge Distillation