Collaborative Distillation for Top-N Recommendation

Jae-woong Lee; Minjin Choi; Jongwuk Lee; and Hyunjung Shim

arXiv:1911.05276·cs.LG·November 14, 2019

Collaborative Distillation for Top-N Recommendation

Jae-woong Lee, Minjin Choi, Jongwuk Lee, and Hyunjung Shim

PDF

TL;DR

This paper introduces a collaborative distillation method for top-N recommendation that effectively compresses recommender models, addressing feedback ambiguity and ranking challenges, and outperforms existing methods in key metrics.

Contribution

It proposes a novel collaborative distillation framework with specialized loss functions and training strategies tailored for top-N recommendation tasks.

Findings

01

Outperforms state-of-the-art methods by up to 33.2% in hit rate.

02

Achieves comparable performance to teacher models.

03

Effectively handles feedback ambiguity and ranking issues.

Abstract

Knowledge distillation (KD) is a well-known method to reduce inference latency by compressing a cumbersome teacher model to a small student model. Despite the success of KD in the classification task, applying KD to recommender models is challenging due to the sparsity of positive feedback, the ambiguity of missing feedback, and the ranking problem associated with the top-N recommendation. To address the issues, we propose a new KD model for the collaborative filtering approach, namely collaborative distillation (CD). Specifically, (1) we reformulate a loss function to deal with the ambiguity of missing feedback. (2) We exploit probabilistic rank-aware sampling for the top-N recommendation. (3) To train the proposed model effectively, we develop two training strategies for the student model, called the teacher- and the student-guided training methods, selecting the most useful feedback…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.