Ranking Distillation: Learning Compact Ranking Models With High   Performance for Recommender System

Jiaxi Tang; Ke Wang

arXiv:1809.07428·cs.LG·September 21, 2018·20 cites

Ranking Distillation: Learning Compact Ranking Models With High Performance for Recommender System

Jiaxi Tang, Ke Wang

PDF

Open Access 1 Repo

TL;DR

This paper introduces ranking distillation, a novel knowledge distillation technique for learning compact, high-performance ranking models in recommender systems, achieving efficiency without sacrificing effectiveness.

Contribution

The paper proposes a new ranking distillation method that trains smaller models to mimic larger models, improving efficiency while maintaining ranking performance.

Findings

01

Student models are less than half the size of teacher models.

02

Ranking performance of student models is comparable to teacher models.

03

Ranking distillation outperforms models trained without it.

Abstract

We propose a novel way to train ranking models, such as recommender systems, that are both effective and efficient. Knowledge distillation (KD) was shown to be successful in image recognition to achieve both effectiveness and efficiency. We propose a KD technique for learning to rank problems, called \emph{ranking distillation (RD)}. Specifically, we train a smaller student model to learn to rank documents/items from both the training data and the supervision of a larger teacher model. The student model achieves a similar ranking performance to that of the large teacher model, but its smaller model size makes the online inference more efficient. RD is flexible because it is orthogonal to the choices of ranking models for the teacher and student. We address the challenges of RD for ranking problems. The experiments on public data sets and state-of-the-art recommendation models showed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

graytowne/rank_distill
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Data Stream Mining Techniques · Advanced Bandit Algorithms Research

MethodsKnowledge Distillation