Cross-Task Knowledge Distillation in Multi-Task Recommendation
Chenxiao Yang, Junwei Pan, Xiaofeng Gao, Tingyu Jiang, Dapeng Liu,, Guihai Chen

TL;DR
This paper introduces a novel Cross-Task Knowledge Distillation framework for multi-task recommendation systems, leveraging task-specific prediction results to improve overall performance through auxiliary tasks, calibrated distillation, and error correction mechanisms.
Contribution
It proposes a new framework combining MTL and KD, with auxiliary tasks, calibrated knowledge transfer, and error correction to address task conflicts and improve recommendation accuracy.
Findings
Enhanced recommendation performance on real-world datasets.
Effective knowledge transfer between tasks improves fine-grained user preference modeling.
The framework outperforms existing multi-task learning methods.
Abstract
Multi-task learning (MTL) has been widely used in recommender systems, wherein predicting each type of user feedback on items (e.g, click, purchase) are treated as individual tasks and jointly trained with a unified model. Our key observation is that the prediction results of each task may contain task-specific knowledge about user's fine-grained preference towards items. While such knowledge could be transferred to benefit other tasks, it is being overlooked under the current MTL paradigm. This paper, instead, proposes a Cross-Task Knowledge Distillation framework that attempts to leverage prediction results of one task as supervised signals to teach another task. However, integrating MTL and KD in a proper manner is non-trivial due to several challenges including task conflicts, inconsistent magnitude and requirement of synchronous optimization. As countermeasures, we 1) introduce…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsRecommender Systems and Techniques · Advanced Bandit Algorithms Research · Domain Adaptation and Few-Shot Learning
MethodsALIGN · Knowledge Distillation
