On Estimating the Training Cost of Conversational Recommendation Systems
Stefanos Antaris, Dimitrios Rafailidis, Mohammad Aliannejadi

TL;DR
This paper investigates the high training costs of conversational recommendation systems, analyzing five strategies and discussing knowledge distillation techniques to reduce inference time and computational expenses.
Contribution
It provides an analysis of training costs for conversational recommendation models and explores knowledge distillation as a solution to reduce inference time.
Findings
High training time for state-of-the-art models
Analysis of five representative training strategies
Discussion of knowledge distillation challenges
Abstract
Conversational recommendation systems have recently gain a lot of attention, as users can continuously interact with the system over multiple conversational turns. However, conversational recommendation systems are based on complex neural architectures, thus the training cost of such models is high. To shed light on the high computational training time of state-of-the art conversational models, we examine five representative strategies and demonstrate this issue. Furthermore, we discuss possible ways to cope with the high training cost following knowledge distillation strategies, where we detail the key challenges to reduce the online inference time of the high number of model parameters in conversational recommendation systems
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Recommender Systems and Techniques · Multimodal Machine Learning Applications
MethodsKnowledge Distillation
