On Estimating the Training Cost of Conversational Recommendation Systems

Stefanos Antaris; Dimitrios Rafailidis; Mohammad Aliannejadi

arXiv:2011.05302·cs.LG·November 11, 2020

On Estimating the Training Cost of Conversational Recommendation Systems

Stefanos Antaris, Dimitrios Rafailidis, Mohammad Aliannejadi

PDF

Open Access

TL;DR

This paper investigates the high training costs of conversational recommendation systems, analyzing five strategies and discussing knowledge distillation techniques to reduce inference time and computational expenses.

Contribution

It provides an analysis of training costs for conversational recommendation models and explores knowledge distillation as a solution to reduce inference time.

Findings

01

High training time for state-of-the-art models

02

Analysis of five representative training strategies

03

Discussion of knowledge distillation challenges

Abstract

Conversational recommendation systems have recently gain a lot of attention, as users can continuously interact with the system over multiple conversational turns. However, conversational recommendation systems are based on complex neural architectures, thus the training cost of such models is high. To shed light on the high computational training time of state-of-the art conversational models, we examine five representative strategies and demonstrate this issue. Furthermore, we discuss possible ways to cope with the high training cost following knowledge distillation strategies, where we detail the key challenges to reduce the online inference time of the high number of model parameters in conversational recommendation systems

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Recommender Systems and Techniques · Multimodal Machine Learning Applications

MethodsKnowledge Distillation