Multi-Task End-to-End Training Improves Conversational Recommendation
Naveen Ram, Dima Kuzmin, Ellie Ka In Chio, Moustafa Farid Alzantot,, Santiago Ontanon, Ambarish Jash, and Judith Yue Li

TL;DR
This paper demonstrates that a unified multitask transformer model can effectively perform conversational recommendations and dialogue generation, outperforming complex multi-component systems by leveraging multitask learning on dialogue and movie data.
Contribution
The study introduces a multitask end-to-end transformer approach for conversational recommendation, showing competitive performance and knowledge transfer from auxiliary tasks.
Findings
Multitask training improves recommendation accuracy by up to 52%.
Knowledge from auxiliary tasks transfers effectively to conversational settings.
Unified transformer models can replace complex multi-component systems.
Abstract
In this paper, we analyze the performance of a multitask end-to-end transformer model on the task of conversational recommendations, which aim to provide recommendations based on a user's explicit preferences expressed in dialogue. While previous works in this area adopt complex multi-component approaches where the dialogue management and entity recommendation tasks are handled by separate components, we show that a unified transformer model, based on the T5 text-to-text transformer model, can perform competitively in both recommending relevant items and generating conversation dialogue. We fine-tune our model on the ReDIAL conversational movie recommendation dataset, and create additional training tasks derived from MovieLens (such as the prediction of movie attributes and related movies based on an input movie), in a multitask learning setting. Using a series of probe studies, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Topic Modeling · Multimodal Machine Learning Applications
MethodsAttention Is All You Need · Adafactor · Softmax · Inverse Square Root Schedule · Layer Normalization · Linear Layer · Dropout · Byte Pair Encoding · Gated Linear Unit · Refunds@Expedia|||How do I get a full refund from Expedia?
