Multi-Task End-to-End Training Improves Conversational Recommendation

Naveen Ram; Dima Kuzmin; Ellie Ka In Chio; Moustafa Farid Alzantot,; Santiago Ontanon; Ambarish Jash; and Judith Yue Li

arXiv:2305.06218·cs.CL·May 11, 2023·2 cites

Multi-Task End-to-End Training Improves Conversational Recommendation

Naveen Ram, Dima Kuzmin, Ellie Ka In Chio, Moustafa Farid Alzantot,, Santiago Ontanon, Ambarish Jash, and Judith Yue Li

PDF

Open Access

TL;DR

This paper demonstrates that a unified multitask transformer model can effectively perform conversational recommendations and dialogue generation, outperforming complex multi-component systems by leveraging multitask learning on dialogue and movie data.

Contribution

The study introduces a multitask end-to-end transformer approach for conversational recommendation, showing competitive performance and knowledge transfer from auxiliary tasks.

Findings

01

Multitask training improves recommendation accuracy by up to 52%.

02

Knowledge from auxiliary tasks transfers effectively to conversational settings.

03

Unified transformer models can replace complex multi-component systems.

Abstract

In this paper, we analyze the performance of a multitask end-to-end transformer model on the task of conversational recommendations, which aim to provide recommendations based on a user's explicit preferences expressed in dialogue. While previous works in this area adopt complex multi-component approaches where the dialogue management and entity recommendation tasks are handled by separate components, we show that a unified transformer model, based on the T5 text-to-text transformer model, can perform competitively in both recommending relevant items and generating conversation dialogue. We fine-tune our model on the ReDIAL conversational movie recommendation dataset, and create additional training tasks derived from MovieLens (such as the prediction of movie attributes and related movies based on an input movie), in a multitask learning setting. Using a series of probe studies, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRecommender Systems and Techniques · Topic Modeling · Multimodal Machine Learning Applications

MethodsAttention Is All You Need · Adafactor · Softmax · Inverse Square Root Schedule · Layer Normalization · Linear Layer · Dropout · Byte Pair Encoding · Gated Linear Unit · Refunds@Expedia|||How do I get a full refund from Expedia?