Failure Prediction in Conversational Recommendation Systems
Maria Vlachou

TL;DR
This paper introduces a method to predict conversational recommendation system failures, distinguishing between system and catalogue failures, using an AutoEncoder-based predictor on multi-turn semantic representations.
Contribution
It proposes the task of Supervised Conversational Performance Prediction and develops an AutoEncoder-based predictor for failure detection in conversational image recommendation systems.
Findings
Predictors effectively identify system failures in recommendation scenarios.
Predictive performance decreases significantly for catalogue failure detection.
AutoEncoder-based approach shows promise for failure prediction tasks.
Abstract
In a Conversational Image Recommendation task, users can provide natural language feedback on a recommended image item, which leads to an improved recommendation in the next turn. While typical instantiations of this task assume that the user's target item will (eventually) be returned, this might often not be true, for example, the item the user seeks is not within the item catalogue. Failing to return a user's desired item can lead to user frustration, as the user needs to interact with the system for an increased number of turns. To mitigate this issue, in this paper, we introduce the task of Supervised Conversational Performance Prediction, inspired by Query Performance Prediction (QPP) for predicting effectiveness in response to a search engine query. In this regard, we propose predictors for conversational performance that detect conversation failures using multi-turn semantic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques
