Towards Better Understanding of User Satisfaction in Open-Domain Conversational Search
Zhumin Chu, Qingyao Ai, Zhihong Wang, Yiqun Liu, Yingye Huang, Rui, Zhang, Min Zhang, Shaoping Ma

TL;DR
This paper investigates how to better evaluate user satisfaction in open-domain conversational search by creating a new dataset, analyzing annotation methods, and proposing models to predict satisfaction.
Contribution
It introduces a novel conversational search platform, a Chinese dataset with rich annotations, and models for session-level satisfaction prediction based on turn-level data.
Findings
Some consistency between user and third-party satisfaction annotations
Significant differences observed between annotation types
Proposed models effectively predict session-level satisfaction
Abstract
With the increasing popularity of conversational search, how to evaluate the performance of conversational search systems has become an important question in the IR community. Existing works on conversational search evaluation can mainly be categorized into two streams: (1) constructing metrics based on semantic similarity (e.g. BLUE, METEOR and BERTScore), or (2) directly evaluating the response ranking performance of the system using traditional search methods (e.g. nDCG, RBP and nERR). However, these methods either ignore the information need of the user or ignore the mixed-initiative property of conversational search. This raises the question of how to accurately model user satisfaction in conversational search scenarios. Since explicitly asking users to provide satisfaction feedback is difficult, traditional IR studies often rely on the Cranfield paradigm (i.e., third-party…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpeech and dialogue systems · Expert finding and Q&A systems · AI in Service Interactions
