Towards Better Understanding of User Satisfaction in Open-Domain   Conversational Search

Zhumin Chu; Qingyao Ai; Zhihong Wang; Yiqun Liu; Yingye Huang; Rui; Zhang; Min Zhang; Shaoping Ma

arXiv:2204.02659·cs.IR·January 25, 2024·1 cites

Towards Better Understanding of User Satisfaction in Open-Domain Conversational Search

Zhumin Chu, Qingyao Ai, Zhihong Wang, Yiqun Liu, Yingye Huang, Rui, Zhang, Min Zhang, Shaoping Ma

PDF

Open Access 1 Repo

TL;DR

This paper investigates how to better evaluate user satisfaction in open-domain conversational search by creating a new dataset, analyzing annotation methods, and proposing models to predict satisfaction.

Contribution

It introduces a novel conversational search platform, a Chinese dataset with rich annotations, and models for session-level satisfaction prediction based on turn-level data.

Findings

01

Some consistency between user and third-party satisfaction annotations

02

Significant differences observed between annotation types

03

Proposed models effectively predict session-level satisfaction

Abstract

With the increasing popularity of conversational search, how to evaluate the performance of conversational search systems has become an important question in the IR community. Existing works on conversational search evaluation can mainly be categorized into two streams: (1) constructing metrics based on semantic similarity (e.g. BLUE, METEOR and BERTScore), or (2) directly evaluating the response ranking performance of the system using traditional search methods (e.g. nDCG, RBP and nERR). However, these methods either ignore the information need of the user or ignore the mixed-initiative property of conversational search. This raises the question of how to accurately model user satisfaction in conversational search scenarios. Since explicitly asking users to provide satisfaction feedback is difficult, traditional IR studies often rely on the Cranfield paradigm (i.e., third-party…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chuzhumin98/convsearch-dataset
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpeech and dialogue systems · Expert finding and Q&A systems · AI in Service Interactions