POSSCORE: A Simple Yet Effective Evaluation of Conversational Search with Part of Speech Labelling

Zeyang Liu; Ke Zhou; Jiaxin Mao; Max L. Wilson

arXiv:2109.03039·cs.IR·September 8, 2021

TL;DR

POSSCORE is an automatic, embedding-based evaluation metric for conversational search. It incorporates part-of-speech (POS) information about the terms in a response and aligns better with human preferences than existing metrics.

Contribution

This work is the first to systematically demonstrate the importance of syntactic information, such as POS labels, for conversational search evaluation, and it reports improved correlation with human judgments.

Findings

01 POSSCORE correlates better with human preferences than baseline metrics.

02 Incorporating POS information improves evaluation accuracy.

03 Experimental results show significant improvements over the baseline metrics.

Abstract

Conversational search systems, such as Google Assistant and Microsoft Cortana, provide a new search paradigm in which users communicate with search systems via natural language dialogues. Evaluating such systems is very challenging because search results are presented as natural language sentences. Given the unlimited number of possible responses, collecting relevance assessments for all of them is infeasible. In this paper, we propose POSSCORE, a simple yet effective automatic evaluation method for conversational search. The proposed embedding-based metric takes into account the influence of the part of speech (POS) of the terms in the response. To the best of our knowledge, our work is the first to systematically demonstrate the importance of incorporating syntactic information, such as POS labels, for conversational search evaluation. Experimental…
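The abstract describes the metric only as embedding-based and POS-aware, so the following is a minimal sketch of that general idea, not the formula from the paper. It assumes a greedy best-match cosine similarity between response and reference tokens, weighted by a per-tag importance. The toy vectors in `EMB`, the lexicon in `POS`, the weights in `POS_WEIGHT`, and the function `pos_weighted_similarity` are all hypothetical names and values chosen for illustration.

```python
import math

# Hypothetical toy data: 3-d word vectors, a POS lexicon, and per-tag weights.
# None of these values come from the paper; they only illustrate the idea.
EMB = {
    "paris": (0.9, 0.1, 0.0),
    "london": (0.8, 0.2, 0.1),
    "capital": (0.1, 0.9, 0.0),
    "is": (0.0, 0.1, 0.9),
    "the": (0.0, 0.0, 1.0),
}
POS = {"paris": "NOUN", "london": "NOUN", "capital": "NOUN",
       "is": "VERB", "the": "DET"}
POS_WEIGHT = {"NOUN": 1.0, "VERB": 0.5, "DET": 0.1}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def pos_weighted_similarity(response, reference):
    """Score a response against a reference with POS-weighted greedy matching.

    Each known response token is matched to its most similar reference token,
    and the matches are averaged using the weight of the response token's tag.
    """
    resp = [t for t in response.lower().split() if t in EMB]
    ref = [t for t in reference.lower().split() if t in EMB]
    if not resp or not ref:
        return 0.0
    total, weight_sum = 0.0, 0.0
    for tok in resp:
        best = max(cosine(EMB[tok], EMB[r]) for r in ref)
        weight = POS_WEIGHT.get(POS[tok], 0.0)
        total += weight * best
        weight_sum += weight
    return total / weight_sum if weight_sum else 0.0


if __name__ == "__main__":
    reference = "paris is the capital"
    print(pos_weighted_similarity("paris is the capital", reference))
    print(pos_weighted_similarity("london is the capital", reference))
```

In this sketch a mismatch on a noun lowers the score more than a mismatch on a determiner, which is the kind of effect a POS-aware metric is meant to capture. A real implementation would replace the toy lexicon with a trained tagger and pretrained embeddings.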

Code & Models

Repositories

zy-liu/posscore
Official
