TL;DR
This paper develops a high-quality snippet-level annotation dataset for conversational search, enabling better training and evaluation of response generation models that synthesize relevant passages into concise answers.
Contribution
It introduces a refined annotation protocol and collects a large dataset of 1.8k question-paragraph pairs with high-quality annotations for conversational response generation.
Findings
Refined annotation protocol improves data quality
Collected 1.8k annotated question-paragraph pairs
Insights for future response-generation model design
Abstract
Research on conversational search has so far mostly focused on query rewriting and multi-stage passage retrieval. However, synthesizing the top retrieved passages into a complete, relevant, and concise response is still an open challenge. Having snippet-level annotations of relevant passages would enable both (1) the training of response generation models that are able to ground answers in actual statements and (2) the automatic evaluation of the generated responses in terms of completeness. In this paper, we address the problem of collecting high-quality snippet-level answer annotations for two of the TREC Conversational Assistance track datasets. To ensure quality, we first perform a preliminary annotation study, employing different task designs, crowdsourcing platforms, and workers with different qualifications. Based on the outcomes of this study, we refine our annotation protocol…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
