TL;DR
This paper introduces ProCIS, a large-scale dataset and evaluation framework for proactive conversational information seeking, enabling systems to monitor conversations and proactively retrieve relevant resources at opportune moments.
Contribution
It presents a new dataset, ProCIS, with relevance annotations and a novel evaluation metric for proactive retrieval in multi-party conversations, addressing a gap in existing research.
Findings
ProCIS dataset contains over 2.8 million conversations.
Normalized proactive discounted cumulative gain (npDCG) effectively evaluates proactive retrieval.
Benchmark results demonstrate the effectiveness of various models, including a new proposed model.
Abstract
The field of conversational information seeking, which is rapidly gaining interest in both academia and industry, is changing how we interact with search engines through natural language interactions. Existing datasets and methods are mostly evaluating reactive conversational information seeking systems that solely provide response to every query from the user. We identify a gap in building and evaluating proactive conversational information seeking systems that can monitor a multi-party human conversation and proactively engage in the conversation at an opportune moment by retrieving useful resources and suggestions. In this paper, we introduce a large-scale dataset for proactive document retrieval that consists of over 2.8 million conversations. We conduct crowdsourcing experiments to obtain high-quality and relatively complete relevance judgments through depth-k pooling. We also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
