TREC iKAT 2023: A Test Collection for Evaluating Conversational and   Interactive Knowledge Assistants

Mohammad Aliannejadi; Zahra Abbasiantaeb; Shubham Chatterjee and; Jeffery Dalton; Leif Azzopardi

arXiv:2405.02637·cs.IR·May 7, 2024

TREC iKAT 2023: A Test Collection for Evaluating Conversational and Interactive Knowledge Assistants

Mohammad Aliannejadi, Zahra Abbasiantaeb, Shubham Chatterjee and, Jeffery Dalton, Leif Azzopardi

PDF

1 Repo

TL;DR

The paper introduces the TREC iKAT 2023 collection, a comprehensive benchmark dataset designed to evaluate conversational search agents in personalized, interactive contexts with diverse user personas and complex relevance assessments.

Contribution

It presents a novel test collection with personalized dialogues, PTKB integration, and multi-dimensional response assessments to advance research in conversational knowledge assistants.

Findings

01

Provides 36 dialogues over 20 topics with relevance assessments

02

Includes evaluations on response relevance, completeness, groundedness, and naturalness

03

Challenges CSA to handle diverse personal contexts and user personas

Abstract

Conversational information seeking has evolved rapidly in the last few years with the development of Large Language Models (LLMs), providing the basis for interpreting and responding in a naturalistic manner to user requests. The extended TREC Interactive Knowledge Assistance Track (iKAT) collection aims to enable researchers to test and evaluate their Conversational Search Agents (CSA). The collection contains a set of 36 personalized dialogues over 20 different topics each coupled with a Personal Text Knowledge Base (PTKB) that defines the bespoke user personas. A total of 344 turns with approximately 26,000 passages are provided as assessments on relevance, as well as additional assessments on generated responses over four key dimensions: relevance, completeness, groundedness, and naturalness. The collection challenges CSA to efficiently navigate diverse personal contexts, elicit…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

irlabamsterdam/iKAT
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSparse Evolutionary Training · Balanced Selection