LongEval-Retrieval: French-English Dynamic Test Collection for Continuous Web Search Evaluation
Petra Galu\v{s}\v{c}\'akov\'a Romain Deveaud, Gabriela Gonzalez-Saez,, Philippe Mulhem, Lorraine Goeuriot, Florina Piroi, Martin Popel

TL;DR
LongEval-Retrieval introduces a dynamic, continuous Web search evaluation benchmark based on evolving document collections, queries, and relevance, using data from Qwant to study system persistence over time.
Contribution
It presents a novel dynamic test collection for continuous Web search evaluation, incorporating temporal evolution and bilingual data from French to English.
Findings
Constructed from Qwant search data in French and English
Provides baseline retrieval results and analysis
Simulates real-world evolving Web search environments
Abstract
LongEval-Retrieval is a Web document retrieval benchmark that focuses on continuous retrieval evaluation. This test collection is intended to be used to study the temporal persistence of Information Retrieval systems and will be used as the test collection in the Longitudinal Evaluation of Model Performance Track (LongEval) at CLEF 2023. This benchmark simulates an evolving information system environment - such as the one a Web search engine operates in - where the document collection, the query distribution, and relevance all move continuously, while following the Cranfield paradigm for offline evaluation. To do that, we introduce the concept of a dynamic test collection that is composed of successive sub-collections each representing the state of an information system at a given time step. In LongEval-Retrieval, each sub-collection contains a set of queries, documents, and soft…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Data Quality and Management · Web Data Mining and Analysis
MethodsTest
