RMITB at TREC COVID 2020
Rodger Benham, Alistair Moffat, J. Shane Culpepper

TL;DR
This paper analyzes query fusion runs in TREC COVID 2020, emphasizing how judgment depth impacts performance evaluation and highlighting the importance of comprehensive relevance judgments in search assessment.
Contribution
It documents the query fusion runs submitted to TREC COVID and analyzes the impact of judgment omission on performance metrics.
Findings
Omission of certain runs affects performance rankings.
Additional relevance judgments can significantly improve run evaluation.
Judgment depth is crucial for accurate assessment of search effectiveness.
Abstract
Search engine users rarely express an information need using the same query, and small differences in queries can lead to very different result sets. These user query variations have been exploited in past TREC CORE tracks to contribute diverse, highly-effective runs in offline evaluation campaigns with the goal of producing reusable test collections. In this paper, we document the query fusion runs submitted to the first and second round of TREC COVID, using ten queries per topic created by the first author. In our analysis, we focus primarily on the effects of having our second priority run omitted from the judgment pool. This run is of particular interest, as it surfaced a number of relevant documents that were not judged until later rounds of the task. If the additional judgments were included in the first round, the performance of this run increased by 35 rank positions when using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Topic Modeling · Expert finding and Q&A systems
