From Little Things Big Things Grow: A Collection with Seed Studies for   Medical Systematic Review Literature Search

Shuai Wang; Harrisen Scells; Justin Clark; Bevan Koopman; Guido Zuccon

arXiv:2204.03096·cs.IR·April 26, 2022

From Little Things Big Things Grow: A Collection with Seed Studies for Medical Systematic Review Literature Search

Shuai Wang, Harrisen Scells, Justin Clark, Bevan Koopman, Guido Zuccon

PDF

1 Repo

TL;DR

This paper introduces a new test collection of real seed studies used in medical systematic review query formulation, highlighting their importance over pseudo seed studies and evaluating seed-study based retrieval methods.

Contribution

It provides a novel test collection with real seed studies and analyzes their impact on retrieval effectiveness compared to pseudo seed studies.

Findings

01

Real seed studies differ significantly from pseudo seed studies in representing expert queries.

02

Seed-study based methods improve retrieval performance over traditional approaches.

03

The collection enables better evaluation of seed study-guided retrieval techniques.

Abstract

Medical systematic review query formulation is a highly complex task done by trained information specialists. Complexity comes from the reliance on lengthy Boolean queries, which express a detailed research question. To aid query formulation, information specialists use a set of exemplar documents, called `seed studies', prior to query formulation. Seed studies help verify the effectiveness of a query prior to the full assessment of retrieved studies. Beyond this use of seeds, specific IR methods can exploit seed studies for guiding both automatic query formulation and new retrieval models. One major limitation of work to date is that these methods exploit `pseudo seed studies' through retrospective use of included studies (i.e., relevance assessments). However, we show pseudo seed studies are not representative of real seed studies used by information specialists. Hence, we provide a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ielab/sysrev-seed-collection
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.