SLEDGE-Z: A Zero-Shot Baseline for COVID-19 Literature Search

Sean MacAvaney; Arman Cohan; Nazli Goharian

arXiv:2010.05987 · cs.CL · October 14, 2020

TL;DR

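SLEDGE-Z is a zero-shot ranking method for COVID-19 literature search. It filters training data from another collection down to medical-related queries, re-ranks with a neural model pre-trained on scientific text (SciBERT), and filters the target document collection. It ranks top among zero-shot methods on the TREC COVID Round 1 leaderboard without relying on TREC-COVID data.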

Contribution

The paper presents a zero-shot ranking algorithm that adapts to COVID-related scientific literature. Despite not relying on TREC-COVID data, it outperforms models that do, and it serves as a baseline for COVID-19 literature search.

Findings

1. Achieves a P@5 of 0.80 and an nDCG@10 of 0.68 when evaluated on both Round 1 and Round 2 TREC COVID judgments.

2. Outperforms models that rely on TREC-COVID data, despite using none itself.

3. Ranks top among zero-shot methods on the TREC COVID Round 1 leaderboard.
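For readers unfamiliar with the two metrics quoted above, the following is a minimal sketch of how P@k and nDCG@k are commonly computed from graded relevance labels. It is not the paper's evaluation code. The function names are illustrative, the nDCG variant shown (linear gain, log2 discount) is one common formulation, and official evaluation tools may differ in details such as how unjudged documents are treated.

```python
import math
from typing import Optional, Sequence


def precision_at_k(rels: Sequence[int], k: int) -> float:
    """Fraction of the top-k ranked results whose relevance grade is above zero."""
    return sum(1 for r in rels[:k] if r > 0) / k


def dcg_at_k(rels: Sequence[int], k: int) -> float:
    """Discounted cumulative gain: grade at rank i (1-based) divided by log2(i + 1)."""
    return sum(r / math.log2(i + 2) for i, r in enumerate(rels[:k]))


def ndcg_at_k(rels: Sequence[int], k: int,
              all_judged: Optional[Sequence[int]] = None) -> float:
    """DCG of the ranking divided by the DCG of an ideal ordering.

    `all_judged` holds the grades of every judged document for the query;
    if omitted, the ideal ordering is built from `rels` alone (an assumption
    that only makes sense when `rels` covers all relevant documents).
    """
    pool = all_judged if all_judged is not None else rels
    ideal = sorted(pool, reverse=True)
    best = dcg_at_k(ideal, k)
    return dcg_at_k(rels, k) / best if best > 0 else 0.0


# Hypothetical grades for one query's ranked list (0 = not relevant, 2 = highly relevant).
ranked_grades = [2, 0, 1, 1, 0, 1]
p5 = precision_at_k(ranked_grades, 5)
ndcg10 = ndcg_at_k(ranked_grades, 10)
```

Reported scores such as P@5 and nDCG@10 are averages of these per-query values over all evaluation queries.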

Abstract

With worldwide concerns surrounding the Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2), there is a rapidly growing body of scientific literature on the virus. Clinicians, researchers, and policy-makers need to be able to search these articles effectively. In this work, we present a zero-shot ranking algorithm that adapts to COVID-related scientific literature. Our approach filters training data from another collection down to medical-related queries, uses a neural re-ranking model pre-trained on scientific text (SciBERT), and filters the target document collection. This approach ranks top among zero-shot methods on the TREC COVID Round 1 leaderboard, and exhibits a P@5 of 0.80 and an nDCG@10 of 0.68 when evaluated on both Round 1 and 2 judgments. Despite not relying on TREC-COVID data, our method outperforms models that do. As one of the first search methods to thoroughly…
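The three steps named in the abstract (filtering training queries, re-ranking, and filtering the target collection) can be sketched roughly as below. This is an illustration under loud assumptions, not the authors' implementation: the keyword list, the `Doc` fields, the `year`-based predicate, and the token-overlap scorer are all hypothetical stand-ins. In particular, `overlap_score` only takes the place of the SciBERT-based re-ranker so that the example runs without any model download.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence

# Hypothetical term list; the abstract does not say how medical queries are identified.
MEDICAL_TERMS = {"virus", "infection", "symptoms", "vaccine", "disease"}


@dataclass
class Doc:
    doc_id: str
    text: str
    year: int  # hypothetical metadata field used by the example filter


def is_medical(query: str) -> bool:
    """Keep a training query if it mentions any term from the (assumed) list."""
    return any(tok in MEDICAL_TERMS for tok in query.lower().split())


def filter_training_queries(queries: Sequence[str]) -> List[str]:
    """Step 1: narrow a general-domain training set down to medical-related queries."""
    return [q for q in queries if is_medical(q)]


def overlap_score(query: str, text: str) -> float:
    """Stand-in scorer (NOT SciBERT): share of query tokens that appear in the text."""
    q_tokens = set(query.lower().split())
    d_tokens = set(text.lower().split())
    return len(q_tokens & d_tokens) / len(q_tokens) if q_tokens else 0.0


def rerank(query: str, docs: Sequence[Doc], keep: Callable[[Doc], bool],
           score: Callable[[str, str], float] = overlap_score) -> List[Doc]:
    """Steps 2 and 3: drop documents rejected by `keep`, then sort by score."""
    kept = [d for d in docs if keep(d)]
    return sorted(kept, key=lambda d: score(query, d.text), reverse=True)


train = filter_training_queries(["coronavirus infection symptoms", "best pizza near me"])
docs = [
    Doc("a", "pizza recipes", 2020),
    Doc("b", "coronavirus infection in adults", 2020),
    Doc("c", "coronavirus infection symptoms", 2015),
]
# The year cutoff is an illustrative predicate, not a detail taken from the abstract.
ranking = rerank("coronavirus infection", docs, keep=lambda d: d.year >= 2020)
```

Passing the collection filter and the scorer as arguments keeps the sketch independent of any particular model, so a trained neural scorer could be swapped in for `overlap_score` without changing `rerank`.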

Code & Models

Models

Darkrider/covidbert_medmarco (Hugging Face model, 8 downloads)