An Evaluation of Two Commercial Deep Learning-Based Information Retrieval Systems for COVID-19 Literature
Sarvesh Soni, Kirk Roberts

TL;DR
This study empirically compares two commercial COVID-19 literature search engines from Google and Amazon with academic prototypes, revealing that top academic systems outperform commercial ones in retrieval effectiveness, informing future biomedical search development.
Contribution
It provides a comparative evaluation of commercial and academic COVID-19 literature search engines, highlighting the superior performance of academic systems in this context.
Findings
TREC-COVID top system outperformed others on all metrics
Academic prototypes showed higher retrieval effectiveness
Results inform future development of biomedical search tools
Abstract
The COVID-19 pandemic has resulted in a tremendous need for access to the latest scientific information, primarily through the use of text mining and search tools. This has led to both corpora for biomedical articles related to COVID-19 (such as the CORD-19 corpus (Wang et al., 2020)) as well as search engines to query such data. While most research in search engines is performed in the academic field of information retrieval (IR), most academic search enginesthough rigorously evaluatedare sparsely utilized, while major commercial web search engines (e.g., Google, Bing) dominate. This relates to COVID-19 because it can be expected that commercial search engines deployed for the pandemic will gain much higher traction than those produced in academic labs, and thus leads to questions about the empirical performance of these search tools. This paper seeks…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
