CO-Search: COVID-19 Information Retrieval with Semantic Search, Question Answering, and Abstractive Summarization
Andre Esteva, Anuprit Kale, Romain Paulus, Kazuma Hashimoto, Wenpeng, Yin, Dragomir Radev, Richard Socher

TL;DR
CO-Search is a semantic search engine tailored for COVID-19 literature, combining multiple retrieval and summarization techniques to assist health workers in quickly finding relevant scientific information during the pandemic.
Contribution
The paper introduces CO-Search, a novel retrieval system integrating semantic search, question answering, and summarization specifically designed for COVID-19 literature, with a unique training approach using citation graphs.
Findings
Achieved top performance on TREC-COVID datasets
Effective retrieval with high precision and recall metrics
Demonstrated utility in supporting health workers during COVID-19 crisis
Abstract
The COVID-19 global pandemic has resulted in international efforts to understand, track, and mitigate the disease, yielding a significant corpus of COVID-19 and SARS-CoV-2-related publications across scientific disciplines. As of May 2020, 128,000 coronavirus-related publications have been collected through the COVID-19 Open Research Dataset Challenge. Here we present CO-Search, a retriever-ranker semantic search engine designed to handle complex queries over the COVID-19 literature, potentially aiding overburdened health workers in finding scientific answers during a time of crisis. The retriever is built from a Siamese-BERT encoder that is linearly composed with a TF-IDF vectorizer, and reciprocal-rank fused with a BM25 vectorizer. The ranker is composed of a multi-hop question-answering module, that together with a multi-paragraph abstractive summarizer adjust retriever scores. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Multimodal Machine Learning Applications · Misinformation and Its Impacts
