A Search Engine for Discovery of Scientific Challenges and Directions
Dan Lahav, Jon Saad Falcon, Bailey Kuehl, Sophie Johnson, Sravanthi, Parasa, Noam Shomron, Duen Horng Chau, Diyi Yang, Eric Horvitz, Daniel S., Weld, Tom Hope

TL;DR
This paper introduces a novel search engine for scientific challenges and directions, using an expert-annotated corpus and models trained to facilitate rapid knowledge discovery across disciplines, especially in biomedicine and AI.
Contribution
It presents a new task, annotated dataset, and search engine for extracting and searching scientific challenges and directions, improving discovery over existing tools.
Findings
Outperforms popular scientific search engines in aiding knowledge discovery.
Models trained on the dataset generalize across biomedical and AI domains.
The resource is publicly available for broader research use.
Abstract
Keeping track of scientific challenges, advances and emerging directions is a fundamental part of research. However, researchers face a flood of papers that hinders discovery of important knowledge. In biomedicine, this directly impacts human lives. To address this problem, we present a novel task of extraction and search of scientific challenges and directions, to facilitate rapid knowledge discovery. We construct and release an expert-annotated corpus of texts sampled from full-length papers, labeled with novel semantic categories that generalize across many types of challenges and directions. We focus on a large corpus of interdisciplinary work relating to the COVID-19 pandemic, ranging from biomedicine to areas such as AI and economics. We apply a model trained on our data to identify challenges and directions across the corpus and build a dedicated search engine. In experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Misinformation and Its Impacts · Biomedical Text Mining and Ontologies
