Building a Relation Extraction Baseline for Gene-Disease Associations: A Reproducibility Study
Laura Menotti

TL;DR
This paper reproduces the DEXTER system for extracting gene-disease associations from biomedical texts to establish a reproducible benchmark for relation extraction research.
Contribution
It provides a reproducibility study of DEXTER, creating a benchmark dataset for future gene-disease relation extraction research.
Findings
Reproduced DEXTER system successfully
Established a benchmark for GDA relation extraction
Facilitates comparison of future methods
Abstract
Reproducibility is an important task in scientific research. It is crucial for researchers to compare newly developed systems with the state-of-the-art to assess whether they made a breakthrough. However previous works may not be immediately reproducible, for example due to the lack of source code. In this work we reproduce DEXTER, a system to automatically extract Gene-Disease Associations (GDAs) from biomedical abstracts. The goal is to provide a benchmark for future works regarding Relation Extraction (RE), enabling researchers to test and compare their results.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Semantic Web and Ontologies · Scientific Computing and Data Management
MethodsTest
