An Empirical Study on Relation Extraction in the Biomedical Domain

Yongkang Li

arXiv:2112.05910·cs.CL·December 14, 2021

An Empirical Study on Relation Extraction in the Biomedical Domain

Yongkang Li

PDF

Open Access

TL;DR

This paper empirically evaluates relation extraction methods in biomedical texts, revealing their strong generalization but dependence on large labeled datasets, guiding future model development in the domain.

Contribution

It provides the first comprehensive empirical analysis of relation extraction techniques specifically in biomedical research articles.

Findings

01

Document-level methods generalize well across datasets.

02

Existing methods need extensive labeled data for fine-tuning.

03

Insights may inspire development of more effective biomedical relation extraction models.

Abstract

Relation extraction is a fundamental problem in natural language processing. Most existing models are defined for relation extraction in the general domain. However, their performance on specific domains (e.g., biomedicine) is yet unclear. To fill this gap, this paper carries out an empirical study on relation extraction in biomedical research articles. Specifically, we consider both sentence-level and document-level relation extraction, and run a few state-of-the-art methods on several benchmark datasets. Our results show that (1) current document-level relation extraction methods have strong generalization ability; (2) existing methods require a large amount of labeled data for model fine-tuning in biomedicine. Our observations may inspire people in this field to develop more effective models for biomedical relation extraction.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques