Knowledge-augmented Pre-trained Language Models for Biomedical Relation Extraction

Mario S\"anger; Ulf Leser

arXiv:2505.00814·cs.CL·November 4, 2025

Knowledge-augmented Pre-trained Language Models for Biomedical Relation Extraction

Mario S\"anger, Ulf Leser

PDF

Open Access 1 Repo

TL;DR

This study evaluates how different pre-trained language models and contextual data enhancements affect biomedical relation extraction, highlighting the importance of model choice and hyperparameter tuning for optimal performance.

Contribution

It provides a comprehensive evaluation framework for PLMs with contextual information in biomedical RE, including extensive hyperparameter optimization and analysis of external data impact.

Findings

01

Model choice and hyperparameter tuning are crucial for RE performance.

02

External contextual data benefits smaller PLMs significantly.

03

Inclusion of context information yields minor overall improvements.

Abstract

Automatic relationship extraction (RE) from biomedical literature is critical for managing the vast amount of scientific knowledge produced each year. In recent years, utilizing pre-trained language models (PLMs) has become the prevalent approach in RE. Several studies report improved performance when incorporating additional context information while fine-tuning PLMs for RE. However, variations in the PLMs applied, the databases used for augmentation, hyper-parameter optimization, and evaluation methods complicate direct comparisons between studies and raise questions about the generalizability of these findings. Our study addresses this research gap by evaluating PLMs enhanced with contextual information on five datasets spanning four relation scenarios within a consistent evaluation framework. We evaluate three baseline PLMs and first conduct extensive hyperparameter optimization.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mariosaenger/biore-kplm-benchmark
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Biomedical Text Mining and Ontologies