GLiNER-BioMed: A Suite of Efficient Models for Open Biomedical Named Entity Recognition
Anthony Yazdani, Ihor Stepanov, Douglas Teodoro

TL;DR
GLiNER-BioMed introduces efficient, domain-adapted models for biomedical NER that leverage natural language labels and synthetic data to achieve superior zero- and few-shot recognition performance.
Contribution
The paper presents a novel suite of models that use natural language labels for zero-shot recognition and synthetic data generation, improving biomedical NER beyond traditional fixed-taxonomy approaches.
Findings
Outperforms state-of-the-art in zero- and few-shot scenarios
Achieves 5.96% higher F1-score than strongest baseline
Synthetic data generation enhances recognition accuracy
Abstract
Biomedical named entity recognition (NER) presents unique challenges due to specialized vocabularies, the sheer volume of entities, and the continuous emergence of novel entities. Traditional NER models, constrained by fixed taxonomies and human annotations, struggle to generalize beyond predefined entity types. To address these issues, we introduce GLiNER-BioMed, a domain-adapted suite of Generalist and Lightweight Model for NER (GLiNER) models specifically tailored for biomedicine. In contrast to conventional approaches, GLiNER uses natural language labels to infer arbitrary entity types, enabling zero-shot recognition. Our approach first distills the annotation capabilities of large language models (LLMs) into a smaller, more efficient model, enabling the generation of high-coverage synthetic biomedical NER data. We subsequently train two GLiNER architectures, uni- and bi-encoder, at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Ihor/OpenBioLLM-Text2Graph-8Bmodel· 5 dl5 dl
- 🤗Ihor/gliner-biomed-small-v1.0model· 217 dl· ♡ 3217 dl♡ 3
- 🤗Ihor/gliner-biomed-base-v1.0model· 118 dl· ♡ 5118 dl♡ 5
- 🤗Ihor/gliner-biomed-large-v1.0model· 234 dl· ♡ 14234 dl♡ 14
- 🤗Ihor/gliner-biomed-bi-small-v1.0model· 19 dl· ♡ 319 dl♡ 3
- 🤗Ihor/gliner-biomed-bi-base-v1.0model· 35 dl· ♡ 135 dl♡ 1
- 🤗Ihor/gliner-biomed-bi-large-v1.0model· 102 dl· ♡ 1102 dl♡ 1
- 🤗anthonyyazdaniml/gliner-biomed-large-v1.0-medication-regimen-nermodel· 3 dl· ♡ 13 dl♡ 1
- 🤗anthonyyazdaniml/gliner-biomed-bi-large-v1.0-medication-regimen-nermodel
- 🤗anthonyyazdaniml/gliner-biomed-large-v1.0-disease-chemical-gene-variant-species-cellline-nermodel· 16 dl· ♡ 216 dl♡ 2
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
