What Do Biomedical NER and Entity Linking Benchmarks Measure? A Corpus-Centric Diagnostic Framework

Robert Leaman; Rezarta Islamaj; Zhiyong Lu

arXiv:2605.20537·cs.CL·May 21, 2026

What Do Biomedical NER and Entity Linking Benchmarks Measure? A Corpus-Centric Diagnostic Framework

Robert Leaman, Rezarta Islamaj, Zhiyong Lu

PDF

1 Repo

TL;DR

This paper introduces a diagnostic framework to analyze biomedical NER and entity linking benchmarks, revealing significant corpus differences that impact evaluation and generalization.

Contribution

It presents a novel, open-source, corpus-centric diagnostic framework for characterizing biomedical NER and EL benchmarks beyond traditional statistics.

Findings

01

Corpus properties vary significantly across datasets.

02

Differences affect evaluation signals and generalization.

03

Standard statistics may be insufficient for benchmark characterization.

Abstract

Biomedical named entity recognition (NER) and entity linking (EL) strongly depend on annotated corpora, but the utility of these resources for benchmarking is often assumed rather than characterized. We present a corpus-centric framework for diagnosing benchmark-relevant properties directly from corpus annotations, concept links, train-test splits, document metadata, and terminology mappings. The framework organizes standardized statistics into five families: (1) scale, density and label distribution, (2) lexical and conceptual structure, (3) train-test overlap, (4) metadata composition, and (5) terminology coverage where applicable. Applying the framework to nine corpora spanning diseases, chemicals, and cell types, we find that corpus properties can differ substantially, even when they address the same apparent task. We find differences in the evaluation signal they provide, the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

null
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.