How Do Your Biomedical Named Entity Recognition Models Generalize to   Novel Entities?

Hyunjae Kim; Jaewoo Kang

arXiv:2101.00160·cs.CL·March 15, 2022

How Do Your Biomedical Named Entity Recognition Models Generalize to Novel Entities?

Hyunjae Kim, Jaewoo Kang

PDF

1 Repo

TL;DR

This paper systematically analyzes the generalization abilities of biomedical NER models, revealing their limitations in recognizing unseen entities and proposing a debiasing method to improve their performance on novel mentions.

Contribution

It provides a comprehensive analysis of BioNER models' recognition abilities and introduces a simple debiasing technique to enhance their generalization to unseen entities.

Findings

01

Current models overestimate their generalization abilities.

02

Models struggle with recognizing novel biomedical names and synonyms.

03

Debiasing improves model performance on unseen mentions.

Abstract

The number of biomedical literature on new biomedical concepts is rapidly increasing, which necessitates a reliable biomedical named entity recognition (BioNER) model for identifying new and unseen entity mentions. However, it is questionable whether existing models can effectively handle them. In this work, we systematically analyze the three types of recognition abilities of BioNER models: memorization, synonym generalization, and concept generalization. We find that although current best models achieve state-of-the-art performance on benchmarks based on overall performance, they have limitations in identifying synonyms and new biomedical concepts, indicating they are overestimated in terms of their generalization abilities. We also investigate failure cases of models and identify several difficulties in recognizing unseen mentions in biomedical literature as follows: (1) models tend…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dmis-lab/bioner-generalization
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.