TL;DR
This paper systematically analyzes the generalization abilities of biomedical NER models, revealing their limitations in recognizing unseen entities and proposing a debiasing method to improve their performance on novel mentions.
Contribution
It provides a comprehensive analysis of BioNER models' recognition abilities and introduces a simple debiasing technique to enhance their generalization to unseen entities.
Findings
Current models overestimate their generalization abilities.
Models struggle with recognizing novel biomedical names and synonyms.
Debiasing improves model performance on unseen mentions.
Abstract
The number of biomedical literature on new biomedical concepts is rapidly increasing, which necessitates a reliable biomedical named entity recognition (BioNER) model for identifying new and unseen entity mentions. However, it is questionable whether existing models can effectively handle them. In this work, we systematically analyze the three types of recognition abilities of BioNER models: memorization, synonym generalization, and concept generalization. We find that although current best models achieve state-of-the-art performance on benchmarks based on overall performance, they have limitations in identifying synonyms and new biomedical concepts, indicating they are overestimated in terms of their generalization abilities. We also investigate failure cases of models and identify several difficulties in recognizing unseen mentions in biomedical literature as follows: (1) models tend…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
