Augmenting Biomedical Named Entity Recognition with General-domain Resources
Yu Yin, Hyunjae Kim, Xiao Xiao, Chih Hsuan Wei, Jaewoo Kang, Zhiyong, Lu, Hua Xu, Meng Fang, Qingyu Chen

TL;DR
This paper presents GERBERA, a transfer learning approach that leverages general-domain NER datasets to improve biomedical NER performance, especially when biomedical data is limited or costly to annotate.
Contribution
GERBERA introduces a simple transfer learning method using general-domain NER data to enhance biomedical NER models, reducing reliance on extensive biomedical annotations.
Findings
GERBERA outperforms baseline models on 6 of 8 entity types.
Achieves an average of 0.9% F1 score improvement across entities.
Significantly boosts performance on datasets with limited biomedical data, up to 4.7% F1 increase.
Abstract
Training a neural network-based biomedical named entity recognition (BioNER) model usually requires extensive and costly human annotations. While several studies have employed multi-task learning with multiple BioNER datasets to reduce human effort, this approach does not consistently yield performance improvements and may introduce label ambiguity in different biomedical corpora. We aim to tackle those challenges through transfer learning from easily accessible resources with fewer concept overlaps with biomedical datasets. We proposed GERBERA, a simple-yet-effective method that utilized general-domain NER datasets for training. We performed multi-task learning to train a pre-trained biomedical language model with both the target BioNER dataset and the general-domain dataset. Subsequently, we fine-tuned the models specifically for the BioNER dataset. We systematically evaluated GERBERA…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
