VANER: Leveraging Large Language Model for Versatile and Adaptive Biomedical Named Entity Recognition
Junyi Biana, Weiqi Zhai, Xiaodi Huang, Jiaxuan Zheng, Shanfeng Zhu

TL;DR
VANER leverages LLaMA2 with instruction tuning and external knowledge bases to create a versatile biomedical NER model that outperforms previous LLM-based and traditional systems across multiple datasets.
Contribution
This paper introduces VANER, a novel LLM-based BioNER model that combines instruction tuning, external knowledge, and dataset mixing to enhance entity recognition performance.
Findings
VANER surpasses previous LLM-based BioNER models in F1 scores.
VANER outperforms many traditional BioNER systems on three datasets.
Instruction tuning and external knowledge integration are effective for biomedical NER.
Abstract
Prevalent solution for BioNER involves using representation learning techniques coupled with sequence labeling. However, such methods are inherently task-specific, demonstrate poor generalizability, and often require dedicated model for each dataset. To leverage the versatile capabilities of recently remarkable large language models (LLMs), several endeavors have explored generative approaches to entity extraction. Yet, these approaches often fall short of the effectiveness of previouly sequence labeling approaches. In this paper, we utilize the open-sourced LLM LLaMA2 as the backbone model, and design specific instructions to distinguish between different types of entities and datasets. By combining the LLM's understanding of instructions with sequence labeling techniques, we use mix of datasets to train a model capable of extracting various types of entities. Given that the backbone…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
