On-the-fly Definition Augmentation of LLMs for Biomedical NER

Monica Munnangi; Sergey Feldman; Byron C Wallace; Silvio Amir; Tom; Hope; Aakanksha Naik

arXiv:2404.00152·cs.CL·April 24, 2024·2 cites

On-the-fly Definition Augmentation of LLMs for Biomedical NER

Monica Munnangi, Sergey Feldman, Byron C Wallace, Silvio Amir, Tom, Hope, Aakanksha Naik

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a method for enhancing biomedical named entity recognition in large language models by dynamically incorporating relevant definitions, significantly improving performance especially in limited data scenarios.

Contribution

It proposes a novel on-the-fly definition augmentation technique for LLMs that improves biomedical NER accuracy and explores effective prompting strategies for knowledge integration.

Findings

01

Definition augmentation yields a 15% relative F1 score improvement in GPT-4.

02

Careful prompting strategies enhance LLM performance beyond fine-tuned models.

03

The approach is effective for both open source and closed LLMs.

Abstract

Despite their general capabilities, LLMs still struggle on biomedical NER tasks, which are difficult due to the presence of specialized terminology and lack of training data. In this work we set out to improve LLM performance on biomedical NER in limited data settings via a new knowledge augmentation approach which incorporates definitions of relevant concepts on-the-fly. During this process, to provide a test bed for knowledge augmentation, we perform a comprehensive exploration of prompting strategies. Our experiments show that definition augmentation is useful for both open source and closed LLMs. For example, it leads to a relative improvement of 15\% (on average) in GPT-4 performance (F1) across all (six) of our test datasets. We conduct extensive ablations and analyses to demonstrate that our performance improvements stem from adding relevant definitional knowledge. We find that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allenai/beacon
noneOfficial

Videos

On-the-fly Definition Augmentation of LLMs for Biomedical NER· underline

Taxonomy

TopicsAdvanced Sensor Technologies Research · Magnetic Field Sensors Techniques · Non-Destructive Testing Techniques

MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Layer Normalization · Multi-Head Attention · Adam · Byte Pair Encoding · Absolute Position Encodings · Softmax · Dense Connections