Exploring the Effectiveness of Instruction Tuning in Biomedical Language   Processing

Omid Rohanian; Mohammadmahdi Nouriborji; David A. Clifton

arXiv:2401.00579·cs.CL·January 2, 2024·Artif. Intell. Medicine·2 cites

Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing

Omid Rohanian, Mohammadmahdi Nouriborji, David A. Clifton

PDF

Open Access 1 Models

TL;DR

This paper explores how instruction tuning can enhance large language models for biomedical NLP tasks, demonstrating competitive performance with specialized models through a large, curated instruction dataset.

Contribution

It introduces a comprehensive instruction-tuned model for biomedical NLP, utilizing a large curated dataset, and analyzes its effectiveness compared to specialized models.

Findings

01

Instruction tuning improves biomedical NLP performance.

02

The curated dataset enhances model adaptability.

03

Results are comparable to specialized biomedical models.

Abstract

Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evolving. In this context, our study investigates the potential of instruction tuning for biomedical language processing, applying this technique to two general LLMs of substantial scale. We present a comprehensive, instruction-based model trained on a dataset that consists of approximately $200, 000$ instruction-focused samples. This dataset represents a carefully curated compilation of existing data, meticulously adapted and reformatted to align with the specific requirements of our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
pkhare/qwen3-8b-biomedical
model· 6 dl
6 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification

MethodsALIGN