Aloe: A Family of Fine-tuned Open Healthcare LLMs

Ashwin Kumar Gururajan; Enrique Lopez-Cuena; Jordi Bayarri-Planas,; Adrian Tormos; Daniel Hinjos; Pablo Bernabeu-Perez; Anna Arias-Duart; Pablo; Agustin Martin-Torres; Lucia Urcelay-Ganzabal; Marta Gonzalez-Mallo; Sergio; Alvarez-Napagao; Eduard Ayguad\'e-Parra; Ulises Cort\'es Dario; Garcia-Gasulla

arXiv:2405.01886·cs.CL·May 6, 2024·6 cites

Aloe: A Family of Fine-tuned Open Healthcare LLMs

Ashwin Kumar Gururajan, Enrique Lopez-Cuena, Jordi Bayarri-Planas,, Adrian Tormos, Daniel Hinjos, Pablo Bernabeu-Perez, Anna Arias-Duart, Pablo, Agustin Martin-Torres, Lucia Urcelay-Ganzabal, Marta Gonzalez-Mallo, Sergio, Alvarez-Napagao, Eduard Ayguad\'e-Parra

PDF

Open Access 1 Repo 4 Models 5 Datasets

TL;DR

Aloe introduces a family of open-source healthcare LLMs that are fine-tuned, aligned, and optimized with advanced inference techniques, achieving state-of-the-art performance and setting new standards for ethical and safe medical AI models.

Contribution

The paper presents Aloe, a new set of open healthcare LLMs trained on enhanced datasets, aligned with policy, and evaluated with comprehensive bias, toxicity, and risk assessments, along with improved inference strategies.

Findings

01

Aloe models achieve state-of-the-art results among open healthcare LLMs at 7B scale.

02

The models demonstrate improved ethical alignment and reduced bias and toxicity.

03

Advanced prompt engineering significantly boosts performance across benchmarks.

Abstract

As the capabilities of Large Language Models (LLMs) in healthcare and medicine continue to advance, there is a growing need for competitive open-source models that can safeguard public interest. With the increasing availability of highly competitive open base models, the impact of continued pre-training is increasingly uncertain. In this work, we explore the role of instruct tuning, model merging, alignment, red teaming and advanced inference schemes, as means to improve current open models. To that end, we introduce the Aloe family, a set of open medical LLMs highly competitive within its scale range. Aloe models are trained on the current best base models (Mistral, LLaMA 3), using a new custom dataset which combines public data sources improved with synthetic Chain of Thought (CoT). Aloe models undergo an alignment phase, becoming one of the first few policy-aligned open healthcare…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hpai-bsc/prompt_engine
none

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPhytochemistry and biological activity of medicinal plants

MethodsSparse Evolutionary Training · Balanced Selection · LLaMA