Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety

Trent R Northen; Mingxun Wang

arXiv:2603.09154·cs.CL·March 11, 2026

Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety

Trent R Northen, Mingxun Wang

PDF

Open Access 1 Models

TL;DR

This paper evaluates biases in large language models towards synthetic solutions over biological ones, and demonstrates that targeted fine-tuning can increase models' preference for biological approaches without harming their overall performance.

Contribution

It introduces a novel bioalignment benchmark, shows that fine-tuning with biological-focused data shifts model biases, and provides resources for further research.

Findings

01

Most models favor synthetic solutions according to the bioalignment metric.

02

Fine-tuning with biological data significantly increases preference for biological approaches.

03

The approach does not degrade the models' general capabilities.

Abstract

Large language models (LLMs) trained on internet-scale corpora can exhibit systematic biases that increase the probability of unwanted behavior. In this study, we examined potential biases towards synthetic vs. biological technological solutions across four domains (materials, energy, manufacturing, and algorithms). A sample of 5 frontier and 5 open-weight models were measured using 50 curated Bioalignment prompts with a Kelly criterion-inspired evaluation framework. According to this metric, most models were not bioaligned in that they exhibit biases in favor of synthetic (non-biological) solutions. We next examined if fine-tuning could increase the preferences of two open-weight models, Llama 3.2-3B-Instruct and Qwen2.5-3B-Instruct, for biological-based approaches. A curated corpus of ~22M tokens from 6,636 PMC articles emphasizing biological problem-solving was used first to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
Bioaligned/Qwen-2.5-3B-instruct-bioaligned-qlora
model· 23 dl
23 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Ethics and Social Impacts of AI · Biomedical Text Mining and Ontologies