Bioalignment: Measuring and Improving LLM Disposition Toward Biological Systems for AI Safety
Trent R Northen, Mingxun Wang

TL;DR
This paper evaluates biases in large language models towards synthetic solutions over biological ones, and demonstrates that targeted fine-tuning can increase models' preference for biological approaches without harming their overall performance.
Contribution
It introduces a novel bioalignment benchmark, shows that fine-tuning with biological-focused data shifts model biases, and provides resources for further research.
Findings
Most models favor synthetic solutions according to the bioalignment metric.
Fine-tuning with biological data significantly increases preference for biological approaches.
The approach does not degrade the models' general capabilities.
Abstract
Large language models (LLMs) trained on internet-scale corpora can exhibit systematic biases that increase the probability of unwanted behavior. In this study, we examined potential biases towards synthetic vs. biological technological solutions across four domains (materials, energy, manufacturing, and algorithms). A sample of 5 frontier and 5 open-weight models were measured using 50 curated Bioalignment prompts with a Kelly criterion-inspired evaluation framework. According to this metric, most models were not bioaligned in that they exhibit biases in favor of synthetic (non-biological) solutions. We next examined if fine-tuning could increase the preferences of two open-weight models, Llama 3.2-3B-Instruct and Qwen2.5-3B-Instruct, for biological-based approaches. A curated corpus of ~22M tokens from 6,636 PMC articles emphasizing biological problem-solving was used first to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Ethics and Social Impacts of AI · Biomedical Text Mining and Ontologies
