Training a Scientific Reasoning Model for Chemistry

Siddharth M. Narayanan; James D. Braza; Ryan-Rhys Griffiths; Albert Bou; Geemi Wellawatte; Mayk Caldas Ramos; Ludovico Mitchener; Samuel G. Rodriques; Andrew D. White

arXiv:2506.17238·cs.LG·December 3, 2025

Training a Scientific Reasoning Model for Chemistry

Siddharth M. Narayanan, James D. Braza, Ryan-Rhys Griffiths, Albert Bou, Geemi Wellawatte, Mayk Caldas Ramos, Ludovico Mitchener, Samuel G. Rodriques, Andrew D. White

PDF

1 Models 1 Datasets 1 Video

TL;DR

This paper introduces ether0, a large language model trained for scientific reasoning in chemistry, capable of understanding and generating chemical structures and solving diverse chemistry problems with high accuracy and data efficiency.

Contribution

The authors demonstrate that reasoning models can be effectively adapted to chemistry without domain-specific pretraining, using reinforcement learning on a large, diverse dataset.

Findings

01

ether0 outperforms existing chemistry models and human experts on molecular design tasks.

02

The model requires significantly less data than specialized models.

03

It can reason in natural language and generate chemical structures.

Abstract

Reasoning models are large language models that emit a long chain-of-thought before answering, providing both higher accuracy and explicit reasoning for their response. A major question has been whether language model reasoning generalizes beyond mathematics, programming, and logic, where most previous work has focused. We demonstrate that reasoning models can be post-trained for chemistry without additional domain pretraining, and require substantially less data compared to contemporary domain-specific models. We report ether0, a 24B parameter LLM (based on Mistral-Small-24B) that can reason in natural language and respond with chemical structures. This reasoning model was trained with reinforcement learning on 640,730 experimentally-grounded chemistry problems across 375 tasks ranging from synthesizability, to blood-brain barrier permeability, to human receptor activity, to scent. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

🤗
futurehouse/ether0
model· 210 dl· ♡ 69
210 dl♡ 69

Datasets

futurehouse/ether0-benchmark
dataset· 287 dl
287 dl

Videos

Training a Scientific Reasoning Model for Chemistry· slideslive