Olmo 3

Team Olmo: Allyson Ettinger; Amanda Bertsch; Bailey Kuehl; David Graham; David Heineman; Dirk Groeneveld; Faeze Brahman; Finbarr Timbers; Hamish Ivison; Jacob Morrison; Jake Poznanski; Kyle Lo; Luca Soldaini; Matt Jordan; Mayee Chen; Michael Noukhovitch; Nathan Lambert; Pete Walsh; Pradeep Dasigi; Robert Berry; Saumya Malik; Saurabh Shah; Scott Geng; Shane Arora; Shashank Gupta; Taira Anderson; Teng Xiao; Tyler Murray; Tyler Romero; Victoria Graf; Akari Asai; Akshita Bhagia; Alexander Wettig; Alisa Liu; Aman Rangapur; Chloe Anastasiades; Costa Huang; Dustin Schwenk; Harsh Trivedi; Ian Magnusson; Jaron Lochner; Jiacheng Liu; Lester James V. Miranda; Maarten Sap; Malia Morgan; Michael Schmitz; Michal Guerquin; Michael Wilson; Regan Huff; Ronan Le Bras; Rui Xin; Rulin Shao; Sam Skjonsberg; Shannon Zejiang Shen; Shuyue Stella Li; Tucker Wilde; Valentina Pyatkin; Will Merrill; Yapei Chang; Yuling Gu; Zhiyuan Zeng; Ashish Sabharwal; Luke Zettlemoyer; Pang Wei Koh; Ali Farhadi; Noah A. Smith; Hannaneh Hajishirzi

arXiv:2512.13961·cs.CL·April 15, 2026

Olmo 3

Team Olmo: Allyson Ettinger, Amanda Bertsch, Bailey Kuehl, David Graham, David Heineman, Dirk Groeneveld, Faeze Brahman, Finbarr Timbers, Hamish Ivison, Jacob Morrison, Jake Poznanski, Kyle Lo, Luca Soldaini, Matt Jordan, Mayee Chen, Michael Noukhovitch, Nathan Lambert

PDF

1 Repo 32 Models 50 Datasets

TL;DR

Olmo 3 introduces a family of fully open, large-scale language models designed for advanced reasoning, coding, and chat, with comprehensive details of their development process included.

Contribution

It presents the first complete lifecycle release of fully open models at 7B and 32B scales, emphasizing long-context reasoning and instruction following.

Findings

01

Olmo 3 Think 32B is the strongest fully-open thinking model to date.

02

The release includes all stages, checkpoints, and data used in model development.

Abstract

We introduce Olmo 3, a family of state-of-the-art, fully-open language models at the 7B and 32B parameter scales. Olmo 3 model construction targets long-context reasoning, function calling, coding, instruction following, general chat, and knowledge recall. This release includes the entire model flow, i.e., the full lifecycle of the family of models, including every stage, checkpoint, data point, and dependency used to build it. Our flagship model, Olmo 3 Think 32B, is the strongest fully-open thinking model released to-date.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

allenai/olmo-core
github

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.