gpt-oss-120b & gpt-oss-20b Model Card

OpenAI: Sandhini Agarwal; Lama Ahmad; Jason Ai; Sam Altman; Andy Applebaum; Edwin Arbus; Rahul K. Arora; Yu Bai; Bowen Baker; Haiming Bao; Boaz Barak; Ally Bennett; Tyler Bertao; Nivedita Brett; Eugene Brevdo; Greg Brockman; Sebastien Bubeck; Che Chang; Kai Chen; Mark Chen; Enoch Cheung; Aidan Clark; Dan Cook; Marat Dukhan; Casey Dvorak; Kevin Fives; Vlad Fomenko; Timur Garipov; Kristian Georgiev; Mia Glaese; Tarun Gogineni; Adam Goucher; Lukas Gross; Katia Gil Guzman; John Hallman; Jackie Hehir; Johannes Heidecke; Alec Helyar; Haitang Hu; Romain Huet; Jacob Huh; Saachi Jain; Zach Johnson; Chris Koch; Irina Kofman; Dominik Kundel; Jason Kwon; Volodymyr Kyrylov; Elaine Ya Le; Guillaume Leclerc; James Park Lennon; Scott Lessans; Mario Lezcano-Casado; Yuanzhi Li; Zhuohan Li; Ji Lin; Jordan Liss; Lily (Xiaoxuan) Liu; Jiancheng Liu; Kevin Lu; Chris Lu; Zoran Martinovic; Lindsay McCallum; Josh McGrath; Scott McKinney; Aidan McLaughlin; Song Mei; Steve Mostovoy; Tong Mu; Gideon Myles; Alexander Neitz; Alex Nichol; Jakub Pachocki; Alex Paino; Dana Palmie; Ashley Pantuliano; Giambattista Parascandolo; Jongsoo Park; Leher Pathak; Carolina Paz; Ludovic Peran; Dmitry Pimenov; Michelle Pokrass; Elizabeth Proehl; Huida Qiu; Gaby Raila; Filippo Raso; Hongyu Ren; Kimmy Richardson; David Robinson; Bob Rotsted; Hadi Salman; Suvansh Sanjeev; Max Schwarzer; D. Sculley; Harshit Sikchi; Kendal Simon; Karan Singhal; Yang Song; Dane Stuckey; Zhiqing Sun; Philippe Tillet; Sam Toizer; Foivos Tsimpourlas; Nikhil Vyas; Eric Wallace; Xin Wang; Miles Wang; Olivia Watkins; Kevin Weil; Amy Wendling; Kevin Whinnery; Cedric Whitney; Hannah Wong; Lin Yang; Yu Yang; Michihiro Yasunaga; Kristen Ying; Wojciech Zaremba; Wenting Zhan; Cyril Zhang; Brian Zhang; Eddie Zhang; Shengjia Zhao

arXiv:2508.10925·cs.CL·August 18, 2025

gpt-oss-120b & gpt-oss-20b Model Card

OpenAI: Sandhini Agarwal, Lama Ahmad, Jason Ai, Sam Altman, Andy Applebaum, Edwin Arbus, Rahul K. Arora, Yu Bai, Bowen Baker, Haiming Bao, Boaz Barak, Ally Bennett, Tyler Bertao, Nivedita Brett, Eugene Brevdo, Greg Brockman, Sebastien Bubeck, Che Chang, Kai Chen, Mark Chen

PDF

10 Models 4 Datasets

TL;DR

This paper introduces gpt-oss-120b and gpt-oss-20b, open-weight reasoning models with high accuracy and efficiency, designed for agentic tasks and released for broad research use.

Contribution

The paper presents two large open-weight models with an efficient architecture, trained with distillation and reinforcement learning, emphasizing agentic capabilities and open release.

Findings

01

Achieve strong benchmark results in mathematics, coding, and safety

02

Use an efficient mixture-of-expert transformer architecture

03

Support agentic functionalities like browsing and tool use

Abstract

We present gpt-oss-120b and gpt-oss-20b, two open-weight reasoning models that push the frontier of accuracy and inference cost. The models use an efficient mixture-of-expert transformer architecture and are trained using large-scale distillation and reinforcement learning. We optimize the models to have strong agentic capabilities (deep research browsing, python tool use, and support for developer-provided functions), all while using a rendered chat format that enables clear instruction following and role delineation. Both models achieve strong results on benchmarks ranging from mathematics, coding, and safety. We release the model weights, inference implementations, tool environments, and tokenizers under an Apache 2.0 license to enable broad use and further research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.