gpt-oss-120b & gpt-oss-20b Model Card
OpenAI: Sandhini Agarwal, Lama Ahmad, Jason Ai, Sam Altman, Andy Applebaum, Edwin Arbus, Rahul K. Arora, Yu Bai, Bowen Baker, Haiming Bao, Boaz Barak, Ally Bennett, Tyler Bertao, Nivedita Brett, Eugene Brevdo, Greg Brockman, Sebastien Bubeck, Che Chang, Kai Chen, Mark Chen

TL;DR
This paper introduces gpt-oss-120b and gpt-oss-20b, open-weight reasoning models with high accuracy and efficiency, designed for agentic tasks and released for broad research use.
Contribution
The paper presents two large open-weight models with an efficient architecture, trained with distillation and reinforcement learning, emphasizing agentic capabilities and open release.
Findings
Achieve strong benchmark results in mathematics, coding, and safety
Use an efficient mixture-of-expert transformer architecture
Support agentic functionalities like browsing and tool use
Abstract
We present gpt-oss-120b and gpt-oss-20b, two open-weight reasoning models that push the frontier of accuracy and inference cost. The models use an efficient mixture-of-expert transformer architecture and are trained using large-scale distillation and reinforcement learning. We optimize the models to have strong agentic capabilities (deep research browsing, python tool use, and support for developer-provided functions), all while using a rendered chat format that enables clear instruction following and role delineation. Both models achieve strong results on benchmarks ranging from mathematics, coding, and safety. We release the model weights, inference implementations, tool environments, and tokenizers under an Apache 2.0 license to enable broad use and further research.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗nvidia/gpt-oss-puzzle-88Bmodel· 15k dl· ♡ 8915k dl♡ 89
- 🤗openai/gpt-oss-120bmodel· 4.1M dl· ♡ 46334.1M dl♡ 4633
- 🤗openai/gpt-oss-20bmodel· 6.1M dl· ♡ 44956.1M dl♡ 4495
- 🤗p-e-w/gpt-oss-20b-hereticmodel· 1.6k dl· ♡ 1061.6k dl♡ 106
- 🤗openai/gpt-oss-safeguard-20bmodel· 48k dl· ♡ 20348k dl♡ 203
- 🤗p-e-w/gpt-oss-20b-heretic-ara-v3model· 1.7k dl· ♡ 251.7k dl♡ 25
- 🤗llmfan46/gpt-oss-120b-ultra-hereticmodel· 414 dl· ♡ 3414 dl♡ 3
- 🤗openai/gpt-oss-safeguard-120bmodel· 22k dl· ♡ 9222k dl♡ 92
- 🤗ArliAI/gpt-oss-20b-Derestrictedmodel· 428 dl· ♡ 91428 dl♡ 91
- 🤗ArliAI/gpt-oss-120b-Derestrictedmodel· 2.1k dl· ♡ 822.1k dl♡ 82
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
