Nemotron-Cascade 2: Post-Training LLMs with Cascade RL and Multi-Domain On-Policy Distillation
Zhuolin Yang, Zihan Liu, Yang Chen, Wenliang Dai, Boxin Wang, Sheng-Chieh Lin, Chankyu Lee, Yangyi Chen, Dongfu Jiang, Jiafan He, Renjie Pi, Grace Lam, Nayeon Lee, Alexander Bukharin, Mohammad Shoeybi, Bryan Catanzaro, Wei Ping

TL;DR
Nemotron-Cascade 2 is a compact open 30B MoE model that achieves top-tier reasoning and agentic skills, rivaling larger models in mathematical and coding tasks through innovative training and distillation techniques.
Contribution
The paper introduces multi-domain on-policy distillation and expanded Cascade RL, significantly enhancing reasoning capabilities of a compact LLM.
Findings
Achieves Gold Medal-level performance in IMO, IOI, and ICPC with 20x fewer parameters.
Approaches frontier open models in mathematical and coding reasoning.
Demonstrates high intelligence density in a compact model.
Abstract
We introduce Nemotron-Cascade 2, an open 30B MoE model with 3B activated parameters that delivers best-in-class reasoning and strong agentic capabilities. Despite its compact size, its mathematical and coding reasoning performance approaches that of frontier open models. It is the second open-weight LLM, after DeepSeekV3.2-Speciale-671B-A37B, to achieve Gold Medal-level performance in the 2025 International Mathematical Olympiad (IMO), the International Olympiad in Informatics (IOI), and the ICPC World Finals, demonstrating remarkably high intelligence density with 20x fewer parameters. In contrast to Nemotron-Cascade 1, the key technical advancements are as follows. After SFT on a meticulously curated dataset, we substantially expand Cascade RL to cover a much broader spectrum of reasoning and agentic domains. Furthermore, we introduce multi-domain on-policy distillation from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗nvidia/Nemotron-Cascade-2-30B-A3Bmodel· 90k dl· ♡ 44090k dl♡ 440
- 🤗empero-ai/openNemo-Cascade-2-30B-A3Bmodel· 673 dl· ♡ 5673 dl♡ 5
- 🤗trohrbaugh/Nemotron-Cascade-2-30B-A3B-heretic-uncensoredmodel· 10 dl· ♡ 210 dl♡ 2
- 🤗trohrbaugh/Nemotron-Cascade-2-30B-A3B-heretic-ara-uncensoredmodel· 9 dl· ♡ 29 dl♡ 2
- 🤗stelterlab/Nemotron-Cascade-2-30B-A3B-AWQmodel· 8.7k dl· ♡ 18.7k dl♡ 1
- 🤗freddm/Nemotron-Cascade-2-30B-A3B-GGUFmodel· 965 dl· ♡ 2965 dl♡ 2
- 🤗bitL8-ByteShort/Nemotron-Cascade-2-30B-A3Bmodel· 9 dl9 dl
- 🤗trohrbaugh/Nemotron-Cascade-2-30B-A3B-heretic-ara-FP8model· 88 dl88 dl
- 🤗Basemvb/Nemotron-Cascade-2-30B-A3Bmodel· 158 dl158 dl
- 🤗Ujjwal-Tyagi/Nemotron-Cascade-2-30B-A3Bmodel· 130 dl130 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Explainable Artificial Intelligence (XAI)
