Jan-nano Technical Report
Alan Dao (Gia Tuan Dao), Dinh Bach Vu

TL;DR
Jan-nano is a highly efficient 4B parameter language model that excels in instant retrieval and reasoning tasks, achieved through novel training methods and specialization, enabling high performance on consumer hardware.
Contribution
The paper introduces Jan-nano, a specialized 4B model trained with a new RLVR system that eliminates traditional next token prediction, demonstrating high efficiency and performance.
Findings
Achieves 83.2% on SimpleQA benchmark
Operates effectively with 128K context length
Runs on consumer hardware with high efficiency
Abstract
Most language models face a fundamental tradeoff where powerful capabilities require substantial computational resources. We shatter this constraint with Jan-nano, a 4B parameter language model that redefines efficiency through radical specialization: instead of trying to know everything, it masters the art of finding anything instantly. Fine-tuned from Qwen3-4B using our novel multi-stage Reinforcement Learning with Verifiable Rewards (RLVR) system that completely eliminates reliance on next token prediction training (SFT), Jan-nano achieves 83.2% on SimpleQA benchmark with MCP integration while running on consumer hardware. With 128K context length, Jan-nano proves that intelligence isn't about scale, it's about strategy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Menlo/Jan-nanomodel· 5.3k dl· ♡ 5035.3k dl♡ 503
- 🤗Menlo/Jan-nano-ggufmodel· 1.9k dl· ♡ 1401.9k dl♡ 140
- 🤗Menlo/Jan-nano-128k-ggufmodel· 11k dl· ♡ 7111k dl♡ 71
- 🤗Menlo/Jan-nano-128kmodel· 37k dl· ♡ 22137k dl♡ 221
- 🤗warshanks/Jan-nano-AWQmodel· 172k dl· ♡ 3172k dl♡ 3
- 🤗warshanks/Jan-nano-128k-AWQmodel· 8 dl8 dl
- 🤗contextboxai/Qwen3-1.7B-FCmodel· 44 dl44 dl
- 🤗digitranslab/Megamind-nano-128k-ggufmodel· 60 dl60 dl
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNanowire Synthesis and Applications
