MiroThinker: Pushing the Performance Boundaries of Open-Source Research Agents via Model, Context, and Interactive Scaling
MiroMind Team, Song Bai, Lidong Bing, Carson Chen, Guanzheng Chen, Yuntao Chen, Zhe Chen, Ziyi Chen, Jifeng Dai, Xuan Dong, Wenhan Dou, Yue Deng, Yunjie Fu, Junqi Ge, Chenxia Han, Tammy Huang, Zhenhang Huang, Jerry Jiao, Shilei Jiang, Tianyu Jiao, Xiaoqi Jian, Lei Lei, Ruilin Li

TL;DR
MiroThinker v1.0 is an open-source research agent that enhances reasoning and information-seeking by leveraging interaction scaling through reinforcement learning, environment feedback, and external information acquisition.
Contribution
It introduces interaction scaling as a new dimension for improving research agent performance, beyond model size and context length, with systematic training and evaluation.
Findings
Achieves up to 81.9% accuracy on GAIA benchmark
Performs up to 600 tool calls per task with 256K context window
Interaction scaling improves research performance predictably
Abstract
We present MiroThinker v1.0, an open-source research agent designed to advance tool-augmented reasoning and information-seeking capabilities. Unlike previous agents that only scale up model size or context length, MiroThinker explores interaction scaling at the model level, systematically training the model to handle deeper and more frequent agent-environment interactions as a third dimension of performance improvement. Unlike LLM test-time scaling, which operates in isolation and risks degradation with longer reasoning chains, interactive scaling leverages environment feedback and external information acquisition to correct errors and refine trajectories. Through reinforcement learning, the model achieves efficient interaction scaling: with a 256K context window, it can perform up to 600 tool calls per task, enabling sustained multi-turn reasoning and complex real-world research…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗miromind-ai/MiroThinker-1.7-minimodel· 910 dl· ♡ 99910 dl♡ 99
- 🤗miromind-ai/MiroThinker-v1.0-8Bmodel· 168 dl· ♡ 64168 dl♡ 64
- 🤗miromind-ai/MiroThinker-1.7model· 1.1k dl· ♡ 1371.1k dl♡ 137
- 🤗miromind-ai/MiroThinker-v1.0-72Bmodel· 36 dl· ♡ 13036 dl♡ 130
- 🤗miromind-ai/MiroThinker-v1.0-30Bmodel· 26 dl· ♡ 5526 dl♡ 55
- 🤗cyankiwi/MiroThinker-v1.0-30B-AWQ-4bitmodel· 35 dl· ♡ 235 dl♡ 2
- 🤗Mungert/MiroThinker-v1.0-30B-GGUFmodel· 119 dl· ♡ 3119 dl♡ 3
- 🤗cyankiwi/MiroThinker-v1.0-30B-AWQ-8bitmodel· 16 dl16 dl
- 🤗jaigouk/qwen3-4b-german-teacher-v1model· 111 dl111 dl
- 🤗Doradus-AI/MiroThinker-v1.0-30B-FP8model· 11 dl· ♡ 411 dl♡ 4
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
