RynnBrain: Open Embodied Foundation Models
Ronghao Dang, Jiayan Guo, Bohan Hou, Sicong Leng, Kehan Li, Xin Li, Jiangpin Liu, Yunxuan Mao, Zhikai Wang, Yuqian Yuan, Minghao Zhu, Xiao Lin, Yang Bai, Qian Jiang, Yaxi Zhao, Minghua Zeng, Junlong Gao, Yuming Jiang, Jun Cen, Siteng Huang, Liuyi Wang, Wenqiao Zhang, Chengju Liu

TL;DR
RynnBrain is an open-source, multi-scale foundation model designed for embodied intelligence, integrating perception, reasoning, and planning within real-world dynamics, and demonstrating superior performance on diverse benchmarks.
Contribution
The paper introduces RynnBrain, a unified, physically grounded foundation model for embodied intelligence, with multiple scales and variants tailored for various embodied and spatial reasoning tasks.
Findings
Outperforms existing embodied models on 20 benchmarks
Demonstrates strong capabilities in perception, reasoning, and planning
Enables efficient adaptation to diverse embodied tasks
Abstract
Despite rapid progress in multimodal foundation models, embodied intelligence community still lacks a unified, physically grounded foundation model that integrates perception, reasoning, and planning within real-world spatial-temporal dynamics. We introduce RynnBrain, an open-source spatiotemporal foundation model for embodied intelligence. RynnBrain strengthens four core capabilities in a unified framework: comprehensive egocentric understanding, diverse spatiotemporal localization, physically grounded reasoning, and physics-aware planning. The RynnBrain family comprises three foundation model scales (2B, 8B, and 30B-A3B MoE) and four post-trained variants tailored for downstream embodied tasks (i.e., RynnBrain-Nav, RynnBrain-Plan, and RynnBrain-VLA) or complex spatial reasoning tasks (i.e., RynnBrain-CoP). In terms of extensive evaluations on 20 embodied benchmarks and 8 general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗Alibaba-DAMO-Academy/RynnBrain-Nav-8Bmodel· 2.1k dl· ♡ 122.1k dl♡ 12
- 🤗Alibaba-DAMO-Academy/RynnBrain-Plan-8Bmodel· 419 dl· ♡ 7419 dl♡ 7
- 🤗Alibaba-DAMO-Academy/RynnBrain-8Bmodel· 2.4k dl· ♡ 122.4k dl♡ 12
- 🤗Alibaba-DAMO-Academy/RynnBrain-30B-A3Bmodel· 240 dl· ♡ 18240 dl♡ 18
- 🤗Alibaba-DAMO-Academy/RynnBrain-2Bmodel· 1.1k dl· ♡ 251.1k dl♡ 25
- 🤗Alibaba-DAMO-Academy/RynnBrain-CoP-8Bmodel· 82 dl· ♡ 682 dl♡ 6
- 🤗Alibaba-DAMO-Academy/RynnBrain-Plan-30B-A3Bmodel· 131 dl· ♡ 5131 dl♡ 5
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · AI-based Problem Solving and Planning · Spatial Cognition and Navigation
