Fengshenbang 1.0: Being the Foundation of Chinese Cognitive Intelligence
Jiaxing Zhang, Ruyi Gan, Junjie Wang, Yuxiang Zhang, Lin Zhang, Ping, Yang, Xinyu Gao, Ziwei Wu, Xiaoqun Dong, Junqing He, Jianheng Zhuo, Qi Yang,, Yongfeng Huang, Xiayu Li, Yanghan Wu, Junyu Lu, Xinyu Zhu, Weifeng Chen, Ting, Han, Kunhao Pan, Rui Wang, Hao Wang, Xiaojun Wu

TL;DR
Fengshenbang 1.0 is an open-source project that provides comprehensive Chinese-language foundation models, tools, and benchmarks to support the development of Chinese cognitive intelligence and democratize access to large-scale models.
Contribution
It introduces a complete ecosystem for Chinese foundation models, including models, frameworks, benchmarks, and datasets, fostering community collaboration and resource sharing.
Findings
Launch of large pre-trained Chinese models
Development of user-friendly APIs and benchmarks
Promotion of open-source ecosystem for Chinese AI models
Abstract
Nowadays, foundation models become one of fundamental infrastructures in artificial intelligence, paving ways to the general intelligence. However, the reality presents two urgent challenges: existing foundation models are dominated by the English-language community; users are often given limited resources and thus cannot always use foundation models. To support the development of the Chinese-language community, we introduce an open-source project, called Fengshenbang, which leads by the research center for Cognitive Computing and Natural Language (CCNL). Our project has comprehensive capabilities, including large pre-trained models, user-friendly APIs, benchmarks, datasets, and others. We wrap all these in three sub-projects: the Fengshenbang Model, the Fengshen Framework, and the Fengshen Benchmark. An open-source roadmap, Fengshenbang, aims to re-evaluate the open-source community of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗IDEA-CCNL/Erlangshen-Roberta-330M-NLImodel· 444 dl· ♡ 8444 dl♡ 8
- 🤗IDEA-CCNL/Erlangshen-MegatronBert-1.3Bmodel· 68 dl· ♡ 2768 dl♡ 27
- 🤗IDEA-CCNL/Randeng-MegatronT5-770Mmodel· 5 dl· ♡ 75 dl♡ 7
- 🤗IDEA-CCNL/Wenzhong-GPT2-3.5Bmodel· 22 dl· ♡ 1522 dl♡ 15
- 🤗IDEA-CCNL/Yuyuan-GPT2-3.5Bmodel· 21 dl· ♡ 821 dl♡ 8
- 🤗IDEA-CCNL/Zhouwenwang-Unified-1.3Bmodel· 6 dl· ♡ 26 dl♡ 2
- 🤗IDEA-CCNL/Zhouwenwang-Unified-110Mmodel· 17 dl· ♡ 417 dl♡ 4
- 🤗IDEA-CCNL/Erlangshen-Longformer-110Mmodel· 66 dl· ♡ 1266 dl♡ 12
- 🤗IDEA-CCNL/Erlangshen-Longformer-330Mmodel· 29 dl· ♡ 529 dl♡ 5
- 🤗IDEA-CCNL/YuyuanQA-GPT2-3.5Bmodel· 16 dl· ♡ 716 dl♡ 7
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Topic Modeling
