Yuan3.0 Flash: An Open Multimodal Large Language Model for Enterprise Applications

YuanLab.ai: Shawn Wu; Sean Wang; Louie Li; Darcy Chen; Allen Wang; Jiangang Luo; Xudong Zhao; Joseph Shen; Gawain Ma; Jasper Jia; Marcus Mao; Claire Wang; Hunter He; Carol Wang; Zera Zhang; Jason Wang; Chonly Shen; Leo Zhang; Logan Chen; Qasim Meng; James Gong; Danied Zhao; Penn Zheng; Owen Zhu; Tong Yu

arXiv:2601.01718·cs.AI·January 6, 2026

TL;DR

Yuan3.0 Flash is an open-source multimodal large language model optimized for enterprise tasks, featuring a novel RL training method to reduce overthinking and achieving high performance with fewer tokens.

Contribution

The paper introduces Yuan3.0 Flash, a multimodal LLM, together with a new RL training algorithm, Reflection-aware Adaptive Policy Optimization (RAPO), intended to improve enterprise task performance and reduce overthinking. The model is openly available for research.

Findings

01. Superior performance on enterprise tasks like RAG and summarization.

02. Achieves high reasoning accuracy with fewer tokens.

03. Open-sourced for community research and deployment.

Abstract

We introduce Yuan3.0 Flash, an open-source Mixture-of-Experts (MoE) multimodal large language model with 3.7B activated parameters and 40B total parameters, designed to improve performance on enterprise-oriented tasks while remaining competitive on general-purpose tasks. To address the overthinking commonly observed in Large Reasoning Models (LRMs), we propose Reflection-aware Adaptive Policy Optimization (RAPO), a novel RL training algorithm that regulates overthinking behaviors. On enterprise-oriented tasks such as retrieval-augmented generation (RAG), complex table understanding, and summarization, Yuan3.0 Flash consistently achieves superior performance. It also demonstrates strong reasoning capabilities in domains such as mathematics and science, attaining accuracy comparable to frontier models while requiring only…
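As a rough illustration of what the two parameter counts in the abstract imply, the sketch below computes the share of the model's parameters that is activated. Only the 3.7B and 40B figures come from the abstract; the function name and the reading of "activated" as "used per token" are assumptions made for illustration, not details confirmed by the paper.

```python
# Figures quoted in the abstract; everything else is an illustrative assumption.
ACTIVATED_PARAMS_B = 3.7  # activated parameters, in billions
TOTAL_PARAMS_B = 40.0     # total parameters, in billions


def activated_fraction(activated_b: float, total_b: float) -> float:
    """Return the share of a sparse MoE model's parameters that is activated.

    Hypothetical helper: assumes both counts use the same unit.
    """
    if total_b <= 0 or not (0 <= activated_b <= total_b):
        raise ValueError("expected 0 <= activated_b <= total_b and total_b > 0")
    return activated_b / total_b


fraction = activated_fraction(ACTIVATED_PARAMS_B, TOTAL_PARAMS_B)
print(f"Activated share of parameters: {fraction:.2%}")  # 9.25%
```

Under these assumptions, a little under one tenth of the model's parameters is activated at a time, which is the usual motivation for a sparse MoE design: total capacity grows without a proportional increase in per-token compute.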

Taxonomy

Topics: Topic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)