Federated Learning in Big Model Era: Domain-Specific Multimodal Large Models
Zengxiang Li, Zhaoxiang Hou, Hui Liu, Ying Wang, Tongzhi, Li, Longfei Xie, Chao Shi, Chengyi Yang, Weishan Zhang, Zelei, Liu, Liang Xu

TL;DR
This paper introduces a federated learning framework for multimodal large models tailored to specific industrial domains, enabling collaborative training across enterprises to improve domain-specific AI services while addressing privacy and heterogeneity challenges.
Contribution
It proposes a novel multimodal federated learning approach for vertical domains, including technical innovations and a case study on city safety management.
Findings
Enhanced multimodal model capabilities through federated learning
Improved domain-specific AI services in city safety applications
Established a cooperative ecosystem for large-scale industrial AI deployment
Abstract
Multimodal data, which can comprehensively perceive and recognize the physical world, has become an essential path towards general artificial intelligence. However, multimodal large models trained on public datasets often underperform in specific industrial domains. This paper proposes a multimodal federated learning framework that enables multiple enterprises to utilize private domain data to collaboratively train large models for vertical domains, achieving intelligent services across scenarios. The authors discuss in-depth the strategic transformation of federated learning in terms of intelligence foundation and objectives in the era of big model, as well as the new challenges faced in heterogeneous data, model aggregation, performance and cost trade-off, data privacy, and incentive mechanism. The paper elaborates a case study of leading enterprises contributing multimodal data and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Smart Cities and Technologies
