Unimodal Training-Multimodal Prediction: Cross-modal Federated Learning with Hierarchical Aggregation
Rongyu Zhang, Xiaowei Chi, Guiliang Liu, Wenyi Zhang, Yuan Du, Fangxin, Wang

TL;DR
This paper introduces a novel federated learning framework that enables unimodal training with multimodal prediction, improving accuracy by aggregating knowledge across clients and modalities, especially in non-IID data scenarios.
Contribution
It proposes HA-Fedformer, a transformer-based model with uncertainty-aware and cross-modal aggregation techniques for effective multimodal federated learning.
Findings
Achieves 15%-20% performance improvement on sentiment analysis benchmarks.
Effectively handles non-IID data and unaligned language sequences.
Outperforms existing multimodal models in federated settings.
Abstract
Multimodal learning has seen great success mining data features from multiple modalities with remarkable model performance improvement. Meanwhile, federated learning (FL) addresses the data sharing problem, enabling privacy-preserved collaborative training to provide sufficient precious data. Great potential, therefore, arises with the confluence of them, known as multimodal federated learning. However, limitation lies in the predominant approaches as they often assume that each local dataset records samples from all modalities. In this paper, we aim to bridge this gap by proposing an Unimodal Training - Multimodal Prediction (UTMP) framework under the context of multimodal federated learning. We design HA-Fedformer, a novel transformer-based model that empowers unimodal training with only a unimodal dataset at the client and multimodal testing by aggregating multiple clients' knowledge…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTraffic Prediction and Management Techniques · Privacy-Preserving Technologies in Data · Recommender Systems and Techniques
