Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning
Jiajun Guo, Xin Luo, Jiayin Zheng, Yiqun Wang, Kai-Wei Chang, Wei Wang, Jie Liu

TL;DR
Quantized-TinyLLaVA introduces a communication-efficient split learning framework for multimodal models by quantizing intermediate features, significantly reducing data transmission costs while maintaining performance and enhancing privacy.
Contribution
This work presents a novel quantization-based split learning approach for multimodal models, reducing communication overhead and improving privacy in distributed training.
Findings
87.5% reduction in communication overhead with 2-bit quantization
Maintains model performance across five benchmark datasets
Enhanced resistance to feature inversion attacks
Abstract
Multimodal foundation models are increasingly trained on sensitive data across domains such as finance, biomedicine, and personal identifiers. However, this distributed setup raises serious privacy concerns due to the need for cross-partition data sharing. Split learning addresses these concerns by enabling collaborative model training without raw data exchange between partitions, yet it introduces a significant challenge: transmitting high-dimensional intermediate feature representations between partitions leads to substantial communication costs. To address this challenge, we propose Quantized-TinyLLaVA, a multimodal foundation model with an integrated communication-efficient split learning framework. Our approach adopts a compression module that quantizes intermediate feature into discrete representations before transmission, substantially reducing communication overhead. Besides, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Face recognition and analysis
