Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning

Jiajun Guo; Xin Luo; Jiayin Zheng; Yiqun Wang; Kai-Wei Chang; Wei Wang; Jie Liu

arXiv:2511.23402·cs.LG·February 3, 2026

Quantized-Tinyllava: a new multimodal foundation model enables efficient split learning

Jiajun Guo, Xin Luo, Jiayin Zheng, Yiqun Wang, Kai-Wei Chang, Wei Wang, Jie Liu

PDF

Open Access

TL;DR

Quantized-TinyLLaVA introduces a communication-efficient split learning framework for multimodal models by quantizing intermediate features, significantly reducing data transmission costs while maintaining performance and enhancing privacy.

Contribution

This work presents a novel quantization-based split learning approach for multimodal models, reducing communication overhead and improving privacy in distributed training.

Findings

01

87.5% reduction in communication overhead with 2-bit quantization

02

Maintains model performance across five benchmark datasets

03

Enhanced resistance to feature inversion attacks

Abstract

Multimodal foundation models are increasingly trained on sensitive data across domains such as finance, biomedicine, and personal identifiers. However, this distributed setup raises serious privacy concerns due to the need for cross-partition data sharing. Split learning addresses these concerns by enabling collaborative model training without raw data exchange between partitions, yet it introduces a significant challenge: transmitting high-dimensional intermediate feature representations between partitions leads to substantial communication costs. To address this challenge, we propose Quantized-TinyLLaVA, a multimodal foundation model with an integrated communication-efficient split learning framework. Our approach adopts a compression module that quantizes intermediate feature into discrete representations before transmission, substantially reducing communication overhead. Besides, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Face recognition and analysis