Empower Vision Applications with LoRA LMM
Liang Mi, Weijun Wang, Wenming Tu, Qingfeng He, Rui Kong, Xinyu Fang,, Yazhu Dong, Yikang Zhang, Yunchun Li, Meng Li, Haipeng Dai, Guihai Chen,, Yunxin Liu

TL;DR
This paper introduces VaLoRA, an end-to-end system that enhances vision applications with LoRA LMMs, achieving high accuracy and low latency through innovative adapter generation, batching, and orchestration techniques.
Contribution
The paper presents a novel system, VaLoRA, that significantly improves the efficiency and accuracy of LoRA-based vision tasks in large multimodal models.
Findings
Improves accuracy by 24-62% over original LMMs.
Reduces latency by 20-89% compared to existing LoRA serving systems.
Demonstrates effectiveness on five vision tasks across three LMMs.
Abstract
Large Multimodal Models (LMMs) have shown significant progress in various complex vision tasks with the solid linguistic and reasoning capacity inherited from large language models (LMMs). Low-rank adaptation (LoRA) offers a promising method to integrate external knowledge into LMMs, compensating for their limitations on domain-specific tasks. However, the existing LoRA model serving is excessively computationally expensive and causes extremely high latency. In this paper, we present an end-to-end solution that empowers diverse vision tasks and enriches vision applications with LoRA LMMs. Our system, VaLoRA, enables accurate and efficient vision tasks by 1) an accuracy-aware LoRA adapter generation approach that generates LoRA adapters rich in domain-specific knowledge to meet application-specific accuracy requirements, 2) an adaptive-tiling LoRA adapters batching operator that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCCD and CMOS Imaging Sensors · Image Processing Techniques and Applications
MethodsAdapter
