ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language   Models

Yi-Lin Sung; Jaehong Yoon; Mohit Bansal

arXiv:2310.02998·cs.CV·January 29, 2024·1 cites

ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models

Yi-Lin Sung, Jaehong Yoon, Mohit Bansal

PDF

Open Access

TL;DR

ECoFLaP is a two-stage pruning method for large vision-language models that efficiently reduces model size by combining global importance scoring with layer-wise pruning, maintaining high performance.

Contribution

It introduces a global importance-based sparsity determination followed by layer-wise pruning, improving compression efficiency and performance over existing methods.

Findings

01

Significant performance gains at high sparsity levels.

02

Effective model compression across various multimodal datasets.

03

Outperforms prevalent pruning techniques in experiments.

Abstract

Large Vision-Language Models (LVLMs) can understand the world comprehensively by integrating rich information from different modalities, achieving remarkable advancements on various multimodal downstream tasks. However, deploying LVLMs is often problematic due to their massive computational/energy costs and carbon consumption. Such issues make it infeasible to adopt conventional iterative global pruning, which is costly due to computing the Hessian matrix of the entire large model for sparsification. Alternatively, several studies have recently proposed layer-wise pruning approaches to avoid the expensive computation of global pruning and efficiently compress model weights according to their importance within a layer. However, they often suffer from suboptimal model compression due to their lack of a global perspective. To address this limitation in recent efficient pruning methods for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques · Genomics and Phylogenetic Studies

MethodsPruning