EffiVLM-BENCH: A Comprehensive Benchmark for Evaluating Training-Free Acceleration in Large Vision-Language Models

Zekun Wang; Minghua Ma; Zexin Wang; Rongchuan Mu; Liping Shan; Ming Liu; Bing Qin

arXiv:2506.00479·cs.CL·June 3, 2025

EffiVLM-BENCH: A Comprehensive Benchmark for Evaluating Training-Free Acceleration in Large Vision-Language Models

Zekun Wang, Minghua Ma, Zexin Wang, Rongchuan Mu, Liping Shan, Ming Liu, Bing Qin

PDF

Open Access

TL;DR

This paper introduces EffiVLM-Bench, a comprehensive benchmarking framework for evaluating various training-free acceleration techniques in large vision-language models, addressing the need for systematic assessment of efficiency methods.

Contribution

It provides a unified platform to evaluate acceleration methods across multiple backbones and metrics, including performance, generalization, and trade-offs, filling a gap in current evaluation practices.

Findings

01

Token and parameter compression techniques vary in effectiveness.

02

EffiVLM-Bench reveals trade-offs between acceleration and model fidelity.

03

Open-source code facilitates future research in LVLM efficiency.

Abstract

Large Vision-Language Models (LVLMs) have achieved remarkable success, yet their significant computational demands hinder practical deployment. While efforts to improve LVLM efficiency are growing, existing methods lack comprehensive evaluation across diverse backbones, benchmarks, and metrics. In this work, we systematically evaluate mainstream acceleration techniques for LVLMs, categorized into token and parameter compression. We introduce EffiVLM-Bench, a unified framework for assessing not only absolute performance but also generalization and loyalty, while exploring Pareto-optimal trade-offs. Our extensive experiments and in-depth analyses offer insights into optimal strategies for accelerating LVLMs. We open-source code and recipes for EffiVLM-Bench to foster future research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Advanced Neural Network Applications