Topology-Aware Layer Pruning for Large Vision-Language Models

Pengcheng Zheng; Chaoning Zhang; Ya Wen; Wang Liu; Qigan Sun; Jiarong Mo; Jiaquan Zhang; Jewon Lee; Tae-Ho Kim; Kuien Liu; Tianyu Li; Caiyan Qin; Yang Yang

arXiv:2604.16502·cs.CV·April 21, 2026

TL;DR

This paper introduces a topology-aware layer pruning method for large vision-language models that preserves critical representational transitions using simplicial complexes and zigzag persistent homology, improving efficiency without sacrificing performance.

Contribution

It proposes a novel topology-based framework for adaptive layer pruning in LVLMs, capturing global representation evolution to maintain model effectiveness.

Findings

01. Outperforms existing pruning methods across various benchmarks.

02. Effectively preserves critical model transitions during pruning.

03. Achieves better sparsity-performance trade-offs.

Abstract

Large Language Models (LLMs) have demonstrated strong capabilities in natural language understanding and reasoning, while recent extensions that incorporate visual inputs enable them to process multimodal information. Despite these advances, Large Vision-Language Models (LVLMs) incur substantial computational and memory costs, hindering deployment in resource-constrained scenarios. Existing layer pruning methods typically rely on local similarity metrics or static proxy signals, failing to capture the global and dynamic evolution of representations across model depth, which often leads to the removal of transition-critical layers. To address this limitation, we propose a topology-aware layer pruning framework for LVLMs. Specifically, we represent layer-wise hidden states as point clouds and model their evolution using simplicial complexes. By leveraging zigzag…
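
The idea of treating each layer's hidden states as a point cloud and summarizing it with a simplicial complex can be illustrated with a minimal sketch. This is not the authors' implementation, and it does not compute zigzag persistent homology. As a much simpler stand-in, it builds the edges of a Vietoris-Rips complex at one fixed scale and counts connected components (Betti-0) per layer. The function names, the scale `eps`, and the toy data are all hypothetical.

```python
import math
from itertools import combinations


def rips_edges(points, eps):
    """Edges (1-simplices) of a Vietoris-Rips complex: pairs within distance eps."""
    return [
        (i, j)
        for i, j in combinations(range(len(points)), 2)
        if math.dist(points[i], points[j]) <= eps
    ]


def betti0(num_points, edges):
    """Number of connected components, computed with union-find."""
    parent = list(range(num_points))

    def find(x):
        while parent[x] != x:
            parent[x] = parent[parent[x]]  # path halving
            x = parent[x]
        return x

    for i, j in edges:
        parent[find(i)] = find(j)
    return len({find(i) for i in range(num_points)})


def layer_profile(hidden_states, eps):
    """Betti-0 of each layer's point cloud at a fixed scale eps (assumed value)."""
    return [betti0(len(pc), rips_edges(pc, eps)) for pc in hidden_states]


# Toy "hidden states": three layers, four 2-D token vectors each (made-up numbers).
toy_layers = [
    [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)],  # two tight clusters
    [(0.0, 0.0), (0.1, 0.0), (5.0, 5.0), (5.1, 5.0)],  # unchanged
    [(0.0, 0.0), (0.1, 0.0), (0.2, 0.0), (0.3, 0.0)],  # clusters merged
]
profile = layer_profile(toy_layers, eps=0.5)
# Change in component count between consecutive layers.
changes = [abs(b - a) for a, b in zip(profile, profile[1:])]
print(profile, changes)
```

In this toy setting, a layer whose component count matches its predecessor's would be a candidate for removal, while a layer where the count changes marks a structural transition. A real pipeline would presumably track richer topological features across scales and layers rather than a single count at one scale.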

Code & Models

Repositories

zpc456/TopoVLM (GitHub)
