Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer

Chenhang Cui; An Zhang; Yuxin Chen; Gelei Deng; Jingnan Zheng; Zhenkai Liang; Xiang Wang; Tat-Seng Chua

arXiv:2602.19058·cs.CL·February 24, 2026

Do LLMs and VLMs Share Neurons for Inference? Evidence and Mechanisms of Cross-Modal Transfer

Chenhang Cui, An Zhang, Yuxin Chen, Gelei Deng, Jingnan Zheng, Zhenkai Liang, Xiang Wang, Tat-Seng Chua

PDF

Open Access

TL;DR

This paper uncovers significant shared neurons between large language models and vision-language models, demonstrating a modality-invariant inference subspace, and introduces a low-rank fusion method to transfer inference capabilities across models.

Contribution

The study reveals shared neurons responsible for inference in LLMs and LVLMs and proposes a parameter-efficient transfer method called SNRF that enhances multimodal inference without extensive fine-tuning.

Findings

01

Over half of top-activated units are shared between LLMs and LVLMs during inference.

02

Shared neurons encode consistent, interpretable concept-level effects.

03

SNRF improves LVLM inference performance across benchmarks with minimal parameter updates.

Abstract

Large vision-language models (LVLMs) have rapidly advanced across various domains, yet they still lag behind strong text-only large language models (LLMs) on tasks that require multi-step inference and compositional decision-making. Motivated by their shared transformer architectures, we investigate whether the two model families rely on common internal computation for such inference. At the neuron level, we uncover a surprisingly large overlap: more than half of the top-activated units during multi-step inference are shared between representative LLMs and LVLMs, revealing a modality-invariant inference subspace. Through causal probing via activation amplification, we further show that these shared neurons encode consistent and interpretable concept-level effects, demonstrating their functional contribution to inference. Building on this insight, we propose Shared Neuron Low-Rank…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Explainable Artificial Intelligence (XAI)