Linguistic Neuron Overlap Patterns to Facilitate Cross-lingual Transfer on Low-resource Languages
Yuemei Xu, Kexin Xu, Jian Zhou, Ling Hu, Lin Gui

TL;DR
This paper introduces BridgeX-ICL, a neuron-sharing method that enhances zero-shot cross-lingual learning in large language models, especially benefiting low-resource languages by leveraging linguistic neuron overlaps.
Contribution
It proposes a novel neuron-sharing approach, BridgeX-ICL, guided by an HSIC-based metric, to improve cross-lingual transfer in LLMs without additional fine-tuning.
Findings
BridgeX-ICL improves performance on 4 cross-lingual tasks.
Neuron overlap patterns correlate with transfer success.
Method is effective across diverse language pairs.
Abstract
The current Large Language Models (LLMs) face significant challenges in improving their performance on low-resource languages and urgently need data-efficient methods without costly fine-tuning. From the perspective of language-bridge, we propose a simple yet effective method, namely BridgeX-ICL, to improve the zero-shot Cross-lingual In-Context Learning (X-ICL) for low-resource languages. Unlike existing works focusing on language-specific neurons, BridgeX-ICL explores whether sharing neurons can improve cross-lingual performance in LLMs. We construct neuron probe data from the ground-truth MUSE bilingual dictionaries, and define a subset of language overlap neurons accordingly to ensure full activation of these anchored neurons. Subsequently, we propose an HSIC-based metric to quantify LLMs' internal linguistic spectrum based on overlapping neurons, guiding optimal bridge selection.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · ICT in Developing Communities
