From Neurons to Semantics: Evaluating Cross-Linguistic Alignment Capabilities of Large Language Models via Neurons Alignment
Chongxuan Huang, Yongshi Ye, Biao Fu, Qifeng Su, Xiaodong Shi

TL;DR
This paper introduces NeuronXA, a neuron-based method for evaluating cross-lingual alignment in multilingual LLMs, demonstrating high correlation with task performance using minimal data.
Contribution
The paper proposes a novel neuron state-based approach for assessing cross-lingual alignment, grounded in neuroscientific insights, and validates its effectiveness across multiple models and benchmarks.
Findings
NeuronXA achieves a Pearson correlation of 0.9556 with downstream task performance.
It attains a correlation of 0.8514 with transferability metrics.
Effective with only 100 parallel sentence pairs.
Abstract
Large language models (LLMs) have demonstrated remarkable multilingual capabilities, however, how to evaluate cross-lingual alignment remains underexplored. Existing alignment benchmarks primarily focus on sentence embeddings, but prior research has shown that neural models tend to induce a non-smooth representation space, which impact of semantic alignment evaluation on low-resource languages. Inspired by neuroscientific findings that similar information activates overlapping neuronal regions, we propose a novel Neuron State-Based Cross-Lingual Alignment (NeuronXA) to assess the cross-lingual a lignment capabilities of LLMs, which offers a more semantically grounded approach to assess cross-lingual alignment. We evaluate NeuronXA on several prominent multilingual LLMs (LLaMA, Qwen, Mistral, GLM, and OLMo) across two transfer tasks and three multilingual benchmarks. The results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling
