Extracting and Combining Abilities For Building Multi-lingual Ability-enhanced Large Language Models
Zhipeng Chen, Kun Zhou, Liang Song, Wayne Xin Zhao, Bingning Wang, Weipeng Chen, Ji-Rong Wen

TL;DR
This paper introduces MAEC, a method to extract and combine language-agnostic abilities from large language models, enabling multi-lingual performance enhancement without additional training, especially benefiting low-resource languages.
Contribution
The paper proposes a novel ability extraction and combination approach that does not require training, improving multi-lingual capabilities of LLMs across resource scenarios.
Findings
MAEC effectively extracts transferable abilities from LLMs.
It enhances multi-lingual performance on scientific and mathematical tasks.
Achieves comparable results to training-based methods like PaLM.
Abstract
Multi-lingual ability transfer has become increasingly important for the broad application of large language models (LLMs). Existing work highly relies on training with the multi-lingual ability-related data, which may not be available for low-resource languages. To solve it, we propose a Multi-lingual Abilities Extraction and Combination approach, named as MAEC. Our key idea is to decompose and extract language-agnostic ability-related weights from LLMs, and combine them across different languages by simple addition and subtraction operations without training. Specifically, our MAEC consists of the extraction and combination stages. In the extraction stage, we firstly locate key neurons that are highly related to specific abilities, and then employ them to extract the transferable ability-related weights. In the combination stage, we further select the ability-related tensors that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Text Readability and Simplification
