GreenPLM: Cross-Lingual Transfer of Monolingual Pre-Trained Language Models at Almost No Cost
Qingcheng Zeng, Lucas Garay, Peilin Zhou, Dading Chong, Yining Hua,, Jiageng Wu, Yikang Pan, Han Zhou, Rob Voigt, Jie Yang

TL;DR
GreenPLM introduces an energy-efficient method to transfer monolingual pre-trained language models across languages using bilingual lexicons, significantly reducing costs and energy consumption while improving performance in low-resource settings.
Contribution
The paper presents a novel framework that enables cross-lingual transfer of pre-trained models with minimal additional cost and energy, outperforming original models in several languages.
Findings
Comparable or better performance than high-cost heuristics
Outperforms original models with up to 200x less pre-training
Reduces language inequalities and energy consumption
Abstract
Large pre-trained models have revolutionized natural language processing (NLP) research and applications, but high training costs and limited data resources have prevented their benefits from being shared equally amongst speakers of all the world's languages. To address issues of cross-linguistic access to such models and reduce energy consumption for sustainability during large-scale model training, this study proposes an effective and energy-efficient framework called GreenPLM that uses bilingual lexicons to directly "translate" pre-trained language models of one language into another at almost no additional cost. We validate this approach in 18 languages' BERT models and show that this framework is comparable to, if not better than, other heuristics with high training costs. In addition, given lightweight continued pre-training on limited data where available, this framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Linear Warmup With Linear Decay · Dense Connections · WordPiece · Attention Dropout · Softmax · Layer Normalization
