GreenPLM: Cross-Lingual Transfer of Monolingual Pre-Trained Language   Models at Almost No Cost

Qingcheng Zeng; Lucas Garay; Peilin Zhou; Dading Chong; Yining Hua,; Jiageng Wu; Yikang Pan; Han Zhou; Rob Voigt; Jie Yang

arXiv:2211.06993·cs.CL·May 29, 2023·1 cites

GreenPLM: Cross-Lingual Transfer of Monolingual Pre-Trained Language Models at Almost No Cost

Qingcheng Zeng, Lucas Garay, Peilin Zhou, Dading Chong, Yining Hua,, Jiageng Wu, Yikang Pan, Han Zhou, Rob Voigt, Jie Yang

PDF

Open Access 1 Repo

TL;DR

GreenPLM introduces an energy-efficient method to transfer monolingual pre-trained language models across languages using bilingual lexicons, significantly reducing costs and energy consumption while improving performance in low-resource settings.

Contribution

The paper presents a novel framework that enables cross-lingual transfer of pre-trained models with minimal additional cost and energy, outperforming original models in several languages.

Findings

01

Comparable or better performance than high-cost heuristics

02

Outperforms original models with up to 200x less pre-training

03

Reduces language inequalities and energy consumption

Abstract

Large pre-trained models have revolutionized natural language processing (NLP) research and applications, but high training costs and limited data resources have prevented their benefits from being shared equally amongst speakers of all the world's languages. To address issues of cross-linguistic access to such models and reduce energy consumption for sustainability during large-scale model training, this study proposes an effective and energy-efficient framework called GreenPLM that uses bilingual lexicons to directly "translate" pre-trained language models of one language into another at almost no additional cost. We validate this approach in 18 languages' BERT models and show that this framework is comparable to, if not better than, other heuristics with high training costs. In addition, given lightweight continued pre-training on limited data where available, this framework…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

qcznlp/greenplms
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Multi-Head Attention · Linear Warmup With Linear Decay · Dense Connections · WordPiece · Attention Dropout · Softmax · Layer Normalization