ProxyLM: Predicting Language Model Performance on Multilingual Tasks via   Proxy Models

David Anugraha; Genta Indra Winata; Chenyue Li; Patrick Amadeus; Irawan; En-Shiun Annie Lee

arXiv:2406.09334·cs.CL·December 17, 2024·1 cites

ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models

David Anugraha, Genta Indra Winata, Chenyue Li, Patrick Amadeus, Irawan, En-Shiun Annie Lee

PDF

Open Access 1 Repo 1 Video

TL;DR

ProxyLM introduces a scalable, language-agnostic framework that uses proxy models to accurately predict multilingual NLP task performance, significantly reducing computational costs while outperforming existing methods.

Contribution

We propose ProxyLM, a novel framework that employs proxy models to efficiently predict language model performance across multilingual tasks and unseen languages.

Findings

01

Achieves up to 37.08x speedup in performance prediction.

02

Outperforms state-of-the-art by at least 1.78x in RMSE.

03

Effectively generalizes to unseen languages and datasets.

Abstract

Performance prediction is a method to estimate the performance of Language Models (LMs) on various Natural Language Processing (NLP) tasks, mitigating computational costs associated with model capacity and data for fine-tuning. Our paper presents ProxyLM, a scalable task- and language-agnostic framework designed to predict the performance of LMs using proxy models. These proxy models act as surrogates, approximating the performance of the LM of interest. By leveraging these proxy models, ProxyLM significantly reduces computational overhead in task evaluations, achieving up to a 37.08x speedup over traditional methods, even with our smallest proxy models. Our results across multiple multilingual NLP tasks and various robustness tests demonstrate that ProxyLM not only adapts well to previously unseen languages in pre-trained LMs, but also generalizes effectively across different datasets,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

davidanugraha/proxylm
noneOfficial

Videos

ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models· underline

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification