ProxyLM: Predicting Language Model Performance on Multilingual Tasks via Proxy Models
David Anugraha, Genta Indra Winata, Chenyue Li, Patrick Amadeus, Irawan, En-Shiun Annie Lee

TL;DR
ProxyLM introduces a scalable, language-agnostic framework that uses proxy models to accurately predict multilingual NLP task performance, significantly reducing computational costs while outperforming existing methods.
Contribution
We propose ProxyLM, a novel framework that employs proxy models to efficiently predict language model performance across multilingual tasks and unseen languages.
Findings
Achieves up to 37.08x speedup in performance prediction.
Outperforms state-of-the-art by at least 1.78x in RMSE.
Effectively generalizes to unseen languages and datasets.
Abstract
Performance prediction is a method to estimate the performance of Language Models (LMs) on various Natural Language Processing (NLP) tasks, mitigating computational costs associated with model capacity and data for fine-tuning. Our paper presents ProxyLM, a scalable task- and language-agnostic framework designed to predict the performance of LMs using proxy models. These proxy models act as surrogates, approximating the performance of the LM of interest. By leveraging these proxy models, ProxyLM significantly reduces computational overhead in task evaluations, achieving up to a 37.08x speedup over traditional methods, even with our smallest proxy models. Our results across multiple multilingual NLP tasks and various robustness tests demonstrate that ProxyLM not only adapts well to previously unseen languages in pre-trained LMs, but also generalizes effectively across different datasets,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
