Democratizing LLMs for Low-Resource Languages by Leveraging their   English Dominant Abilities with Linguistically-Diverse Prompts

Xuan-Phi Nguyen; Sharifah Mahani Aljunied; Shafiq Joty and; Lidong Bing

arXiv:2306.11372·cs.CL·July 22, 2024·2 cites

Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts

Xuan-Phi Nguyen, Sharifah Mahani Aljunied, Shafiq Joty and, Lidong Bing

PDF

Open Access 2 Models 1 Video

TL;DR

This paper introduces an unsupervised prompting approach that leverages high-resource languages to enable LLMs to perform well on low-resource languages, especially in translation and summarization tasks, without requiring supervised data.

Contribution

The authors propose a novel unsupervised prompting method using synthetic exemplars from diverse high-resource languages to improve LLM performance on low-resource languages.

Findings

01

Achieves comparable performance to supervised few-shot learning in translation tasks.

02

Fine-tuning on generated data enhances performance of smaller models to match larger ones.

03

Outperforms supervised prompting and baseline methods in multilingual summarization.

Abstract

Large language models (LLMs) are known to effectively perform tasks by simply observing few exemplars. However, in low-resource languages, obtaining such hand-picked exemplars can still be challenging, where unsupervised techniques may be necessary. Moreover, competent generative capabilities of LLMs are observed only in high-resource languages, while their performances among under-represented languages fall behind due to pre-training data imbalance. To elicit LLMs' ability onto low-resource languages without any supervised data, we propose to assemble synthetic exemplars from a diverse set of high-resource languages to prompt the LLMs to translate from any language into English. These prompts are then used to create intra-lingual exemplars to perform tasks in the target languages. Our unsupervised prompting method performs on par with supervised few-shot learning in LLMs of different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Models

Videos

Democratizing LLMs for Low-Resource Languages by Leveraging their English Dominant Abilities with Linguistically-Diverse Prompts· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Label Smoothing · Layer Normalization · Adam · Absolute Position Encodings · Softmax · Residual Connection