Multilingual Few-Shot Learning via Language Model Retrieval

Genta Indra Winata; Liang-Kang Huang; Soumya Vadlamannati; Yash; Chandarana

arXiv:2306.10964·cs.CL·June 21, 2023·2 cites

Multilingual Few-Shot Learning via Language Model Retrieval

Genta Indra Winata, Liang-Kang Huang, Soumya Vadlamannati, Yash, Chandarana

PDF

Open Access

TL;DR

This paper explores a retrieval-based approach for multilingual few-shot learning, demonstrating that selecting semantically similar samples improves model performance across various natural language understanding tasks in multiple languages.

Contribution

It introduces a retrieval method for selecting relevant few-shot samples, enhancing multilingual and cross-lingual model performance without gradient updates.

Findings

01

Consistently outperforms random sampling in monolingual tasks.

02

Improves accuracy in cross-lingual settings.

03

Effective across multiple NLP tasks.

Abstract

Transformer-based language models have achieved remarkable success in few-shot in-context learning and drawn a lot of research interest. However, these models' performance greatly depends on the choice of the example prompts and also has high variability depending on how samples are chosen. In this paper, we conduct a comprehensive study of retrieving semantically similar few-shot samples and using them as the context, as it helps the model decide the correct label without any gradient update in the multilingual and cross-lingual settings. We evaluate the proposed method on five natural language understanding datasets related to intent detection, question classification, sentiment analysis, and topic classification. The proposed method consistently outperforms random sampling in monolingual and cross-lingual tasks in non-English languages.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Text and Document Classification Technologies