XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training,   Understanding and Generation

Yaobo Liang; Nan Duan; Yeyun Gong; Ning Wu; Fenfei Guo; Weizhen Qi,; Ming Gong; Linjun Shou; Daxin Jiang; Guihong Cao; Xiaodong Fan; Ruofei Zhang,; Rahul Agrawal; Edward Cui; Sining Wei; Taroon Bharti; Ying Qiao; Jiun-Hung; Chen; Winnie Wu; Shuguang Liu; Fan Yang; Daniel Campos; Rangan Majumder; Ming; Zhou

arXiv:2004.01401·cs.CL·May 25, 2020·67 cites

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation

Yaobo Liang, Nan Duan, Yeyun Gong, Ning Wu, Fenfei Guo, Weizhen Qi,, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Xiaodong Fan, Ruofei Zhang,, Rahul Agrawal, Edward Cui, Sining Wei, Taroon Bharti, Ying Qiao, Jiun-Hung, Chen, Winnie Wu, Shuguang Liu, Fan Yang, Daniel Campos

PDF

Open Access 2 Repos 3 Models 5 Datasets

TL;DR

XGLUE is a comprehensive multilingual benchmark dataset designed to evaluate cross-lingual models across understanding and generation tasks, offering diverse tasks and multi-language labels to advance multilingual NLP research.

Contribution

The paper introduces XGLUE, a new benchmark dataset with 11 diverse tasks and multi-language labels, and extends a cross-lingual model to evaluate performance across understanding and generation.

Findings

01

Unicoder achieves strong baseline performance on XGLUE.

02

Multilingual BERT, XLM, and XLM-R are evaluated for comparison.

03

XGLUE covers both understanding and generation tasks across multiple languages.

Abstract

In this paper, we introduce XGLUE, a new benchmark dataset that can be used to train large-scale cross-lingual pre-trained models using multilingual and bilingual corpora and evaluate their performance across a diverse set of cross-lingual tasks. Comparing to GLUE(Wang et al., 2019), which is labeled in English for natural language understanding tasks only, XGLUE has two main advantages: (1) it provides 11 diversified tasks that cover both natural language understanding and generation scenarios; (2) for each task, it provides labeled data in multiple languages. We extend a recent cross-lingual pre-trained model Unicoder(Huang et al., 2019) to cover both understanding and generation tasks, which is evaluated on XGLUE as a strong baseline. We also evaluate the base versions (12-layer) of Multilingual BERT, XLM and XLM-R for comparison.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification

MethodsXLM-R · Linear Layer · Weight Decay · WordPiece · Linear Warmup With Linear Decay · BERT · Multi-Head Attention · Residual Connection · Attention Dropout · Byte Pair Encoding