The Massively Multilingual Natural Language Understanding 2022   (MMNLU-22) Workshop and Competition

Christopher Hench; Charith Peris; Jack FitzGerald; Kay Rottmann

arXiv:2212.06346·cs.CL·December 14, 2022

The Massively Multilingual Natural Language Understanding 2022 (MMNLU-22) Workshop and Competition

Christopher Hench, Charith Peris, Jack FitzGerald, Kay Rottmann

PDF

Open Access

TL;DR

This paper introduces a new multilingual NLU dataset, organizes a challenge and workshop to advance multilingual NLU technology, and reports on progress and findings from these initiatives.

Contribution

It provides a large multilingual dataset, a competitive challenge, and a workshop to foster research and improve transferability of NLU models across languages.

Findings

01

Progress in multilingual NLU transferability

02

Enhanced dataset for voice assistant evaluation

03

Community engagement through challenge and workshop

Abstract

Despite recent progress in Natural Language Understanding (NLU), the creation of multilingual NLU systems remains a challenge. It is common to have NLU systems limited to a subset of languages due to lack of available data. They also often vary widely in performance. We launch a three-phase approach to address the limitations in NLU and help propel NLU technology to new heights. We release a 52 language dataset called the Multilingual Amazon SLU resource package (SLURP) for Slot-filling, Intent classification, and Virtual assistant Evaluation, or MASSIVE, in an effort to address parallel data availability for voice assistants. We organize the Massively Multilingual NLU 2022 Challenge to provide a competitive environment and push the state-of-the art in the transferability of models into other languages. Finally, we host the first Massively Multilingual NLU workshop which brings these…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Multimodal Machine Learning Applications