Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP   models

Alena Fenogenova; Maria Tikhonova; Vladislav Mikhailov; Tatiana; Shavrina; Anton Emelyanov; Denis Shevelev; Alexandr Kukushkin; Valentin; Malykh; Ekaterina Artemova

arXiv:2202.07791·cs.CL·February 17, 2022

Russian SuperGLUE 1.1: Revising the Lessons not Learned by Russian NLP models

Alena Fenogenova, Maria Tikhonova, Vladislav Mikhailov, Tatiana, Shavrina, Anton Emelyanov, Denis Shevelev, Alexandr Kukushkin, Valentin, Malykh, Ekaterina Artemova

PDF

Open Access 1 Datasets

TL;DR

Russian SuperGLUE 1.1 is an updated benchmark for evaluating Russian NLP models, incorporating new tasks, improved evaluation tools, and integration with industrial assessment frameworks to better measure model understanding and performance.

Contribution

The paper introduces Russian SuperGLUE 1.1 with new tasks, methodological improvements, and enhanced evaluation tools, addressing previous vulnerabilities and supporting recent models.

Findings

01

Enhanced benchmark with new understanding tasks

02

Improved evaluation toolkit supporting latest models

03

Integration with industrial evaluation framework

Abstract

In the last year, new neural architectures and multilingual pre-trained models have been released for Russian, which led to performance evaluation problems across a range of language understanding tasks. This paper presents Russian SuperGLUE 1.1, an updated benchmark styled after GLUE for Russian NLP models. The new version includes a number of technical, user experience and methodological improvements, including fixes of the benchmark vulnerabilities unresolved in the previous version: novel and improved tests for understanding the meaning of a word in context (RUSSE) along with reading comprehension and common sense reasoning (DaNetQA, RuCoS, MuSeRC). Together with the release of the updated datasets, we improve the benchmark toolkit based on \texttt{jiant} framework for consistent training and evaluation of NLP-models of various architectures which now supports the most recent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

RussianNLP/russian_super_glue
dataset· 894 dl
894 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques · Topic Modeling · Text Readability and Simplification