Unreasonable Effectiveness of Rule-Based Heuristics in Solving Russian SuperGLUE Tasks
Tatyana Iazykova, Denis Kapelyushnik, Olga Bystrova, Andrey Kutuzov

TL;DR
This paper demonstrates that simple rule-based heuristics can effectively solve Russian SuperGLUE tasks, revealing vulnerabilities in the benchmark and questioning the true language understanding of top models.
Contribution
The study shows that Russian SuperGLUE datasets are susceptible to shallow heuristics, and offers recommendations to improve the benchmark's robustness and representativeness.
Findings
Simple rules outperform or match advanced models on Russian SuperGLUE (RSG) tasks.
Russian SuperGLUE datasets contain exploitable statistical cues.
Shallow heuristics significantly influence top model performance.
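To make the idea of an "exploitable statistical cue" concrete, here is a minimal sketch of a one-token rule. It is not the authors' heuristics and uses no RSG data: the toy English sentence/label pairs and the names `best_cue`, `majority_label`, and `predict` are all assumptions made up for illustration. The sketch looks for the single token whose presence most reliably predicts one label, then classifies by that token alone and falls back to the majority class otherwise.

```python
from collections import Counter

# Hypothetical toy data (NOT taken from Russian SuperGLUE): sentence/label pairs
# in which the token "not" happens to co-occur only with one label.
TRAIN = [
    ("the cat is not sleeping", "contradiction"),
    ("the man is not outside", "contradiction"),
    ("a woman is not singing", "contradiction"),
    ("a dog is running in the park", "entailment"),
    ("the children play football", "entailment"),
    ("a man reads a book", "entailment"),
    ("two birds sit on a fence", "entailment"),
]


def majority_label(examples):
    """Return the most frequent label among (text, label) pairs."""
    return Counter(label for _, label in examples).most_common(1)[0][0]


def best_cue(examples, min_count=2):
    """Find the token whose presence best predicts a single label.

    Returns (token, label, precision), or None if no token occurs
    in at least `min_count` examples.
    """
    pair_counts = Counter()   # (token, label) -> number of examples
    token_counts = Counter()  # token -> number of examples containing it
    for text, label in examples:
        for tok in set(text.lower().split()):
            pair_counts[(tok, label)] += 1
            token_counts[tok] += 1
    candidates = [
        (hits / token_counts[tok], hits, tok, label)
        for (tok, label), hits in pair_counts.items()
        if token_counts[tok] >= min_count
    ]
    if not candidates:
        return None
    # Highest precision wins; ties are broken by support, then alphabetically.
    precision, _, tok, label = max(candidates)
    return tok, label, precision


def predict(text, cue, fallback):
    """Predict the cue's label if the cue token is present, else the fallback."""
    tok, label, _ = cue
    return label if tok in text.lower().split() else fallback


cue = best_cue(TRAIN)
fallback = majority_label(TRAIN)
print(cue)
print(predict("the boy is not running", cue, fallback))
print(predict("a girl smiles", cue, fallback))
```

A rule like this never looks at meaning, so any accuracy it achieves above the majority-class baseline indicates a cue in the data rather than language understanding.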
Abstract
Leaderboards like SuperGLUE are seen as important incentives for active development of NLP, since they provide standard benchmarks for fair comparison of modern language models. They have driven the world's best engineering teams, along with their resources, to collaborate and solve a set of tasks for general language understanding. Their performance scores are often claimed to be close to or even higher than human performance. These results encouraged a more thorough analysis of whether the benchmark datasets featured any statistical cues that machine-learning-based language models can exploit. For English datasets, it was shown that they often contain annotation artifacts. This allows solving certain tasks with very simple rules and achieving competitive rankings. In this paper, a similar analysis was done for the Russian SuperGLUE (RSG), a recently published benchmark set and…
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification
Methods: Linear Layer · Cosine Annealing · Linear Warmup With Cosine Annealing · Byte Pair Encoding · Softmax · WordPiece · Dense Connections · Attention Is All You Need
