Improving Results on Russian Sentiment Datasets

Anton Golubev; Natalia Loukachevitch

arXiv:2007.14310·cs.CL·July 29, 2020

TL;DR

This paper evaluates standard neural networks (CNN, LSTM, BiLSTM) and BERT models on Russian sentiment datasets. The conversational variant of Russian BERT performs better on every task, and the BERT-NLI model gives the best results, practically reaching the human level on one dataset.

Contribution

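It shows that BERT-NLI, which treats sentiment classification as a natural language inference task, is effective for Russian sentiment analysis. It also compares two variants of Russian BERT and finds that the conversational variant performs better on all sentiment tasks studied.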

Findings

01

BERT-NLI achieves near-human performance on one dataset.

02

The conversational variant of Russian BERT outperforms the other Russian BERT variant on all sentiment tasks in the study.

03

BERT-based models outperform traditional neural networks.

Abstract

In this study, we test standard neural network architectures (CNN, LSTM, BiLSTM) and recently introduced BERT architectures on previous Russian sentiment evaluation datasets. We compare two variants of Russian BERT and show that, for all sentiment tasks in this study, the conversational variant of Russian BERT performs better. The best results were achieved by the BERT-NLI model, which treats sentiment classification as a natural language inference task. On one of the datasets, this model practically achieves the human level.
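The NLI framing mentioned in the abstract can be illustrated with a small sketch. It is not the authors' implementation: the label set, the English hypothesis template, and the function names `to_nli_pairs` and `predict_label` are all assumptions made for illustration. The idea shown is only the general one, that each text is paired with one hypothesis sentence per candidate label, and a sentence-pair classifier (such as BERT) then scores whether each hypothesis is entailed.

```python
from typing import List, Tuple

# Hypothetical label set and hypothesis template; the auxiliary sentences
# actually used in the paper are not given on this page.
LABELS = ("positive", "negative", "neutral")
TEMPLATE = "The sentiment of this text is {label}."


def to_nli_pairs(text: str, gold_label: str) -> List[Tuple[str, str, int]]:
    """Expand one labeled text into (premise, hypothesis, entailed) triples.

    The hypothesis built from the gold label gets target 1, all others 0.
    """
    if gold_label not in LABELS:
        raise ValueError(f"unknown label: {gold_label}")
    return [
        (text, TEMPLATE.format(label=label), int(label == gold_label))
        for label in LABELS
    ]


def predict_label(entailment_scores: List[float]) -> str:
    """Return the label whose hypothesis received the highest entailment score.

    Scores are assumed to be ordered like LABELS and to come from some
    sentence-pair classifier, which is outside the scope of this sketch.
    """
    if len(entailment_scores) != len(LABELS):
        raise ValueError("expected one score per label")
    best = max(range(len(LABELS)), key=lambda i: entailment_scores[i])
    return LABELS[best]
```

At training time each triple would be fed to the pair classifier as a binary example; at inference time the text is paired with every hypothesis and `predict_label` picks the best-scoring one.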

Code & Models

Repositories

antongolubev5/Targeted-SA-for-Russian-Datasets
PyTorch · Official

Taxonomy

Methods: Linear Layer · Multi-Head Attention · Dense Connections · WordPiece · Residual Connection · Attention Is All You Need · Adam · Linear Warmup With Linear Decay · Weight Decay