TL;DR
This paper evaluates neural network and BERT models on Russian sentiment datasets, finding that BERT-NLI achieves near-human performance, especially with conversational Russian BERT variants.
Contribution
It demonstrates the effectiveness of BERT-NLI for Russian sentiment analysis and compares different BERT variants, highlighting the superiority of conversational BERT.
Findings
BERT-NLI achieves near-human performance on one dataset.
Conversational Russian BERT variants outperform other models.
BERT-based models outperform traditional neural networks.
Abstract
In this study, we test standard neural network architectures (CNN, LSTM, BiLSTM) and recently appeared BERT architectures on previous Russian sentiment evaluation datasets. We compare two variants of Russian BERT and show that for all sentiment tasks in this study the conversational variant of Russian BERT performs better. The best results were achieved by BERT-NLI model, which treats sentiment classification tasks as a natural language inference task. On one of the datasets, this model practically achieves the human level.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLinear Layer · Multi-Head Attention · Dense Connections · WordPiece · Residual Connection · Attention Is All You Need · Refunds@Expedia|||How do I get a full refund from Expedia? · Adam · Linear Warmup With Linear Decay · Weight Decay
