Practical Text Classification With Large Pre-Trained Language Models
Neel Kant, Raul Puri, Nikolai Yakovenko, Bryan Catanzaro

TL;DR
This paper demonstrates that large-scale unsupervised language models combined with finetuning can effectively perform multi-emotion sentiment classification on complex, real-world datasets, achieving competitive results and surpassing commercial APIs.
Contribution
It introduces a practical approach using large pre-trained transformers and finetuning for multi-emotion sentiment classification on challenging datasets.
Findings
Achieved 0.69 F1 score on SemEval emotion classification.
Outperformed commercial sentiment analysis APIs on real Twitter data.
Effective on difficult categories like Fear, Disgust, and Anger.
Abstract
Multi-emotion sentiment classification is a natural language processing (NLP) problem with valuable use cases on real-world data. We demonstrate that large-scale unsupervised language modeling combined with finetuning offers a practical solution to this task on difficult datasets, including those with label class imbalance and domain-specific context. By training an attention-based Transformer network (Vaswani et al. 2017) on 40GB of text (Amazon reviews) (McAuley et al. 2015) and fine-tuning on the training set, our model achieves a 0.69 F1 score on the SemEval Task 1:E-c multi-dimensional emotion classification problem (Mohammad et al. 2018), based on the Plutchik wheel of emotions (Plutchik 1979). These results are competitive with state of the art models, including strong F1 scores on difficult (emotion) categories such as Fear (0.73), Disgust (0.77) and Anger (0.78), as well as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Text and Document Classification Technologies
MethodsLinear Layer · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Residual Connection · Byte Pair Encoding · Dense Connections · Label Smoothing · *Communicated@Fast*How Do I Communicate to Expedia? · Adam · Softmax
