Multi-Label Sentiment Analysis on 100 Languages with Dynamic Weighting   for Label Imbalance

Selim F. Yilmaz; E. Batuhan Kaynak; Aykut Ko\c{c}; Hamdi; Dibeklio\u{g}lu; Suleyman S. Kozat

arXiv:2008.11573·cs.LG·February 8, 2022

Multi-Label Sentiment Analysis on 100 Languages with Dynamic Weighting for Label Imbalance

Selim F. Yilmaz, E. Batuhan Kaynak, Aykut Ko\c{c}, Hamdi, Dibeklio\u{g}lu, Suleyman S. Kozat

PDF

1 Repo

TL;DR

This paper presents a novel multi-label sentiment analysis framework for 100 languages that uses dynamic class weighting and optimized thresholds, achieving state-of-the-art results across multiple languages and metrics.

Contribution

It introduces a dynamic weighting method and an efficient threshold selection technique for multi-label sentiment analysis across many languages, improving over static methods and previous benchmarks.

Findings

01

Achieves state-of-the-art performance in 7 out of 9 metrics across 3 languages.

02

Effectively handles class imbalance with dynamic weighting.

03

Supports sentiment analysis in 100 languages with a single model.

Abstract

We investigate cross-lingual sentiment analysis, which has attracted significant attention due to its applications in various areas including market research, politics and social sciences. In particular, we introduce a sentiment analysis framework in multi-label setting as it obeys Plutchik wheel of emotions. We introduce a novel dynamic weighting method that balances the contribution from each class during training, unlike previous static weighting methods that assign non-changing weights based on their class frequency. Moreover, we adapt the focal loss that favors harder instances from single-label object recognition literature to our multi-label setting. Furthermore, we derive a method to choose optimal class-specific thresholds that maximize the macro-f1 score in linear time complexity. Through an extensive set of experiments, we show that our method obtains the state-of-the-art…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

selimfirat/multilingual-sentiment-analysis
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsFocal Loss