A Weakly Supervised Dataset of Fine-Grained Emotions in Portuguese

Diogo Cortiz; Jefferson O. Silva; Newton Calegari; Ana Lu\'isa; Freitas; Ana Ang\'elica Soares; Carolina Botelho; Gabriel Gaudencio R\^ego,; Waldir Sampaio; Paulo Sergio Boggio

arXiv:2108.07638·cs.CL·October 11, 2021·1 cites

A Weakly Supervised Dataset of Fine-Grained Emotions in Portuguese

Diogo Cortiz, Jefferson O. Silva, Newton Calegari, Ana Lu\'isa, Freitas, Ana Ang\'elica Soares, Carolina Botelho, Gabriel Gaudencio R\^ego,, Waldir Sampaio, Paulo Sergio Boggio

PDF

Open Access 1 Repo

TL;DR

This paper presents a weakly supervised, lexical-based dataset for fine-grained emotion recognition in Portuguese, demonstrating its effectiveness by fine-tuning a BERT model with promising results in a low-resource setting.

Contribution

It introduces a novel weakly supervised dataset for fine-grained emotion recognition in Portuguese, suitable for low-resource NLP environments.

Findings

01

F1-score of 0.64 on validation set

02

Lexical-based weak supervision is effective for low-resource languages

03

Dataset enables initial emotion recognition research in Portuguese

Abstract

Affective Computing is the study of how computers can recognize, interpret and simulate human affects. Sentiment Analysis is a common task inNLP related to this topic, but it focuses only on emotion valence (positive, negative, neutral). An emerging approach in NLP is Emotion Recognition, which relies on fined-grained classification. This research describes an approach to create a lexical-based weakly supervised corpus for fine-grained emotion in Portuguese. We evaluated our dataset by fine-tuning a transformer-based language model (BERT) and validating it on a Gold Standard annotated validation set. Our results (F1-score=.64) suggest lexical-based weak supervision as an appropriate strategy for initial work in low resourced environment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

diogocortiz/portugueseemotionrecognitionweaksupervision
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Emotion and Mood Recognition