AIR-JPMC@SMM4H'22: Classifying Self-Reported Intimate Partner Violence   in Tweets with Multiple BERT-based Models

Alec Candidato; Akshat Gupta; Xiaomo Liu; Sameena Shah

arXiv:2209.10763·cs.CL·September 23, 2022

AIR-JPMC@SMM4H'22: Classifying Self-Reported Intimate Partner Violence in Tweets with Multiple BERT-based Models

Alec Candidato, Akshat Gupta, Xiaomo Liu, Sameena Shah

PDF

Open Access

TL;DR

This paper describes an ensemble of five RoBERTa models that effectively classifies self-reported intimate partner violence in tweets, achieving top performance in a shared task.

Contribution

It introduces a novel ensemble approach using multiple BERT-based models for classifying sensitive social media content.

Findings

01

System outperformed baseline by 13%

02

Achieved best overall performance in shared task

03

Ensemble approach improved classification accuracy

Abstract

This paper presents our submission for the SMM4H 2022-Shared Task on the classification of self-reported intimate partner violence on Twitter (in English). The goal of this task was to accurately determine if the contents of a given tweet demonstrated someone reporting their own experience with intimate partner violence. The submitted system is an ensemble of five RoBERTa models each weighted by their respective F1-scores on the validation data-set. This system performed 13% better than the baseline and was the best performing system overall for this shared task.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection

MethodsAttention Is All You Need · Linear Layer · Softmax · Dropout · Weight Decay · Attention Dropout · Adam · Residual Connection · Layer Normalization · Dense Connections