MaNLP@SMM4H22: BERT for Classification of Twitter Posts

Keshav Kapur; Rajitha Harikrishnan

arXiv:2301.05395·cs.CL·January 16, 2023

MaNLP@SMM4H22: BERT for Classification of Twitter Posts

Keshav Kapur, Rajitha Harikrishnan

PDF

Open Access

TL;DR

This paper presents a BERT-based binary classifier for identifying tweets reporting exact age, achieving around 80-81% F1 score in the SMM4H shared task.

Contribution

It introduces a straightforward BERT-based approach with different preprocessing strategies for classifying age-related tweets.

Findings

01

F1 scores of 0.80 and 0.81 achieved

02

Effective binary classification of age-related tweets

03

Simple preprocessing variations impact performance

Abstract

The reported work is our straightforward approach for the shared task Classification of tweets self-reporting age organized by the Social Media Mining for Health Applications (SMM4H) workshop. This literature describes the approach that was used to build a binary classification system, that classifies the tweets related to birthday posts into two classes namely, exact age(positive class) and non-exact age(negative class). We made two submissions with variations in the preprocessing of text which yielded F1 scores of 0.80 and 0.81 when evaluated by the organizers.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuthorship Attribution and Profiling · Topic Modeling · Recommender Systems and Techniques