Multilingual Detection of Personal Employment Status on Twitter

Manuel Tonneau; Dhaval Adjodah; João Palotti; Nir Grinberg; Samuel Fraiberger

arXiv:2203.09178 · cs.CL · June 14, 2022

TL;DR

This paper explores active learning strategies to improve the detection of personal employment disclosures on Twitter across multiple languages, demonstrating significant gains in precision and recall after only a few labeling iterations.

Contribution

The paper evaluates three active learning strategies for multilingual employment status detection and shows that they remain effective under extreme class imbalance when paired with BERT-based classifiers.

Findings

01

Active learning improves precision and recall significantly.

02

Few iterations of active learning yield substantial performance gains.

03

No single active learning strategy is universally best.

Abstract

Detecting disclosures of individuals' employment status on social media can provide valuable information to match job seekers with suitable vacancies, offer social protection, or measure labor market flows. However, identifying such personal disclosures is a challenging task due to their rarity in a sea of social media content and the variety of linguistic forms used to describe them. Here, we examine three Active Learning (AL) strategies in real-world settings of extreme class imbalance, and identify five types of disclosures about individuals' employment status (e.g. job loss) in three languages using BERT-based classification models. Our findings show that, even under extreme imbalance settings, a small number of AL iterations is sufficient to obtain large and significant gains in precision, recall, and diversity of results compared to a supervised baseline with the same number of…
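As a rough illustration of what a single active learning selection step can look like, the sketch below ranks an unlabeled pool by classifier score and picks items to send for annotation. It is a generic example, not the authors' implementation: the two selection rules (uncertainty sampling and top-score retrieval) are common textbook strategies and are not necessarily the three examined in the paper, and the function names and scores are hypothetical.

```python
from typing import List, Sequence


def select_most_uncertain(probs: Sequence[float], k: int) -> List[int]:
    """Return indices of the k items whose positive-class probability is closest to 0.5."""
    ranked = sorted(range(len(probs)), key=lambda i: (abs(probs[i] - 0.5), i))
    return ranked[:k]


def select_top_scoring(probs: Sequence[float], k: int) -> List[int]:
    """Return indices of the k items the model scores as most likely positive."""
    ranked = sorted(range(len(probs)), key=lambda i: (-probs[i], i))
    return ranked[:k]


# Hypothetical classifier scores for six unlabeled tweets (made-up numbers).
pool_scores = [0.02, 0.48, 0.91, 0.10, 0.55, 0.03]

# Items to label next under each rule; ties are broken by pool position.
uncertain = select_most_uncertain(pool_scores, k=2)
confident = select_top_scoring(pool_scores, k=2)
```

In a full loop, the selected items would be labeled, added to the training set, and the classifier retrained before the next round. When positives are very rare, retrieving high-scoring items tends to surface more of them per labeling round, while uncertainty sampling concentrates on examples near the decision boundary.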

Code & Models

Repositories

manueltonneau/twitter-unemployment
Official

Taxonomy

Methods: Multi-Head Attention · Attention Is All You Need · Linear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · WordPiece · Dropout