Improving Social Meaning Detection with Pragmatic Masking and Surrogate Fine-Tuning
Chiyu Zhang, Muhammad Abdul-Mageed

TL;DR
This paper introduces pragmatic masking and surrogate fine-tuning strategies to improve social meaning detection in social media, achieving significant gains across multiple datasets and languages, especially in few-shot scenarios.
Contribution
It presents novel masking and fine-tuning methods that leverage social cues, outperforming existing models on diverse social meaning detection tasks.
Findings
Achieved 2.34% higher F1 than baseline on Twitter datasets.
Significantly improved few-shot learning performance with only 5% training data.
Demonstrated language-agnostic effectiveness in zero-shot multilingual settings.
Abstract
Masked language models (MLMs) are pre-trained with a denoising objective that is in a mismatch with the objective of downstream fine-tuning. We propose pragmatic masking and surrogate fine-tuning as two complementing strategies that exploit social cues to drive pre-trained representations toward a broad set of concepts useful for a wide class of social meaning tasks. We test our models on different Twitter datasets for social meaning detection. Our methods achieve over a competitive baseline, while outperforming domain-specific language models pre-trained on large datasets. Our methods also excel in few-shot learning: with only of training data (severely few-shot), our methods enable an impressive average . The methods are also language agnostic, as we show in a zero-shot setting involving six datasets from three different languages.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Topic Modeling · Natural Language Processing Techniques
