On the Limitations of Sociodemographic Adaptation with Transformers
Chia-Chien Hung, Anne Lauscher, Dirk Hovy, Simone Paolo Ponzetto,, Goran Glava\v{s}

TL;DR
This paper examines the effectiveness of sociodemographic adaptation in pretrained Transformers, revealing that while it can improve NLP performance, the benefits are often confounded by other factors, highlighting ongoing challenges in the field.
Contribution
It evaluates sociodemographic adaptation methods in Transformers across multiple languages, demonstrating their limitations and the influence of confounding factors on performance gains.
Findings
Sociodemographic adaptation improves performance but is confounded by domain and language effects.
Performance gains are consistent across four languages with multilingual models.
Sociodemographic specialization remains an unresolved challenge in NLP.
Abstract
Sociodemographic factors (e.g., gender or age) shape our language. Previous work showed that incorporating specific sociodemographic factors can consistently improve performance for various NLP tasks in traditional NLP models. We investigate whether these previous findings still hold with state-of-the-art pretrained Transformers. We use three common specialization methods proven effective for incorporating external knowledge into pretrained Transformers (e.g., domain-specific or geographic knowledge). We adapt the language representations for the sociodemographic dimensions of gender and age, using continuous language modeling and dynamic multi-task learning for adaptation, where we couple language modeling with the prediction of a sociodemographic class. Our results when employing a multilingual model show substantial performance gains across four languages (English, German, French,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling
