TL;DR
This paper introduces geoadaptation, a multi-task training method that enhances pretrained language models with geolinguistic knowledge, significantly improving their performance on geolocation and language identification tasks.
Contribution
It presents a novel geoadaptation approach that couples language modeling with geolocation prediction, effectively injecting geographic knowledge into PLMs.
Findings
Geoadapted PLMs outperform baseline models on geolocation and language ID tasks.
Achieves state-of-the-art results on two geolocation benchmarks.
Geoadaptation improves zero-shot prediction capabilities.
Abstract
While pretrained language models (PLMs) have been shown to possess a plethora of linguistic knowledge, the existing body of research has largely neglected extralinguistic knowledge, which is generally difficult to obtain by pretraining on text alone. Here, we contribute to closing this gap by examining geolinguistic knowledge, i.e., knowledge about geographic variation in language. We introduce geoadaptation, an intermediate training step that couples language modeling with geolocation prediction in a multi-task learning setup. We geoadapt four PLMs, covering language groups from three geographic areas, and evaluate them on five different tasks: fine-tuned (i.e., supervised) geolocation prediction, zero-shot (i.e., unsupervised) geolocation prediction, fine-tuned language identification, zero-shot language identification, and zero-shot prediction of dialect features. Geoadaptation is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
