Geographic Adaptation of Pretrained Language Models

Valentin Hofmann; Goran Glava\v{s}; Nikola Ljube\v{s}i\'c; Janet B.; Pierrehumbert; Hinrich Sch\"utze

arXiv:2203.08565·cs.CL·January 30, 2024

Geographic Adaptation of Pretrained Language Models

Valentin Hofmann, Goran Glava\v{s}, Nikola Ljube\v{s}i\'c, Janet B., Pierrehumbert, Hinrich Sch\"utze

PDF

1 Repo

TL;DR

This paper introduces geoadaptation, a multi-task training method that enhances pretrained language models with geolinguistic knowledge, significantly improving their performance on geolocation and language identification tasks.

Contribution

It presents a novel geoadaptation approach that couples language modeling with geolocation prediction, effectively injecting geographic knowledge into PLMs.

Findings

01

Geoadapted PLMs outperform baseline models on geolocation and language ID tasks.

02

Achieves state-of-the-art results on two geolocation benchmarks.

03

Geoadaptation improves zero-shot prediction capabilities.

Abstract

While pretrained language models (PLMs) have been shown to possess a plethora of linguistic knowledge, the existing body of research has largely neglected extralinguistic knowledge, which is generally difficult to obtain by pretraining on text alone. Here, we contribute to closing this gap by examining geolinguistic knowledge, i.e., knowledge about geographic variation in language. We introduce geoadaptation, an intermediate training step that couples language modeling with geolocation prediction in a multi-task learning setup. We geoadapt four PLMs, covering language groups from three geographic areas, and evaluate them on five different tasks: fine-tuned (i.e., supervised) geolocation prediction, zero-shot (i.e., unsupervised) geolocation prediction, fine-tuned language identification, zero-shot language identification, and zero-shot prediction of dialect features. Geoadaptation is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

valentinhofmann/geoadaptation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.