Do Language Models Know the Way to Rome?

Bastien Li\'etard; Mostafa Abdou; Anders S{\o}gaard

arXiv:2109.07971·cs.CL·September 17, 2021

Do Language Models Know the Way to Rome?

Bastien Li\'etard, Mostafa Abdou, Anders S{\o}gaard

PDF

TL;DR

This paper investigates whether language models encode meaningful geographic information by evaluating their understanding of city and country locations relative to real-world geography, revealing limited but improving geographic knowledge in larger models.

Contribution

It introduces a novel evaluation method using geographic ground truths to assess the spatial knowledge encoded in language models, highlighting the role of model size.

Findings

01

Larger models encode more geographic information.

02

Language models show limited geographic understanding.

03

Geographic knowledge improves with model size.

Abstract

The global geometry of language models is important for a range of applications, but language model probes tend to evaluate rather local relations, for which ground truths are easily obtained. In this paper we exploit the fact that in geography, ground truths are available beyond local relations. In a series of experiments, we evaluate the extent to which language model representations of city and country names are isomorphic to real-world geography, e.g., if you tell a language model where Paris and Berlin are, does it know the way to Rome? We find that language models generally encode limited geographic information, but with larger models performing the best, suggesting that geographic knowledge can be induced from higher-order co-occurrence statistics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.