GeoLLM: Extracting Geospatial Knowledge from Large Language Models
Rohin Manvi, Samar Khanna, Gengchen Mai, Marshall Burke, David Lobell,, Stefano Ermon

TL;DR
This paper introduces GeoLLM, a novel method that leverages large language models and auxiliary map data to extract geospatial knowledge, significantly improving prediction accuracy for tasks like population density and economic livelihoods.
Contribution
GeoLLM is the first approach to effectively extract geospatial knowledge from LLMs using auxiliary map data, outperforming existing baselines and satellite benchmarks.
Findings
GeoLLM achieves 70% performance improvement over baselines.
GPT-3.5 outperforms Llama 2 and RoBERTa by 19% and 51%.
LLMs are rich in geospatial information and sample-efficient.
Abstract
The application of machine learning (ML) in a range of geospatial tasks is increasingly common but often relies on globally available covariates such as satellite imagery that can either be expensive or lack predictive power. Here we explore the question of whether the vast amounts of knowledge found in Internet language corpora, now compressed within large language models (LLMs), can be leveraged for geospatial prediction tasks. We first demonstrate that LLMs embed remarkable spatial information about locations, but naively querying LLMs using geographic coordinates alone is ineffective in predicting key indicators like population density. We then present GeoLLM, a novel method that can effectively extract geospatial knowledge from LLMs with auxiliary map data from OpenStreetMap. We demonstrate the utility of our approach across multiple tasks of central interest to the international…
Peer Reviews
Decision·ICLR 2024 poster
The paper is well presented, and the experiments cover several different geospatial datasets and tasks in relation to census and demographic data.
• It seems that the proposed GeoLLM can only perform one specific task. After fine-tuning, is the fine-tuned LLMs (e.g., GPT-3.5) able to retain the ability to answer general questions that is not related to the specific task? Including some discussions about the generalization part could be useful. • The proposed model can only handle static/tabular geo information. It does not handle other types of spatial data, or spatiotemporal data and tasks. • Baselines are too simple. Considering some m
1. It proposed a novel method for efficiently extracting geospatial knowledge from large language models. 2. The paper outlined experiments to evaluate extracting geospatial knowledge from large language models, which included constructing a comprehensive benchmark, developing a robust set of baselines, and presenting results and an ablation study. 3. The paper revealed that GeoLLMs are sample-efficient, rich in geospatial information, and robust across the globe.
1. The paper does not provide a detailed analysis of the potential biases of LLMs and their training corpora. 2. It would be better to compare the GeoLLM’s performance with the results from satellite images.
1. This paper is well structured and good at the clarity of presentation. 2. The significance of the problem is high, potentially impacting a wide range of geospatial applications and offering a new way to view large language model capabilities. 3. The paper provides a detailed discussion of how large language models can be linked to geospatial applications. This can potentially bring new research opportunities to the field of spatial data mining.
1. The method presented seems to build incrementally on existing methodologies to generate auxiliary texts for prompt engineering, which may not represent a significance in technical novelty. 2. The scope of experiments could be further expanded to support the conclusion. Please consider the Questions below for details.
Code & Models
Videos
Taxonomy
TopicsGeographic Information Systems Studies · Semantic Web and Ontologies · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Dropout · Weight Decay · Softmax · Byte Pair Encoding
