GeoLLM: Extracting Geospatial Knowledge from Large Language Models

Rohin Manvi; Samar Khanna; Gengchen Mai; Marshall Burke; David Lobell,; Stefano Ermon

arXiv:2310.06213·cs.CL·February 27, 2024·29 cites

GeoLLM: Extracting Geospatial Knowledge from Large Language Models

Rohin Manvi, Samar Khanna, Gengchen Mai, Marshall Burke, David Lobell,, Stefano Ermon

PDF

Open Access 1 Repo 1 Video 3 Reviews

TL;DR

This paper introduces GeoLLM, a novel method that leverages large language models and auxiliary map data to extract geospatial knowledge, significantly improving prediction accuracy for tasks like population density and economic livelihoods.

Contribution

GeoLLM is the first approach to effectively extract geospatial knowledge from LLMs using auxiliary map data, outperforming existing baselines and satellite benchmarks.

Findings

01

GeoLLM achieves 70% performance improvement over baselines.

02

GPT-3.5 outperforms Llama 2 and RoBERTa by 19% and 51%.

03

LLMs are rich in geospatial information and sample-efficient.

Abstract

The application of machine learning (ML) in a range of geospatial tasks is increasingly common but often relies on globally available covariates such as satellite imagery that can either be expensive or lack predictive power. Here we explore the question of whether the vast amounts of knowledge found in Internet language corpora, now compressed within large language models (LLMs), can be leveraged for geospatial prediction tasks. We first demonstrate that LLMs embed remarkable spatial information about locations, but naively querying LLMs using geographic coordinates alone is ineffective in predicting key indicators like population density. We then present GeoLLM, a novel method that can effectively extract geospatial knowledge from LLMs with auxiliary map data from OpenStreetMap. We demonstrate the utility of our approach across multiple tasks of central interest to the international…

Peer Reviews

Decision·ICLR 2024 poster

Reviewer 01Rating 3· reject, not good enoughConfidence 4

Strengths

The paper is well presented, and the experiments cover several different geospatial datasets and tasks in relation to census and demographic data.

Weaknesses

• It seems that the proposed GeoLLM can only perform one specific task. After fine-tuning, is the fine-tuned LLMs (e.g., GPT-3.5) able to retain the ability to answer general questions that is not related to the specific task? Including some discussions about the generalization part could be useful. • The proposed model can only handle static/tabular geo information. It does not handle other types of spatial data, or spatiotemporal data and tasks. • Baselines are too simple. Considering some m

Reviewer 02Rating 6· marginally above the acceptance thresholdConfidence 3

Strengths

1. It proposed a novel method for efficiently extracting geospatial knowledge from large language models. 2. The paper outlined experiments to evaluate extracting geospatial knowledge from large language models, which included constructing a comprehensive benchmark, developing a robust set of baselines, and presenting results and an ablation study. 3. The paper revealed that GeoLLMs are sample-efficient, rich in geospatial information, and robust across the globe.

Weaknesses

1. The paper does not provide a detailed analysis of the potential biases of LLMs and their training corpora. 2. It would be better to compare the GeoLLM’s performance with the results from satellite images.

Reviewer 03Rating 8· accept, good paperConfidence 4

Strengths

1. This paper is well structured and good at the clarity of presentation. 2. The significance of the problem is high, potentially impacting a wide range of geospatial applications and offering a new way to view large language model capabilities. 3. The paper provides a detailed discussion of how large language models can be linked to geospatial applications. This can potentially bring new research opportunities to the field of spatial data mining.

Weaknesses

1. The method presented seems to build incrementally on existing methodologies to generate auxiliary texts for prompt engineering, which may not represent a significance in technical novelty. 2. The scope of experiments could be further expanded to support the conclusion. Please consider the Questions below for details.

Code & Models

Repositories

rohinmanvi/GeoLLM
noneOfficial

Videos

GeoLLM: Extracting Geospatial Knowledge from Large Language Models· slideslive

Taxonomy

TopicsGeographic Information Systems Studies · Semantic Web and Ontologies · Natural Language Processing Techniques

MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · {Dispute@FaQ-s}How to file a dispute with Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Cosine Annealing · Dropout · Weight Decay · Softmax · Byte Pair Encoding