Comparative Evaluation of Prompting and Fine-Tuning for Applying Large Language Models to Grid-Structured Geospatial Data
Akash Dhruv, Yangxinyu Xie, Jordan Branham, Tanwi Mallick

TL;DR
This study compares prompting and fine-tuning methods for large language models in interpreting grid-structured geospatial data, revealing their respective strengths and limitations in spatial-temporal reasoning tasks.
Contribution
It provides a systematic evaluation of prompting versus fine-tuning for LLMs applied to geospatial data, highlighting the advantages of fine-tuning for complex reasoning.
Findings
Fine-tuning improves accuracy in structured geospatial reasoning.
Prompting performs well in zero-shot scenarios but has limitations.
Fine-tuned models excel in temporal and spatial reasoning tasks.
Abstract
This paper presents a comparative study of large language models (LLMs) in interpreting grid-structured geospatial data. We evaluate the performance of a base model through structured prompting and contrast it with a fine-tuned variant trained on a dataset of user-assistant interactions. Our results highlight the strengths and limitations of zero-shot prompting and demonstrate the benefits of fine-tuning for structured geospatial and temporal reasoning.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBalanced Selection
