Is ChatGPT a game changer for geocoding -- a benchmark for geocoding address parsing techniques
Zhengcong Yin, Diya Li, Daniel W. Goldberg

TL;DR
This paper evaluates GPT-3's performance in geocoding address parsing against traditional models using a newly created, diverse benchmark dataset of low-quality address inputs, highlighting current strengths and areas for improvement.
Contribution
It introduces a comprehensive benchmark dataset for geocoding address parsing and compares GPT-3 with transformer-based and LSTM models, providing insights into their relative performances.
Findings
Bidirectional LSTM-CRF outperforms GPT-3 and transformer models.
Transformer models perform comparably to LSTM-CRF.
GPT-3 shows potential with few-shot learning but needs further fine-tuning.
Abstract
The remarkable success of GPT models across various tasks, including toponymy recognition motivates us to assess the performance of the GPT-3 model in the geocoding address parsing task. To ensure that the evaluation more accurately mirrors performance in real-world scenarios with diverse user input qualities and resolve the pressing need for a 'gold standard' evaluation dataset for geocoding systems, we introduce a benchmark dataset of low-quality address descriptions synthesized based on human input patterns mining from actual input logs of a geocoding system in production. This dataset has 21 different input errors and variations; contains over 239,000 address records that are uniquely selected from streets across all U.S. 50 states and D.C.; and consists of three subsets to be used as training, validation, and testing sets. Building on this, we train and gauge the performance of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData-Driven Disease Surveillance · Data Quality and Management · Geographic Information Systems Studies
MethodsMulti-Head Attention · 15 Ways to Contact How can i speak to someone at Delta Airlines · Attention Is All You Need · Discriminative Fine-Tuning · Attention Dropout · Softmax · Dense Connections · Cosine Annealing · Adam · Residual Connection
