A Deep Learning Approach to Geographical Candidate Selection through   Toponym Matching

Mariona Coll Ardanuy; Kasra Hosseini; Katherine McDonough; Amrey; Krause; Daniel van Strien; Federico Nanni

arXiv:2009.08114·cs.CL·September 23, 2020

A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching

Mariona Coll Ardanuy, Kasra Hosseini, Katherine McDonough, Amrey, Krause, Daniel van Strien, Federico Nanni

PDF

2 Repos

TL;DR

This paper presents a deep learning method for candidate selection in toponym resolution, improving geographic entity recognition in noisy, multilingual, and historical texts by leveraging neural network architectures and new datasets.

Contribution

It introduces a novel deep learning approach for toponym candidate selection and evaluates it on diverse, realistic datasets, including historical OCR'd texts.

Findings

01

Effective in cross-lingual and regional scenarios

02

Handles OCR errors well

03

Improves downstream toponym resolution performance

Abstract

Recognizing toponyms and resolving them to their real-world referents is required for providing advanced semantic access to textual data. This process is often hindered by the high degree of variation in toponyms. Candidate selection is the task of identifying the potential entities that can be referred to by a toponym previously recognized. While it has traditionally received little attention in the research community, it has been shown that candidate selection has a significant impact on downstream tasks (i.e. entity resolution), especially in noisy or non-standard text. In this paper, we introduce a flexible deep learning method for candidate selection through toponym matching, using state-of-the-art neural network architectures. We perform an intrinsic toponym matching evaluation based on several new realistic datasets, which cover various challenging scenarios (cross-lingual and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.