An Automatic Approach for Generating Rich, Linked Geo-Metadata from Historical Map Images
Zekun Li, Yao-Yi Chiang, Sasan Tavakkol, Basel Shbita, Johannes H., Uhl, Stefan Leyk, and Craig A. Knoblock

TL;DR
This paper introduces mapKurator, an automated system that extracts and links geographic metadata from historical map images, enabling complex searches and indexing of maps spanning over a century.
Contribution
It presents an end-to-end automated approach for extracting, linking, and querying geographic information from historical maps, improving over manual and OCR-based methods.
Findings
Significant accuracy improvements over existing methods
Supports complex spatial queries on historical maps
System is applicable to diverse map styles and sources
Abstract
Historical maps contain detailed geographic information difficult to find elsewhere covering long-periods of time (e.g., 125 years for the historical topographic maps in the US). However, these maps typically exist as scanned images without searchable metadata. Existing approaches making historical maps searchable rely on tedious manual work (including crowd-sourcing) to generate the metadata (e.g., geolocations and keywords). Optical character recognition (OCR) software could alleviate the required manual work, but the recognition results are individual words instead of location phrases (e.g., "Black" and "Mountain" vs. "Black Mountain"). This paper presents an end-to-end approach to address the real-world problem of finding and indexing historical map images. This approach automatically processes historical map images to extract their text content and generates a set of metadata that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies · Advanced Image and Video Retrieval Techniques · Data Management and Algorithms
