Synthetic Map Generation to Provide Unlimited Training Data for   Historical Map Text Detection

Zekun Li; Runyu Guan; Qianmu Yu; Yao-Yi Chiang; Craig A. Knoblock

arXiv:2112.06104·cs.CV·December 14, 2021

Synthetic Map Generation to Provide Unlimited Training Data for Historical Map Text Detection

Zekun Li, Runyu Guan, Qianmu Yu, Yao-Yi Chiang, Craig A. Knoblock

PDF

TL;DR

This paper introduces a style transfer-based method to generate unlimited synthetic historical map images with annotated text, significantly improving text detection models' performance on historical maps.

Contribution

The paper presents a novel approach to automatically generate large-scale annotated historical map images using style transfer, reducing manual annotation efforts and enhancing text detection accuracy.

Findings

01

Synthetic maps improve text detection accuracy.

02

State-of-the-art models benefit from synthetic training data.

03

Method reduces manual annotation effort.

Abstract

Many historical map sheets are publicly available for studies that require long-term historical geographic data. The cartographic design of these maps includes a combination of map symbols and text labels. Automatically reading text labels from map images could greatly speed up the map interpretation and helps generate rich metadata describing the map content. Many text detection algorithms have been proposed to locate text regions in map images automatically, but most of the algorithms are trained on out-ofdomain datasets (e.g., scenic images). Training data determines the quality of machine learning models, and manually annotating text regions in map images is labor-extensive and time-consuming. On the other hand, existing geographic data sources, such as Open- StreetMap (OSM), contain machine-readable map layers, which allow us to separate out the text layer and obtain text label…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings