# Self-supervised Data Bootstrapping for Deep Optical Character   Recognition of Identity Documents

**Authors:** Oliver Mothes, Joachim Denzler

arXiv: 1908.04027 · 2019-08-13

## TL;DR

This paper introduces a self-supervised bootstrapping method that combines real and synthetic data to train CNNs for OCR of identity documents, significantly improving accuracy over synthetic-only training.

## Contribution

It presents a novel iterative self-supervised approach for mining real character data from ID images, enhancing OCR performance without relying on predefined dictionaries.

## Key findings

- Achieved 99.4% class-wise accuracy on 74 character classes.
- Outperformed an established open-source OCR framework.
- Synthetic-only training resulted in 58.1% accuracy.

## Abstract

The essential task of verifying person identities at airports and national borders is very time consuming. To accelerate it, optical character recognition for identity documents (IDs) using dictionaries is not appropriate due to high variability of the text content in IDs, e.g., individual street names or surnames. Additionally, no properties of the used fonts in IDs are known. Therefore, we propose an iterative self-supervised bootstrapping approach using a smart strategy to mine real character data from IDs. In combination with synthetically generated character data, the real data is used to train efficient convolutional neural networks for character classification serving a practical runtime as well as a high accuracy. On a dataset with 74 character classes, we achieve an average class-wise accuracy of 99.4 %. In contrast, if we would apply a classifier trained only using synthetic data, the accuracy is reduced to 58.1 %. Finally, we show that our whole proposed pipeline outperforms an established open-source framework

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1908.04027/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/1908.04027/full.md

## References

19 references — full list in the complete paper: https://tomesphere.com/paper/1908.04027/full.md

---
Source: https://tomesphere.com/paper/1908.04027