MIDV-2019: Challenges of the modern mobile-based document OCR

Konstantin Bulatov; Daniil Matalov; Vladimir V. Arlazarov

arXiv:1910.04009·cs.CV·February 12, 2020

TL;DR

This paper introduces MIDV-2019, a dataset of identity document video clips shot with modern high-resolution mobile cameras under strong projective distortions and low lighting, intended to advance mobile OCR research.

Contribution

The paper presents a new dataset, MIDV-2019, which addresses key issues that MIDV-500 did not cover, mainly significant projective distortions and different lighting conditions.

Findings

01 Baseline OCR performance varies significantly across capturing conditions.

02 The dataset highlights challenges in mobile document OCR.

03 The dataset provides a benchmark for future research.

Abstract

Recognition of identity documents using mobile devices has become the subject of a wide range of computer vision research. The portfolio of methods and algorithms for solving such tasks as face detection, document detection and rectification, text field recognition, and others is growing, and the scarcity of datasets has become an important issue. One of the openly accessible datasets for evaluating such methods is MIDV-500, containing video clips of 50 identity document types in various conditions. However, the variability of capturing conditions in MIDV-500 did not address some of the key issues, mainly significant projective distortions and different lighting conditions. In this paper we present the MIDV-2019 dataset, containing video clips shot with modern high-resolution mobile cameras, with strong projective distortions and with low lighting conditions. The description of the added data…

Code & Models

Repositories

SmartEngines/stoppers_modelling
