Unsupervised Training Data Generation of Handwritten Formulas using   Generative Adversarial Networks with Self-Attention

Matthias Springstein; Eric M\"uller-Budack; Ralph Ewerth

arXiv:2106.09432·cs.CV·June 18, 2021

Unsupervised Training Data Generation of Handwritten Formulas using Generative Adversarial Networks with Self-Attention

Matthias Springstein, Eric M\"uller-Budack, Ralph Ewerth

PDF

1 Repo

TL;DR

This paper presents an innovative attention-based GAN system that synthesizes large datasets of handwritten mathematical formulas from LaTeX, addressing the data scarcity challenge in handwritten formula recognition.

Contribution

It introduces a novel GAN architecture with self-attention for translating rendered equations into handwritten formulas, generating extensive training data from LaTeX documents.

Findings

01

Generated datasets contain hundreds of thousands of formulas.

02

Synthesized data improves training for handwritten formula recognition.

03

Feasibility demonstrated on CROHME 2014 benchmark.

Abstract

The recognition of handwritten mathematical expressions in images and video frames is a difficult and unsolved problem yet. Deep convectional neural networks are basically a promising approach, but typically require a large amount of labeled training data. However, such a large training dataset does not exist for the task of handwritten formula recognition. In this paper, we introduce a system that creates a large set of synthesized training examples of mathematical expressions which are derived from LaTeX documents. For this purpose, we propose a novel attention-based generative adversarial network to translate rendered equations to handwritten formulas. The datasets generated by this approach contain hundreds of thousands of formulas, making it ideal for pretraining or the design of more complex models. We evaluate our synthesized dataset and the recognition approach on the CROHME…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

TIBHannover/formula_gan
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.