French Word Recognition through a Quick Survey on Recurrent Neural   Networks Using Long-Short Term Memory RNN-LSTM

Saman Sarraf

arXiv:1804.03683·cs.CV·April 12, 2018·1 cites

French Word Recognition through a Quick Survey on Recurrent Neural Networks Using Long-Short Term Memory RNN-LSTM

Saman Sarraf

PDF

Open Access

TL;DR

This paper demonstrates that RNN-LSTM models can achieve near-perfect accuracy in recognizing French printed words across multiple fonts, highlighting their robustness and effectiveness in OCR tasks.

Contribution

The study applies RNN-LSTM to French OCR, achieving high accuracy and demonstrating robustness across fonts, with optimized preprocessing for efficient training.

Findings

01

Achieved over 99.9% accuracy in OCR recognition

02

Demonstrated robustness of RNN-LSTM across multiple fonts

03

Validated preprocessing methods for optimal training

Abstract

Optical character recognition (OCR) is a fundamental problem in computer vision. Research studies have shown significant progress in classifying printed characters using deep learning-based methods and topologies. Among current algorithms, recurrent neural networks with long-short term memory blocks called RNN-LSTM have provided the highest performance in terms of accuracy rate. Using the top 5,000 French words collected from the internet including all signs and accents, RNN-LSTM models were trained and tested for several cases. Six fonts were used to generate OCR samples and an additional dataset that included all samples from these six fonts was prepared for training and testing purposes. The trained RNN-LSTM models were tested and achieved the accuracy rates of 99.98798% and 99.91889% for edit distance and sequence error, respectively. An accurate preprocessing followed by height…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Image Retrieval and Classification Techniques