Historical Document Image Segmentation with LDA-Initialized Deep Neural   Networks

Michele Alberti; Mathias Seuret; Vinaychandran Pondenkandath; Rolf; Ingold; Marcus Liwicki

arXiv:1710.07363·cs.CV·November 27, 2017

Historical Document Image Segmentation with LDA-Initialized Deep Neural Networks

Michele Alberti, Mathias Seuret, Vinaychandran Pondenkandath, Rolf, Ingold, Marcus Liwicki

PDF

1 Repo

TL;DR

This paper introduces a novel LDA-based weight initialization method for deep neural networks, specifically applied to historical document image segmentation, demonstrating faster training and improved accuracy over traditional methods.

Contribution

The paper presents a new LDA-based initialization technique for neural networks, enhancing training stability and performance in historical document segmentation tasks.

Findings

01

LDA initialization is quick and stable.

02

LDA-based initialization outperforms random methods.

03

Improves layout analysis accuracy.

Abstract

In this paper, we present a novel approach to perform deep neural networks layer-wise weight initialization using Linear Discriminant Analysis (LDA). Typically, the weights of a deep neural network are initialized with: random values, greedy layer-wise pre-training (usually as Deep Belief Network or as auto-encoder) or by re-using the layers from another network (transfer learning). Hence, many training epochs are needed before meaningful weights are learned, or a rather similar dataset is required for seeding a fine-tuning of transfer learning. In this paper, we describe how to turn an LDA into either a neural layer or a classification layer. We analyze the initialization technique on historical documents. First, we show that an LDA-based initialization is quick and leads to a very stable initialization. Furthermore, for the task of layout analysis at pixel level, we investigate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DIVA-DIA/LayoutAnalysisEvaluator
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLinear Discriminant Analysis · Deep Belief Network