Word and character segmentation directly in run-length compressed handwritten document images
Amarnath R, P. Nagabhushan, Mohammed Javed

TL;DR
This paper presents a method for performing word and character segmentation directly on run-length compressed handwritten document images, with the aim of reducing computation time and memory use.
Contribution
It introduces a novel approach for segmentation directly in compressed images using connected components and min-cut graph techniques, applicable to multiple languages.
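As a rough illustration of the min-cut idea, the sketch below splits one connected component at its weakest join. The graph construction is an assumption, not the paper's: each foreground run is a node (given as a hypothetical `(row, start, end)` tuple with a half-open column range), runs in adjacent rows are linked with a capacity equal to their column overlap, and a plain Edmonds-Karp max-flow finds the cut. The names `run_graph` and `min_cut` are illustrative only.

```python
from collections import deque


def run_graph(runs):
    """Link runs in adjacent rows; capacity = number of shared columns (assumed weighting)."""
    capacity = {i: {} for i in range(len(runs))}
    for i, (ri, si, ei) in enumerate(runs):
        for j, (rj, sj, ej) in enumerate(runs):
            overlap = min(ei, ej) - max(si, sj)
            if rj == ri + 1 and overlap > 0:
                capacity[i][j] = capacity[j][i] = overlap
    return capacity


def min_cut(capacity, source, sink):
    """Edmonds-Karp max-flow. Returns (cut value, set of nodes on the source side)."""
    # Residual graph; make sure every edge has a reverse entry.
    residual = {u: dict(nbrs) for u, nbrs in capacity.items()}
    for u, nbrs in capacity.items():
        for v in nbrs:
            residual.setdefault(v, {}).setdefault(u, 0)

    flow = 0
    while True:
        # Breadth-first search for an augmenting path in the residual graph.
        parents = {source: None}
        queue = deque([source])
        while queue:
            u = queue.popleft()
            for v, cap in residual[u].items():
                if cap > 0 and v not in parents:
                    parents[v] = u
                    queue.append(v)
        if sink not in parents:
            # No path left: the nodes still reachable form the source side of the cut.
            return flow, set(parents)

        # Find the bottleneck capacity along the path, then push that much flow.
        bottleneck = float("inf")
        v = sink
        while parents[v] is not None:
            bottleneck = min(bottleneck, residual[parents[v]][v])
            v = parents[v]
        v = sink
        while parents[v] is not None:
            u = parents[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        flow += bottleneck


# Toy component: two thick strokes joined by a one-column-wide link (made-up data).
runs = [(0, 0, 4), (1, 0, 4), (2, 3, 5), (3, 4, 9), (4, 4, 9)]
cut_value, left_side = min_cut(run_graph(runs), source=0, sink=4)
```

In this toy case the cut falls where the overlap between consecutive runs is narrowest, so runs 0 and 1 end up on one side. Choosing the source and sink runs, and the edge weights, would need to follow the paper's actual formulation.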
Findings
Effective segmentation of words and characters in compressed images.
Reduced computational time and memory usage.
Successful application to non-English handwritten documents.
Abstract
The literature demonstrates that performing text-line segmentation directly on run-length compressed handwritten document images significantly reduces computation time and memory use. In this paper, we investigate word and character segmentation directly on run-length compressed document images. First, the spreads of the characters are extracted from the foreground runs of the compressed data, and connected components are then established. The spacing between connected components is larger between adjacent words than within a word. Using this observation, a threshold is chosen empirically for inter-word separation. Every connected component within a word is further analysed for character segmentation. Here, the min-cut graph concept is used to separate touching characters.…
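The word-separation step described in the abstract can be sketched roughly as follows. This is a simplified illustration, not the authors' implementation: it assumes each row of a text line is stored as alternating run lengths that begin with a background run, it merges foreground runs into horizontal spans (a projection-based stand-in for true connected components), and the `gap_threshold` value is a made-up example of the empirically chosen threshold.

```python
def foreground_intervals(row_runs):
    """Yield half-open (start, end) column intervals of the foreground runs in one row.

    Assumes runs alternate background, foreground, ... starting with background.
    """
    col = 0
    for i, run in enumerate(row_runs):
        if i % 2 == 1 and run > 0:
            yield (col, col + run)
        col += run


def merge_spans(rows):
    """Union the foreground intervals of all rows into sorted, disjoint horizontal spans."""
    intervals = sorted(iv for row in rows for iv in foreground_intervals(row))
    spans = []
    for start, end in intervals:
        if spans and start <= spans[-1][1]:
            # Overlapping or touching columns: extend the current span.
            spans[-1][1] = max(spans[-1][1], end)
        else:
            spans.append([start, end])
    return [tuple(span) for span in spans]


def group_into_words(spans, gap_threshold):
    """Start a new word whenever the gap to the previous span reaches the threshold."""
    words = []
    for span in spans:
        if words and span[0] - words[-1][-1][1] < gap_threshold:
            words[-1].append(span)
        else:
            words.append([span])
    return words


# Two compressed rows of one text line (made-up data).
rows = [
    [1, 2, 2, 3, 6, 4],
    [0, 3, 3, 2, 7, 3],
]
spans = merge_spans(rows)
words = group_into_words(spans, gap_threshold=4)
```

Here the spans are built from run boundaries alone, so the rows are never expanded into a pixel grid. The gaps between spans are 2 and 6 columns, so a threshold of 4 groups the first two spans into one word and leaves the third as a second word.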
Taxonomy
Topics: Handwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Image Retrieval and Classification Techniques
