Word Spotting in Cursive Handwritten Documents using Modified Character Shape Codes
Sayantan Sarkar

TL;DR
This paper presents a novel word spotting technique for cursive handwritten English documents using Modified Character Shape Codes, enabling faster and more efficient search in large document image databases.
Contribution
It introduces a two-level selection process based on word size and character shape code, improving speed and reducing preprocessing compared to existing methods.
Findings
Enhanced search efficiency in handwritten document databases
Reduced preprocessing requirements
Faster word matching process
Abstract
There is a large collection of Handwritten English paper documents of Historical and Scientific importance. But paper documents are not recognized directly by computer. Hence the closest way of indexing these documents is by storing their document digital image. Hence a large database of document images can replace the paper documents. But the document and data corresponding to each image cannot be directly recognized by the computer. This paper applies the technique of word spotting using Modified Character Shape Code to Handwritten English document images for quick and efficient query search of words on a database of document images. It is different from other Word Spotting techniques as it implements two level of selection for word segments to match search query. First based on word size and then based on character shape code of query. It makes the process faster and more efficient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Image Processing and 3D Reconstruction · Vehicle License Plate Recognition
