A Novel Approach to OCR using Image Recognition based Classification for Ancient Tamil Inscriptions in Temples
Lalitha Giridhar, Aishwarya Dharani and, Velmathi Guruviah

TL;DR
This paper presents a new OCR method for ancient Tamil inscriptions using a CNN-based classification approach, achieving 77.7% accuracy, and integrates text-to-speech for audio output.
Contribution
It introduces a CNN-based OCR system specifically tailored for ancient Tamil script, utilizing a curated dataset and integrating Tesseract and Google's TTS for enhanced recognition and output.
Findings
Achieved 77.7% recognition accuracy on ancient Tamil inscriptions.
Developed a curated dataset of temple inscription characters.
Integrated OCR with text-to-speech for audio digitization.
Abstract
Recognition of ancient Tamil characters has always been a challenge for epigraphers. This is primarily because the language has evolved over the several centuries and the character set over this time has both expanded and diversified. This proposed work focuses on improving optical character recognition techniques for ancient Tamil script which was in use between the 7th and 12th centuries. While comprehensively curating a functional data set for ancient Tamil characters is an arduous task, in this work, a data set has been curated using cropped images of characters found on certain temple inscriptions, specific to this time as a case study. After using Otsu thresholding method for binarization of the image a two dimensional convolution neural network is defined and used to train, classify and, recognize the ancient Tamil characters. To implement the optical character recognition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHandwritten Text Recognition Techniques · Digital Media Forensic Detection · Vehicle License Plate Recognition
MethodsConvolution
