Quantifying Scripts: Defining metrics of characters for quantitative and descriptive analysis
Vinodh Rajan

TL;DR
This paper introduces a comprehensive set of metrics for character analysis in scripts, aiding digital paleography by quantifying glyph features and enabling script comparison.
Contribution
It presents a novel, grounded metric framework for character analysis, including new metrics, to enhance quantitative script analysis and differentiation.
Findings
Metrics successfully differentiate glyphs in medieval Tamil script.
Proposed metrics are based on handwriting production principles.
Application demonstrates utility in paleographic evolution study.
Abstract
Analysis of scripts plays an important role in paleography and in quantitative linguistics. Especially in the field of digital paleography quantitative features are much needed to differentiate glyphs. We describe an elaborate set of metrics that quantify qualitative information contained in characters and hence indirectly also quantify the scribal features. We broadly divide the metrics into several categories and describe each individual metric with its underlying qualitative significance. The metrics are largely derived from the related area of gesture design and recognition. We also propose several novel metrics. The proposed metrics are soundly grounded on the principles of handwriting production and handwriting analysis. These computed metrics could serve as descriptors for scripts and also be used for comparing and analyzing scripts. We illustrate some quantitative analysis based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
