Agglomerative Clustering of Handwritten Numerals to Determine Similarity of Different Languages
Md. Rahat-uz-Zaman, Shadmaan Hye

TL;DR
This paper introduces a method to analyze and cluster handwritten numerals from different languages using Siamese networks and agglomerative clustering, revealing regional similarities among languages.
Contribution
It presents a novel approach combining Siamese networks and clustering to measure and analyze language similarities based on handwritten numerals.
Findings
Clusters reveal regional language groupings
Siamese network effectively measures numeral similarity
Method identifies language origins from numeral features
Abstract
Handwritten numerals of different languages have various characteristics. Similarities and dissimilarities of the languages can be measured by analyzing the extracted features of the numerals. Handwritten numeral datasets are available and accessible for many renowned languages of different regions. In this paper, several handwritten numeral datasets of different languages are collected. Then they are used to find the similarity among those written languages through determining and comparing the similitude of each handwritten numerals. This will help to find which languages have the same or adjacent parent language. Firstly, a similarity measure of two numeral images is constructed with a Siamese network. Secondly, the similarity of the numeral datasets is determined with the help of the Siamese network and a new random sample with replacement similarity averaging technique. Finally, an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSiamese Network
