A symbolic Perl algorithm for the unification of Nahuatl word spellings
Juan-José Guzmán-Landa, Jesús Vázquez-Osorio, Juan-Manuel Torres-Moreno, Ligia Quintana Torres, Miguel Figueroa-Saavedra, Martha-Lorena Avendaño-Garrido, Graham Ranger, Patricia Velázquez-Morales, Gerardo Eugenio Sierra Martínez

TL;DR
This paper presents a symbolic algorithm for automatically unifying different Nawatl orthographies, improving consistency in text documents through linguistic rule-based processing and evaluation.
Contribution
It introduces a novel symbolic unification algorithm based on linguistic rules and evaluates its effectiveness on Nawatl texts with encouraging results.
Findings
High-quality unification of Nawatl orthographies achieved
Effective use of symbolic regular expressions for linguistic rules
Positive evaluator feedback on sentence quality
Abstract
In this paper, we describe a symbolic model for the automatic orthographic unification of Nawatl text documents. Our model is based on algorithms that we have previously used to analyze sentences in Nawatl, and on the π-yalli corpus, which consists of texts in several Nawatl orthographies. Our automatic unification algorithm implements linguistic rules as symbolic regular expressions. We also present a manual evaluation protocol that we designed and implemented to assess the quality of the unified sentences generated by our algorithm, by testing them in a sentence semantic task. We obtained encouraging results from the evaluators for most of the desired features of our artificially unified sentences.
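To illustrate what "linguistic rules in symbolic regular expressions" can look like in practice, here is a minimal sketch in Python (not the authors' Perl code). The three rules, the `RULES` table, and the `unify` function are hypothetical and chosen only for illustration; they are not the rule set described in the paper, and real orthographic unification would need many more rules and exceptions.

```python
import re

# Hypothetical, illustrative rewrite rules (NOT the rules from the paper).
# Each entry is (compiled pattern, replacement). Rules are applied in order
# and are case-sensitive: only lowercase spellings are rewritten.
RULES = [
    (re.compile(r"hu(?=[aei])"), "w"),  # "hu" before a vowel  -> "w"
    (re.compile(r"qu(?=[ei])"), "k"),   # "qu" before e or i   -> "k"
    (re.compile(r"c(?=[ao])"), "k"),    # "c" before a or o    -> "k"
]


def unify(text: str) -> str:
    """Rewrite a string by applying every rule in RULES, in order."""
    out = text
    for pattern, replacement in RULES:
        out = pattern.sub(replacement, out)
    return out


if __name__ == "__main__":
    for word in ["Nahuatl", "Nawatl", "calli"]:
        print(word, "->", unify(word))
```

Under these assumed rules, `unify("Nahuatl")` returns `"Nawatl"`, and a word already in the target spelling passes through unchanged. Keeping the rules in an ordered table rather than in nested conditionals makes it easy to add, reorder, or audit individual rules.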
Taxonomy
Topics: Natural Language Processing Techniques · Digital Humanities and Scholarship · Mathematics, Computing, and Information Processing
