Analyzer and generator for Pali
David Alfter

TL;DR
This paper presents a system for morphological analysis and generation of Pali words. It combines a lexical database with inflectional rules: generated forms are stored so the analyzer can look them up directly, and a rule-based fallback handles words the database does not cover.
Contribution
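The paper introduces a combined rule-based and database-driven system for Pali morphological analysis and generation. Looking up pre-generated forms spares the analyzer from recomputing morphological information, and the rule-based component keeps the system usable when the database is incomplete.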
Findings
Efficient lookup of morphological forms through stored data.
System can operate with or without a lexical database.
Effective handling of regular inflectional paradigms.
Abstract
This work describes a system that performs morphological analysis and generation of Pali words. The system works with regular inflectional paradigms and a lexical database. The generator is used to build a collection of inflected and derived words, which in turn is used by the analyzer. Generating and storing morphological forms along with the corresponding morphological information allows for efficient and simple lookup by the analyzer. Indeed, by looking up a word and extracting the attached morphological information, the analyzer does not have to compute this information. Because the lexical database must be assumed to be incomplete, however, the system can also work without the dictionary component, using a rule-based approach.
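The generate-then-look-up design described in the abstract can be illustrated with a minimal sketch. It is not the paper's implementation: the function names (`generate`, `build_index`, `analyze`), the data layout, and the small set of masculine a-stem endings are assumptions chosen for illustration, and the fallback is a naive ending-stripping heuristic standing in for the rule-based component.

```python
from collections import defaultdict

# Illustrative subset of endings for masculine a-stem nouns.
# Assumption: a simplified table for this sketch, not the paper's paradigms.
ENDINGS = {
    ("nom", "sg"): "o",
    ("acc", "sg"): "aṃ",
    ("gen", "sg"): "assa",
    ("nom", "pl"): "ā",
    ("acc", "pl"): "e",
}


def generate(lemma):
    """Yield (form, analysis) pairs for a lemma ending in -a."""
    stem = lemma[:-1]  # drop the final -a before attaching an ending
    for (case, number), ending in ENDINGS.items():
        yield stem + ending, (lemma, case, number)


def build_index(lexicon):
    """Generate every form once and store it with its analysis."""
    index = defaultdict(list)
    for lemma in lexicon:
        for form, analysis in generate(lemma):
            index[form].append(analysis)
    return index


def analyze(word, index):
    """Look the word up; fall back to ending-stripping if it is unknown."""
    if word in index:
        return list(index[word])
    # Rule-based fallback: try the longest endings first and guess a lemma.
    guesses = []
    by_length = sorted(ENDINGS.items(), key=lambda item: -len(item[1]))
    for (case, number), ending in by_length:
        if word.endswith(ending) and len(word) > len(ending):
            guesses.append((word[: -len(ending)] + "a", case, number))
    return guesses


index = build_index(["buddha"])
known = analyze("buddhassa", index)  # found by lookup
unknown = analyze("dhammo", index)   # not in the index, analyzed by rule
```

In this sketch the lookup path returns stored analyses without any computation at query time, while the fallback path produces guesses whose lemmas are not verified against any lexicon.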
Taxonomy
Topics: Natural Language Processing Techniques · Speech and dialogue systems
