A framework for lexical representation
Jos\'e M. Go\~ni, Jos\'e C. Gonz\'alez (E.T.S.I. Telecomunicaci\'on,, Universidad Polit\'ecnica de Madrid, Madrid, Spain)

TL;DR
This paper introduces a unification-based lexical framework tailored for highly inflected languages, enabling automatic dictionary generation and supporting linguistic analysis through specialized software tools.
Contribution
It presents a novel formalism for encoding lemmas and generating allomorph-indexed dictionaries, enhancing linguistic processing for complex languages.
Findings
Efficient automatic dictionary generation from lemma-based sources
Implementation of software tools for morphological processing
Formalism suited for highly inflected languages
Abstract
In this paper we present a unification-based lexical platform designed for highly inflected languages (like Roman ones). A formalism is proposed for encoding a lemma-based lexical source, well suited for linguistic generalizations. From this source, we automatically generate an allomorph indexed dictionary, adequate for efficient processing. A set of software tools have been implemented around this formalism: access libraries, morphological processors, etc.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Mathematics, Computing, and Information Processing · Semantic Web and Ontologies
