UniMorph 2.0: Universal Morphology
Christo Kirov, Ryan Cotterell, John Sylak-Glassman, Géraldine Walther, Ekaterina Vylomova, Patrick Xia, Manaal Faruqui, Sabrina J. Mielke, Arya D. McCarthy, Sandra Kübler, David Yarowsky, Jason Eisner, Mans Hulden

TL;DR
UniMorph 2.0 advances the collection and annotation of universal morphological data across languages, providing standardized resources and tools to improve NLP handling of complex morphology.
Contribution
This paper introduces updates to the UniMorph project, enhancing data collection, annotation, and dissemination of universal morphological resources for diverse languages.
Findings
Expanded multilingual morphological datasets
Improved annotation schema and tools
Enhanced accessibility of morphological resources
Abstract
The Universal Morphology (UniMorph) project is a collaborative effort to improve how NLP handles complex morphology across the world's languages. The project releases annotated morphological data using a universal tagset, the UniMorph schema. Each inflected form is associated with a lemma, which typically carries its underlying lexical meaning, and a bundle of morphological features from our schema. Additional supporting data and tools are also released on a per-language basis when available. UniMorph is based at the Center for Language and Speech Processing (CLSP) at Johns Hopkins University in Baltimore, Maryland, and is sponsored by the DARPA LORELEI program. This paper details advances made to the collection, annotation, and dissemination of project resources since the initial UniMorph release described at LREC 2016.
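The abstract describes each entry as an inflected form paired with a lemma and a bundle of morphological features. A minimal sketch of how such entries might be loaded and queried follows. It assumes a tab-separated layout of lemma, form, and semicolon-joined feature tags; the abstract does not specify that layout, and the sample rows, the `Entry` type, and the helper names `parse_entries` and `forms_with` are illustrative, not part of any official UniMorph tooling.

```python
from typing import FrozenSet, Iterable, List, NamedTuple


class Entry(NamedTuple):
    """One (lemma, inflected form, feature bundle) triple."""
    lemma: str
    form: str
    features: FrozenSet[str]


def parse_entries(text: str) -> List[Entry]:
    """Parse lines of 'lemma<TAB>form<TAB>TAG1;TAG2;...' (assumed layout)."""
    entries = []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue  # skip blank separator lines
        lemma, form, tags = line.split("\t")
        entries.append(Entry(lemma, form, frozenset(tags.split(";"))))
    return entries


def forms_with(entries: Iterable[Entry], lemma: str, wanted: Iterable[str]) -> List[str]:
    """Return forms of `lemma` whose feature bundle contains every tag in `wanted`."""
    wanted_set = frozenset(wanted)
    return [e.form for e in entries if e.lemma == lemma and wanted_set <= e.features]


# Illustrative sample rows (hypothetical data, not copied from a release).
SAMPLE = "run\trunning\tV;V.PTCP;PRS\nrun\tran\tV;PST\n"

if __name__ == "__main__":
    entries = parse_entries(SAMPLE)
    print(forms_with(entries, "run", {"PST"}))
```

Storing the feature bundle as a set rather than a string makes lookups independent of tag order, which suits a schema where a form is identified by which features it carries.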
Taxonomy
Topics: Natural Language Processing Techniques · Speech and dialogue systems · Topic Modeling
