A Freely Available Morphological Analyzer, Disambiguator and Context Sensitive Lemmatizer for German
Wolfgang Lezius (University of Paderborn), Reinhard Rapp (University, of Mainz), Manfred Wettler (University of Paderborn)

TL;DR
Morphy is a comprehensive, freely accessible tool for German language processing that integrates morphological analysis, disambiguation, and context-sensitive lemmatization, covering a wide range of word forms including compounds.
Contribution
It introduces Morphy, an integrated German language tool combining morphology, POS tagging, and lemmatization with extensive lexical coverage and open availability.
Findings
Large lexicon with 320,000+ word forms
Effective disambiguation using statistical POS tagging
Open-source and freely downloadable
Abstract
In this paper we present Morphy, an integrated tool for German morphology, part-of-speech tagging and context-sensitive lemmatization. Its large lexicon of more than 320,000 word forms plus its ability to process German compound nouns guarantee a wide morphological coverage. Syntactic ambiguities can be resolved with a standard statistical part-of-speech tagger. By using the output of the tagger, the lemmatizer can determine the correct root even for ambiguous word forms. The complete package is freely available and can be downloaded from the World Wide Web.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Speech and dialogue systems
