GR-NLP-TOOLKIT: An Open-Source NLP Toolkit for Modern Greek
Lefteris Loukas, Nikolaos Smyrnioudis, Chrysa Dikonomaki, Spyros Barbakos, Anastasios Toumazatos, John Koutsikakis, Manolis Kyriakakis, Mary Georgiou, Stavros Vassos, John Pavlopoulos, Ion Androutsopoulos

TL;DR
GR-NLP-TOOLKIT is a comprehensive open-source NLP toolkit tailored for modern Greek, offering state-of-the-art performance across key NLP tasks using pre-trained Transformers, and accessible via Python and HuggingFace.
Contribution
It introduces a specialized Greek NLP toolkit with high performance, accessible implementation, and comparison to existing open-source solutions, filling a gap in Greek language processing resources.
Findings
Achieves state-of-the-art results in five NLP tasks for Greek
Provides an easy-to-install Python package with pre-trained models
Demonstrates superior performance compared to existing tools
Abstract
We present GR-NLP-TOOLKIT, an open-source natural language processing (NLP) toolkit developed specifically for modern Greek. The toolkit provides state-of-the-art performance in five core NLP tasks, namely part-of-speech tagging, morphological tagging, dependency parsing, named entity recognition, and Greeklish-to-Greek transliteration. The toolkit is based on pre-trained Transformers, is freely available, and can be easily installed in Python (pip install gr-nlp-toolkit). It is also accessible through a demonstration platform on HuggingFace, along with a publicly available API for non-commercial use. We discuss the functionality provided for each task, the underlying methods, experiments against comparable open-source toolkits, and possible future enhancements. The toolkit is available at: https://github.com/nlpaueb/gr-nlp-toolkit
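As a rough illustration of what using the package after installation might look like, here is a minimal sketch. It is not taken from this page: the `Pipeline` class, the processor string "pos,ner,dp", and the token attribute names (`text`, `upos`, `feats`, `ner`, `head`, `deprel`) are assumptions based on the project's README and should be checked against the repository. The helper names `token_rows` and `analyze` are hypothetical and introduced here only for illustration.

```python
from types import SimpleNamespace

# Token attribute names assumed from the project's README; verify against the repository.
TOKEN_FIELDS = ("text", "upos", "feats", "ner", "head", "deprel")


def token_rows(tokens, fields=TOKEN_FIELDS):
    """Return one dict per token, using None for any attribute that is missing."""
    return [{f: getattr(tok, f, None) for f in fields} for tok in tokens]


def analyze(text, processors="pos,ner,dp"):
    """Run the toolkit on `text` and return per-token rows.

    Requires `pip install gr-nlp-toolkit`. The toolkit may fetch model
    weights the first time a pipeline is created.
    """
    # Imported lazily so that token_rows() can be used without the package installed.
    from gr_nlp_toolkit import Pipeline  # assumed entry point

    nlp = Pipeline(processors)  # assumed: comma-separated processor names
    doc = nlp(text)
    return token_rows(doc.tokens)


if __name__ == "__main__":
    # Offline stand-in token with placeholder labels, to try the helper without the toolkit.
    fake = SimpleNamespace(text="Αθήνα", upos="PROPN", ner="S-LOC")
    print(token_rows([fake]))
```

With the package installed, calling `analyze` on a Greek sentence would return one row per token. Keeping the row formatting separate from the pipeline call means the output shape can be inspected without downloading any models.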
Taxonomy
Topics: Natural Language Processing Techniques
