The Large Language Model GreekLegalRoBERTa

Vasileios Saketos; Despina-Athanasia Pantazi; Manolis Koubarakis

arXiv:2410.12852·cs.CL·October 18, 2024

The Large Language Model GreekLegalRoBERTa

Vasileios Saketos, Despina-Athanasia Pantazi, Manolis Koubarakis

PDF

Open Access

TL;DR

This paper introduces GreekLegalRoBERTa, a set of large language models trained on Greek legal texts that outperform existing models in legal NLP tasks, advancing domain-specific NLP for low-resource languages.

Contribution

The paper presents four new GreekLegalRoBERTa models trained on Greek legal and nonlegal texts, demonstrating superior performance over existing Greek legal language models.

Findings

01

Models outperform GreekLegalBERT, Greek-LegalBERT-v2, and GreekBERT in legal NLP tasks.

02

Models achieve higher accuracy in named entity recognition and legal topic classification.

03

Work advances NLP for Greek, a low-resource language, in legal domain.

Abstract

We develop four versions of GreekLegalRoBERTa, which are four large language models trained on Greek legal and nonlegal text. We show that our models surpass the performance of GreekLegalBERT, Greek- LegalBERT-v2, and GreekBERT in two tasks involving Greek legal documents: named entity recognition and multi-class legal topic classification. We view our work as a contribution to the study of domain-specific NLP tasks in low-resource languages, like Greek, using modern NLP techniques and methodologies.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNatural Language Processing Techniques