Legal-Tech Open Diaries: Lesson learned on how to develop and deploy   light-weight models in the era of humongous Language Models

Stelios Maroudas; Sotiris Legkas; Prodromos Malakasiotis; Ilias; Chalkidis

arXiv:2210.13086·cs.CL·October 25, 2022

Legal-Tech Open Diaries: Lesson learned on how to develop and deploy light-weight models in the era of humongous Language Models

Stelios Maroudas, Sotiris Legkas, Prodromos Malakasiotis, Ilias, Chalkidis

PDF

Open Access

TL;DR

This paper explores developing and deploying lightweight legal language models using domain-specific training and model compression techniques, addressing high resource demands of large models in legal tech.

Contribution

It introduces domain-specific multilingual legal models and evaluates their performance, demonstrating effective compression techniques that maintain accuracy while reducing resource needs.

Findings

01

Domain-specific models outperform general ones in legal tasks.

02

Model compression techniques significantly reduce size and computational requirements.

03

Lightweight models achieve comparable performance to larger models.

Abstract

In the era of billion-parameter-sized Language Models (LMs), start-ups have to follow trends and adapt their technology accordingly. Nonetheless, there are open challenges since the development and deployment of large models comes with a need for high computational resources and has economical consequences. In this work, we follow the steps of the R&D group of a modern legal-tech start-up and present important insights on model development and deployment. We start from ground zero by pre-training multiple domain-specific multi-lingual LMs which are a better fit to contractual and regulatory text compared to the available alternatives (XLM-R). We present benchmark results of such models in a half-public half-private legal benchmark comprising 5 downstream tasks showing the impact of larger model size. Lastly, we examine the impact of a full-scale pipeline for model compression which…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Topic Modeling

MethodsPruning · Knowledge Distillation