From LIMA to DeepLIMA: following a new path of interoperability

Victor Bocharov; Romaric Besan\c{c}on; Ga\"el de Chalendar; Olivier; Ferret; Nasredine Semmar

arXiv:2409.06550·cs.CL·September 11, 2024

From LIMA to DeepLIMA: following a new path of interoperability

Victor Bocharov, Romaric Besan\c{c}on, Ga\"el de Chalendar, Olivier, Ferret, Nasredine Semmar

PDF

Open Access

TL;DR

This paper presents DeepLIMA, an evolution of the LIMA framework, integrating deep neural network modules for multilingual text analysis, expanding language support, and promoting interoperability through standardized models and data.

Contribution

The paper introduces DeepLIMA, enhancing LIMA with deep learning modules, increased language support, and standardized interoperability using Universal Dependencies.

Findings

01

Supported over 60 languages with new models

02

Integrated deep learning modules into LIMA

03

Enabled interoperability through standardized models

Abstract

In this article, we describe the architecture of the LIMA (Libre Multilingual Analyzer) framework and its recent evolution with the addition of new text analysis modules based on deep neural networks. We extended the functionality of LIMA in terms of the number of supported languages while preserving existing configurable architecture and the availability of previously developed rule-based and statistical analysis components. Models were trained for more than 60 languages on the Universal Dependencies 2.5 corpora, WikiNer corpora, and CoNLL-03 dataset. Universal Dependencies allowed us to increase the number of supported languages and to generate models that could be integrated into other platforms. This integration of ubiquitous Deep Learning Natural Language Processing models and the use of standard annotated collections using Universal Dependencies can be viewed as a new path of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMathematics, Computing, and Information Processing · Natural Language Processing Techniques