Outilex, plate-forme logicielle de traitement de textes \'ecrits
Olivier Blanc (IGM-LabInfo), Matthieu Constant (IGM-LabInfo), Eric, Laporte (IGM-LabInfo)

TL;DR
Outilex is a comprehensive, XML-based software platform for written text processing that integrates various linguistic resources and supports multiple approaches, aimed at research, development, and industry applications.
Contribution
It introduces a unified platform combining lexicon, grammar, and resource management with flexible formats and licensing, facilitating advanced text processing in multiple languages.
Findings
Includes manually constructed French and English lexicons from LADL
Supports combining statistical and resource-based approaches
Provides format converters and resource management tools
Abstract
The Outilex software platform, which will be made available to research, development and industry, comprises software components implementing all the fundamental operations of written text processing: processing without lexicons, exploitation of lexicons and grammars, language resource management. All data are structured in XML formats, and also in more compact formats, either readable or binary, whenever necessary; the required format converters are included in the platform; the grammar formats allow for combining statistical approaches with resource-based approaches. Manually constructed lexicons for French and English, originating from the LADL, and of substantial coverage, will be distributed with the platform under LGPL-LR license.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Speech and dialogue systems
