Du corpus au dictionnaire (From Corpus to Dictionary)
Bernard Jacquemin (LIMSI), Sabine Ploux (L2C2)

TL;DR
This paper introduces an automatic method for creating multi-lingual lexico-semantic resources using the Atlas sémantiques model, enabling semantic browsing across languages based on syntactic contexts.
Contribution
The paper presents an approach for building multi-lingual semantic maps from corpora, linking senses and contexts across languages to support semantic browsing.
Findings
Generated multi-lingual semantic resources from corpora
Linked senses and contexts across languages
Enabled cross-lingual semantic browsing
Abstract
In this article, we propose an automatic process for building multi-lingual lexico-semantic resources. The goal of these resources is to allow semantic browsing of the textual information contained in texts in different languages. The method uses a mathematical model called Atlas sémantiques to represent the different senses of each word. It uses the linguistic relations between words to create graphs that are projected into a semantic space. These projections constitute semantic maps that denote the sense trends of each given word. The model is fed with syntactic relations between words extracted from a corpus. The resulting lexico-semantic resource therefore describes all the words, and all their meanings, observed in the corpus. The sense trends are expressed by syntactic contexts typical of a given meaning. The link between each sense trend and the utterances used to build the sense…
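As a rough illustration of the first step the abstract describes (syntactic relations feeding a graph for a given word), here is a minimal sketch. The toy triples, the function names `contexts_by_word` and `context_graph`, and the rule used to link two nodes are all hypothetical choices made for illustration; the sketch does not implement the Atlas sémantiques model or its projection into a semantic space.

```python
from collections import defaultdict
from itertools import combinations

# Hypothetical toy input: (head, relation, dependent) triples, as a
# syntactic parser might emit them. Not data from the paper.
TRIPLES = [
    ("play", "obj", "guitar"),
    ("play", "obj", "piano"),
    ("play", "obj", "match"),
    ("tune", "obj", "guitar"),
    ("tune", "obj", "piano"),
    ("win", "obj", "match"),
]


def contexts_by_word(triples):
    """Map each word to the set of syntactic contexts it was observed in."""
    contexts = defaultdict(set)
    for head, rel, dep in triples:
        contexts[head].add((rel, dep))          # head seen with its dependent
        contexts[dep].add((rel + "_of", head))  # inverse relation for the dependent
    return contexts


def context_graph(word, contexts):
    """Build a small graph over the neighbours of `word`.

    Assumed linking rule (not taken from the paper): two neighbours are
    connected when they share at least one context that does not involve
    `word` itself.
    """
    neighbours = sorted({other for _, other in contexts[word]})
    edges = set()
    for a, b in combinations(neighbours, 2):
        shared = {c for c in contexts[a] & contexts[b] if c[1] != word}
        if shared:
            edges.add((a, b))
    return neighbours, edges


contexts = contexts_by_word(TRIPLES)
nodes, edges = context_graph("play", contexts)
print(nodes)
print(edges)
```

On this toy input, "guitar" and "piano" end up linked because both are also objects of "tune", while "match" stays isolated. That hints at how groups of contexts could separate two sense trends of "play" (instrument versus game), although the actual model relies on its own graph construction and projection.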
Taxonomy
Topics: Natural Language Processing Techniques
