# Specific polysemy of the brief sapiential units

**Authors:** Marie-Christine Bornes-Varol (CERMOM EA 4091), Marie-Sol Ortola (MSH, Lorraine), Jean-Daniel Gronoff

arXiv: 1905.11836 · 2019-05-29

## TL;DR

This paper describes how the Aliento database is built, focusing on how polysemous brief sapiential units (short wisdom sayings such as proverbs and maxims) are tagged across languages so that the texts can be analysed and compared.

## Contribution

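It explains how the specific polysemy of brief sapiential units is tagged in a multilingual corpus, and how this tagging copes with the complexity arising from the type of phrases, the differences between languages, and the kind of information the authors want to see emerge.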

## Key findings

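- Correctly tagging the specific polysemy of brief sapiential units is an important step in preparing the corpus text.
- The complexity of the database stems from the type of phrases, the differences between languages, and the type of information the authors want to see emerge.
- The tagged corpus is the input for computing the similarities and the posterity of the units.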
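
To make the link between tagging and similarity concrete, here is a minimal sketch of one way sense tags could feed a similarity score. It is not the Aliento method: the `SapientialUnit` class, the sense labels, and the choice of Jaccard similarity are all assumptions made for illustration, and the example tags are invented rather than taken from the database.

```python
from dataclasses import dataclass, field
from typing import FrozenSet


@dataclass(frozen=True)
class SapientialUnit:
    """Hypothetical record for one brief sapiential unit."""
    text: str
    language: str
    # A polysemous unit carries several sense tags at once.
    senses: FrozenSet[str] = field(default_factory=frozenset)


def jaccard(a: SapientialUnit, b: SapientialUnit) -> float:
    """Share of sense tags the two units have in common (0.0 to 1.0)."""
    union = a.senses | b.senses
    if not union:
        # Two untagged units give no evidence of similarity.
        return 0.0
    return len(a.senses & b.senses) / len(union)


# Illustrative tags only (assumed, not drawn from Aliento).
still_waters = SapientialUnit(
    "Still waters run deep", "en",
    frozenset({"appearance_vs_reality", "caution"}),
)
agua_mansa = SapientialUnit(
    "Del agua mansa me libre Dios", "es",
    frozenset({"appearance_vs_reality", "caution", "distrust"}),
)
stitch = SapientialUnit(
    "A stitch in time saves nine", "en",
    frozenset({"foresight", "caution"}),
)

cross_language_score = jaccard(still_waters, agua_mansa)
same_language_score = jaccard(still_waters, stitch)
```

Because the score depends only on the tags, two units in different languages can be compared without any shared vocabulary, which is why tagging every sense of a polysemous unit matters before similarities are computed.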

## Abstract

In this paper we explain how we deal with the problems related to the constitution of the Aliento database, whose complexity stems from the type of phrases we work with, the differences between languages, and the type of information we want to see emerge. The correct tagging of the specific polysemy of brief sapiential units is an important step in preparing the text within the corpus, which is then used to compute the similarities and the posterity of the units.

---
Source: https://tomesphere.com/paper/1905.11836