Specific polysemy of the brief sapiential units
Marie-Christine Bornes-Varol (CERMOM EA 4091), Marie-Sol Ortola (MSH, Lorraine), Gronoff Jean-Daniel

TL;DR
This paper discusses the development of the Aliento database, focusing on tagging polysemous brief sapiential units across languages to facilitate text analysis and comparison.
Contribution
It introduces a method for accurately tagging polysemy in brief sapiential units within a multilingual corpus, addressing linguistic and informational complexities.
Findings
Effective tagging improves the analysis of sapiential units.
Multilingual differences pose challenges that are addressed by the proposed method.
Enhanced corpus preparation supports better similarity computations.
Abstract
In this paper we explain how we deal with the problems related to the constitution of the Aliento database, the complexity of which has to do with the type of phrases we work with, the differences between languages, the type of information we want to see emerge. The correct tagging of the specific polysemy of brief sapiential units is an important step in the preparation of the text within the corpus which will be submitted to compute similarities and posterity of the units.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRenaissance Literature and Culture
