Le sens de la famille : analyse du vocabulaire de la parent{\'e} par les plongements de mots
Ludovic Tanguy (CLLE), C\'ecile Fabre (CLLE), Nabil Hathout (UT, CNRS,, CLLE), Lydia-Mai Ho-Dac (CLLE)

TL;DR
This paper analyzes the French family relationship vocabulary using word embeddings to understand how these terms relate and organize based on corpus data, revealing underlying semantic features.
Contribution
It introduces a corpus-based method using word embeddings to analyze the structured vocabulary of family relationships in French.
Findings
Distributional analysis captures features like descent, alliance, siblings, genre.
Vocabulary organization varies across different corpora.
Word embeddings reveal semantic relationships among family terms.
Abstract
In this study, we propose a corpus analysis of an area of the French lexicon that is both dense and highly structured: the vocabulary of family relationships. Starting with a lexicon of 25 nouns designating the main relationships (son, cousin, mother, grandfather, sister-in-law etc.), we examine how these terms are positioned in relation to each other through distributional analyses based on the use of these terms in corpora. We show that distributional information can capture certain features that organize this vocabulary (descent, alliance, siblings, genre), in ways that vary according to the different corpora compared.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLinguistics and Discourse Analysis · French Language Learning Methods
