Social Analysis of Young Basque Speaking Communities in Twitter
J. Fernandez de Landa, R. Agerri

TL;DR
This study combines NLP and social science methods to analyze demographic traits and social communities of young Basque Twitter users through large-scale tweet processing.
Contribution
It introduces a novel approach that integrates demographic inference with social network analysis, applying deep-learning NLP techniques to Basque tweets.
Findings
Successful identification of young Basque Twitter users
Detection of distinct social communities based on shared content
Demographic and social insights derived from automated tweet analysis
Abstract
In this paper we take into account both social and linguistic aspects to perform demographic analysis by processing a large number of tweets in the Basque language. The study of demographic characteristics and social relationships is approached by applying machine learning and modern deep-learning Natural Language Processing (NLP) techniques, combining social sciences with automatic text processing. More specifically, our main objective is to combine demographic inference and social analysis in order to detect young Basque Twitter users and to identify the communities that arise from their relationships or shared content. This social and demographic analysis will be based entirely on the automatically collected tweets, using NLP to convert unstructured textual information into interpretable knowledge.
