Audience Engagement with Arabic Women's Social Empowerment and Wellbeing: A Decadal Corpus
Wajdi Zaghouani, Mabrouka Bessghaier, MD. Rafiul Biswas, Shimaa Amer Ibrahim

TL;DR
This paper introduces the Arabic Women and Society Corpus, a decade-long collection of Arabic Facebook posts on women's empowerment, enabling large-scale analysis of social discourse and engagement in the Arab world.
Contribution
It provides a comprehensive, cleaned, and annotated dataset of social media posts related to women's issues across 77 countries over ten years.
Findings
Enables analysis of gender discourse and social reform in Arabic social media.
Provides insights into audience sentiment and emotional engagement.
Supports research in NLP, social science, and digital communication.
Abstract
This paper presents the Arabic Women and Society Corpus, a ten-year collection of 252,487 public Arabic Facebook posts related to women's empowerment and social wellbeing. The corpus was collected from 51,660 pages across 77 countries between 2013 and 2024, resulting in more than 267 million user interactions. Each post includes engagement metrics such as shares, comments, and emotional reactions, providing a unique view of audience sentiment and social attention. The data were processed using an automated pipeline with language identification, normalization, and metadata cleaning to ensure reliability and reproducibility. The corpus enables large-scale analysis of gender discourse, social reform, and emotional engagement across Arabic dialects. It supports research in Arabic natural language processing, computational social science, and digital communication studies. The dataset and…
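The abstract names language identification and normalization as pipeline steps but does not spell them out here. The following is only a minimal sketch of what such steps might look like, not the authors' pipeline. The function names, the 0.5 threshold, and the specific normalization rules (stripping diacritics and tatweel, unifying alef variants) are all assumptions chosen for illustration. The script-ratio check stands in for a real language identifier, which would more plausibly be a trained model.

```python
import re
import unicodedata

# Assumed rule set: common Arabic diacritics (tashkeel) plus superscript alef.
_DIACRITICS = re.compile("[\u064B-\u0652\u0670]")
# Tatweel (kashida) is a purely typographic elongation character.
_TATWEEL = "\u0640"
# Alef with madda, hamza above, or hamza below; mapped to bare alef.
_ALEF_VARIANTS = re.compile("[\u0622\u0623\u0625]")
# Basic Arabic letter range, used by the script-ratio heuristic below.
_ARABIC_LETTER = re.compile("[\u0621-\u064A]")


def normalize_arabic(text: str) -> str:
    """Apply a hypothetical, lossy normalization to Arabic text."""
    text = unicodedata.normalize("NFKC", text)  # fold presentation forms
    text = _DIACRITICS.sub("", text)            # drop short-vowel marks
    text = text.replace(_TATWEEL, "")           # drop elongation
    text = _ALEF_VARIANTS.sub("\u0627", text)   # unify alef variants
    return re.sub(r"\s+", " ", text).strip()    # collapse whitespace


def arabic_ratio(text: str) -> float:
    """Fraction of alphabetic characters that are basic Arabic letters."""
    letters = [ch for ch in text if ch.isalpha()]
    if not letters:
        return 0.0
    arabic = sum(1 for ch in letters if _ARABIC_LETTER.match(ch))
    return arabic / len(letters)


def is_probably_arabic(text: str, threshold: float = 0.5) -> bool:
    """Crude stand-in for language identification (threshold is assumed)."""
    return arabic_ratio(text) >= threshold
```

A filter built on these helpers would keep a post only if `is_probably_arabic(post)` holds and would store `normalize_arabic(post)` next to the raw text, so that dialect-specific spelling remains available for later analysis.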
