Mapping Language Literacy At Scale: A Case Study on Facebook
Yu-Ru Lin, Shaomei Wu, Winter Mason

TL;DR
This study develops a large-scale, data-driven method to estimate language literacy levels across 160+ countries using Facebook data, revealing gender and regional disparities linked to socioeconomic factors.
Contribution
It introduces a novel approach to measure population-level literacy at scale using social media data, especially in low-resource settings with limited official data.
Findings
Women show higher literacy than men in many countries.
Significant regional literacy gaps exist within countries.
Online literacy disparities are linked to offline socioeconomic inequalities.
Abstract
Literacy is one of the most fundamental skills for people to access and navigate today's digital environment. This work systematically studies the language literacy skills of online populations for more than 160 countries and regions across the world, including many low-resourced countries where official literacy data are particularly sparse. Leveraging public data on Facebook, we develop a population-level literacy estimate for the online population that is based on aggregated and de-identified public posts written by adult Facebook users globally, significantly improving both the coverage and resolution of existing literacy tracking data. We found that, on Facebook, women collectively show higher language literacy than men in many countries, but substantial gaps remain in Africa and Asia. Further, our analysis reveals a considerable regional gap within a country that is associated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Media and Politics · Media Influence and Politics · Multilingual Education and Policy
