Demographic Inference from Social Media Data with Multimodal Foundation Models: Strategies, Evaluation, and Benchmarking
Hao Yang, Angela Yao, Eric Chang, Hexiang Wang

TL;DR
This paper demonstrates how GPT-5, a multimodal foundation model, can accurately infer demographic attributes such as age, gender, and race from social media data by integrating text and images, outperforming existing methods.
Contribution
It introduces a novel multimodal framework leveraging GPT-5 for demographic inference, enhancing accuracy and robustness across multiple attributes with minimal task-specific training.
Findings
Achieves 0.90 accuracy for age, 0.98 for gender, 0.85 for race
Incorporating textual and visual cues improves inference performance
Outperforms existing models with equivalent inputs
Abstract
Demographic inference plays a crucial role in understanding the representativeness and equity of social media-based research. However, existing methods typically rely on a single modality, such as text, image, or network, and are limited to predicting one or two demographic attributes, constraining their generalizability and robustness across populations. This study leverages GPT-5, a state-of-the-art multimodal foundation model, to infer age, gender, and race from social media profiles. Using a dataset of 263 publicly available X (formerly Twitter) users, we design a progressive multimodal framework that incrementally incorporates usernames, profile descriptions, tweets, and profile images to examine how each information source contributes to inference accuracy. Results show a consistent improvement across all conditions, with the inclusion of textual and visual cues substantially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAuthorship Attribution and Profiling · Mental Health via Writing · Face recognition and analysis
