From Millions of Tweets to Actionable Insights: Leveraging LLMs for User Profiling
Vahid Rahimzadeh, Ali Hamzehpour, Azadeh Shakery, Masoud Asadpour

TL;DR
This paper presents a novel LLM-based user profiling method for social media that produces interpretable profiles with minimal labeled data, outperforming existing techniques in flexibility and accuracy.
Contribution
Introduces a two-stage LLM-based approach using domain-defining statements for adaptable, interpretable user profiling with minimal supervision.
Findings
Outperforms state-of-the-art methods by 9.8%
Generates natural language profiles that are interpretable
Reduces reliance on large labeled datasets
Abstract
Social media user profiling through content analysis is crucial for tasks like misinformation detection, engagement prediction, hate speech monitoring, and user behavior modeling. However, existing profiling techniques, including tweet summarization, attribute-based profiling, and latent representation learning, face significant limitations: they often lack transferability, produce non-interpretable features, require large labeled datasets, or rely on rigid predefined categories that limit adaptability. We introduce a novel large language model (LLM)-based approach that leverages domain-defining statements, which serve as key characteristics outlining the important pillars of a domain as foundations for profiling. Our two-stage method first employs semi-supervised filtering with a domain-specific knowledge base, then generates both abstractive (synthesized descriptions) and extractive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Sentiment Analysis and Opinion Mining · Authorship Attribution and Profiling
