VASTU: Value-Aligned Social Toolkit for Online Content Curation
Agam Goyal, Xianyang Zhan, Charlotte Lambert, Koustuv Saha, Eshwar Chandrasekharan

TL;DR
VASTU is a benchmark dataset and evaluation framework for detecting community-valued content in online social communities, demonstrating that community-specific models, especially fine-tuned transformers, outperform global models in content curation tasks.
Contribution
Introduces VASTU, a comprehensive benchmark with a dataset and evaluation framework for community-specific content valuation, enabling systematic comparison of detection approaches.
Findings
Community-specific models outperform global models.
Fine-tuned transformers achieve the highest AUROC (0.72).
Chain-of-thought prompting offers no benefit in this task.
Abstract
Detecting what content communities value is a foundational challenge for social computing systems -- from feed curation and content ranking to moderation tools and personalized recommendation systems. Yet existing approaches remain fragmented across methodological paradigms, and it remains unclear which methods best capture community-specific notions of value. We introduce VASTU (Value-Aligned Social Toolkit for Online Content Curation), a benchmark and evaluation framework for systematically comparing approaches to detecting community-valued content. VASTU includes a dataset of 75,000 comments from 15 diverse Reddit communities, annotated with community approval labels and rich linguistic features. Using VASTU, we evaluate feature-based models, transformers, prompted and fine-tuned language models under global versus community-specific training regimes. We find that community-specific…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Advanced Graph Neural Networks · Topic Modeling
