Density-Guided Response Optimization: Community-Grounded Alignment via Implicit Acceptance Signals
Patrick Gerard, Svitlana Volkova

TL;DR
This paper introduces Density-Guided Response Optimization (DGRO), a novel method that aligns language models to community norms by leveraging implicit acceptance signals derived from community content, without requiring explicit preference labels.
Contribution
The paper proposes DGRO, a new approach that uses geometric structure in representation space to align models with community norms based on implicit acceptance signals, especially useful in annotation-scarce settings.
Findings
DGRO recovers pairwise community judgments from geometric density.
Models aligned with DGRO produce responses preferred by humans and experts.
DGRO outperforms supervised and prompt-based baselines in diverse community settings.
Abstract
Language models deployed in online communities must adapt to norms that vary across social, cultural, and domain-specific contexts. Prior alignment approaches rely on explicit preference supervision or predefined principles, which are effective for well-resourced settings but exclude most online communities -- particularly those without institutional backing, annotation infrastructure, or organized around sensitive topics -- where preference elicitation is costly, ethically fraught, or culturally misaligned. We observe that communities already express preferences implicitly through what content they accept, engage with, and allow to persist. We show that this acceptance behavior induces measurable geometric structure in representation space: accepted responses occupy coherent, high-density regions that reflect community-specific norms, while rejected content falls in sparser or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Mobile Crowdsensing and Crowdsourcing · Sentiment Analysis and Opinion Mining
