EMBRACE: Shaping Inclusive Opinion Representation by Aligning Implicit Conversations with Social Norms

Abeer Aldayel; Areej Alokaili

arXiv:2507.20264·cs.CL·November 11, 2025

EMBRACE: Shaping Inclusive Opinion Representation by Aligning Implicit Conversations with Social Norms

Abeer Aldayel, Areej Alokaili

PDF

Open Access

TL;DR

This paper introduces EMBRACE, a framework for evaluating and improving inclusive opinion representation in NLP models by focusing on implicit conversations and social norms, using stance modeling and alignment assessment.

Contribution

It proposes a novel alignment evaluation framework that emphasizes implicit opinions and normative social views, advancing beyond surface-level demographic inclusion methods.

Findings

01

The framework effectively models implicit opinions through stance analysis.

02

Evaluation with online learning and instruction-tuned models reveals alignment issues.

03

The approach offers a structured way to assess and enhance social norm alignment in NLP models.

Abstract

Shaping inclusive representations that embrace diversity and ensure fair participation and reflections of values is at the core of many conversation-based models. However, many existing methods rely on surface inclusion using mention of user demographics or behavioral attributes of social groups. Such methods overlook the nuanced, implicit expression of opinion embedded in conversations. Furthermore, the over-reliance on overt cues can exacerbate misalignment and reinforce harmful or stereotypical representations in model outputs. Thus, we took a step back and recognized that equitable inclusion needs to account for the implicit expression of opinion and use the stance of responses to validate the normative alignment. This study aims to evaluate how opinions are represented in NLP or computational models by introducing an alignment evaluation framework that foregrounds implicit, often…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Topic Modeling · Ethics and Social Impacts of AI