Contrastive Analysis of Linguistic Representations in Large Language Model Outputs through Structured Synthetic Data Generation and Abstracted N-gram Associations
S.A. Desimone, L. Alonso Alemany

TL;DR
This paper introduces a framework for detecting subtle social biases in text by generating controlled synthetic data and analyzing linguistic patterns with statistical methods.
Contribution
It proposes a novel methodology that characterizes nuanced bias expressions in contextualized text across genres, beyond simple word lists.
Findings
Identifies biased linguistic signals using a variant of pointwise mutual information.
Prioritizes text segments with high biased signal concentration for expert assessment.
Works with diverse textual genres and contextualized data.
Abstract
We present a methodological framework to discover linguistic and discursive patterns associated to different social groups through contrastive synthetic text generation and statistical analysis. In contrast with previous approaches, we aim to characterize subtle expressions of bias, instead of diagnosing bias through a pre-determined list of words or expressions. We are also working with contextualized data instead of isolated words or sentences. Our methodology applies to textual productions in any genre, encompassing narrative, task-oriented or dialogic. Contextualized data are generated using controlled combinations of situational scenarios and group markers, creating minimal pairs of texts that differ only in the referenced group while maintaining comparable narrative conditions. To facilitate robust analysis, linguistic forms are generalized and associations between linguistic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
