ML-EAT: A Multilevel Embedding Association Test for Interpretable and Transparent Social Science
Robert Wolfe, Alexis Hiniker, Bill Howe

TL;DR
ML-EAT introduces a multilevel, interpretable approach to measuring bias in language models, providing detailed insights into bias patterns and improving transparency in social science applications.
Contribution
The paper presents ML-EAT, a novel multilevel embedding association test that enhances interpretability and transparency of bias measurement in language models.
Findings
EAT patterns reveal component biases in embeddings
Prompting effects are observable in zero-shot models
EAT-Maps visualize bias patterns effectively
Abstract
This research introduces the Multilevel Embedding Association Test (ML-EAT), a method designed for interpretable and transparent measurement of intrinsic bias in language technologies. The ML-EAT addresses issues of ambiguity and difficulty in interpreting the traditional EAT measurement by quantifying bias at three levels of increasing granularity: the differential association between two target concepts with two attribute concepts; the individual effect size of each target concept with two attribute concepts; and the association between each individual target concept and each individual attribute concept. Using the ML-EAT, this research defines a taxonomy of EAT patterns describing the nine possible outcomes of an embedding association test, each of which is associated with a unique EAT-Map, a novel four-quadrant visualization for interpreting the ML-EAT. Empirical analysis of static…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsQualitative Comparative Analysis Research
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Attention Dropout · Residual Connection · Multi-Head Attention · Cosine Annealing · Weight Decay · Adam · Byte Pair Encoding
