Discovering Differences in the Representation of People using Contextualized Semantic Axes
Li Lucy, Divya Tadimeti, David Bamman

TL;DR
This paper extends the use of semantic axes to contextualized BERT embeddings to better identify social and temporal differences in representations of people, revealing evolving attitudes over time.
Contribution
It introduces a method for constructing contextualized semantic axes with BERT, addressing limitations of static embeddings in capturing nuanced social and temporal differences.
Findings
Contextualized axes distinguish differences in occupation representations.
References to women have become more negative over fourteen years.
The method effectively captures social and temporal semantic shifts.
Abstract
A common paradigm for identifying semantic differences across social and temporal contexts is the use of static word embeddings and their distances. In particular, past work has compared embeddings against "semantic axes" that represent two opposing concepts. We extend this paradigm to BERT embeddings, and construct contextualized axes that mitigate the pitfall where antonyms have neighboring representations. We validate and demonstrate these axes on two people-centric datasets: occupations from Wikipedia, and multi-platform discussions in extremist, men's communities over fourteen years. In both studies, contextualized semantic axes can characterize differences among instances of the same word type. In the latter study, we show that references to women and the contexts around them have become more detestable over time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Wikis in Education and Collaboration
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Multi-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Dropout · WordPiece · Dense Connections · Softmax
