OSCaR: Orthogonal Subspace Correction and Rectification of Biases in Word Embeddings
Sunipa Dev, Tao Li, Jeff M Phillips, Vivek Srikumar

TL;DR
OSCaR is a novel bias mitigation method for word embeddings that effectively reduces stereotypes while preserving semantic information by disentangling biased associations rather than removing entire concepts.
Contribution
The paper introduces OSCaR, a new approach that balances bias reduction and information retention by focusing on disentangling biased associations in embeddings.
Findings
OSCaR effectively reduces gender bias in word embeddings.
The method preserves semantic information better than linear projection techniques.
Experimental results demonstrate a favorable bias-information tradeoff.
Abstract
Language representations are known to carry stereotypical biases and, as a result, lead to biased predictions in downstream tasks. While existing methods are effective at mitigating biases by linear projection, such methods are too aggressive: they not only remove bias, but also erase valuable information from word embeddings. We develop new measures for evaluating specific information retention that demonstrate the tradeoff between bias removal and information retention. To address this challenge, we propose OSCaR (Orthogonal Subspace Correction and Rectification), a bias-mitigating method that focuses on disentangling biased associations between concepts instead of removing concepts wholesale. Our experiments on gender biases show that OSCaR is a well-balanced approach that ensures that semantic information is retained in the embeddings and bias is also effectively mitigated.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification
MethodsOSCAR
