Gender Stereotype Reinforcement: Measuring the Gender Bias Conveyed by   Ranking Algorithms

Alessandro Fabris; Alberto Purpura; Gianmaria Silvello; Gian Antonio; Susto

arXiv:2009.01334·cs.CY·September 4, 2020

Gender Stereotype Reinforcement: Measuring the Gender Bias Conveyed by Ranking Algorithms

Alessandro Fabris, Alberto Purpura, Gianmaria Silvello, Gian Antonio, Susto

PDF

Open Access

TL;DR

This paper introduces GSR, a novel measure to quantify gender stereotype reinforcement in search engine rankings, validating it on synthetic and real data, and analyzing how different algorithms and debiasing methods influence gender bias.

Contribution

The paper presents GSR, the first IR-specific measure for quantifying gender stereotype reinforcement in ranking algorithms, and evaluates bias across various models and debiasing techniques.

Findings

01

GSR effectively measures gender bias in search rankings.

02

Ranking algorithms inherit biases from word embeddings.

03

Debiasing methods reduce GSR but may impact performance.

Abstract

Search Engines (SE) have been shown to perpetuate well-known gender stereotypes identified in psychology literature and to influence users accordingly. Similar biases were found encoded in Word Embeddings (WEs) learned from large online corpora. In this context, we propose the Gender Stereotype Reinforcement (GSR) measure, which quantifies the tendency of a SE to support gender stereotypes, leveraging gender-related information encoded in WEs. Through the critical lens of construct validity, we validate the proposed measure on synthetic and real collections. Subsequently, we use GSR to compare widely-used Information Retrieval ranking algorithms, including lexical, semantic, and neural models. We check if and how ranking algorithms based on WEs inherit the biases of the underlying embeddings. We also consider the most common debiasing approaches for WEs proposed in the literature and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Authorship Attribution and Profiling · Media Influence and Politics