Gender bias in (non)-contextual clinical word embeddings for   stereotypical medical categories

Gizem Sogancioglu; Fabian Mijsters; Amar van Uden; Jelle Peperzak

arXiv:2208.01341·cs.CL·August 9, 2022·5 cites

Gender bias in (non)-contextual clinical word embeddings for stereotypical medical categories

Gizem Sogancioglu, Fabian Mijsters, Amar van Uden, Jelle Peperzak

PDF

Open Access

TL;DR

This paper investigates gender bias in clinical word embeddings across three medical categories, revealing that both contextualized and non-contextualized embeddings exhibit bias, with potential implications for medical applications.

Contribution

It provides a comparative analysis of gender bias in clinical embeddings, highlighting biases in both contextualized and non-contextualized models and their inconsistency with medical literature.

Findings

01

BioWordVec shows higher gender bias than clinical-BERT.

02

Both embeddings are biased towards sensitive gender groups.

03

Biases in embeddings conflict with medical literature.

Abstract

Clinical word embeddings are extensively used in various Bio-NLP problems as a state-of-the-art feature vector representation. Although they are quite successful at the semantic representation of words, due to the dataset - which potentially carries statistical and societal bias - on which they are trained, they might exhibit gender stereotypes. This study analyses gender bias of clinical embeddings on three medical categories: mental disorders, sexually transmitted diseases, and personality traits. To this extent, we analyze two different pre-trained embeddings namely (contextualized) clinical-BERT and (non-contextualized) BioWordVec. We show that both embeddings are biased towards sensitive gender groups but BioWordVec exhibits a higher bias than clinical-BERT for all three categories. Moreover, our analyses show that clinical embeddings carry a high degree of bias for some medical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSex and Gender in Healthcare