# Quantifying Bias in Hierarchical Category Systems

**Authors:** Katie Warburton, Charles Kemp, Yang Xu, Lea Frermann

PMC · DOI: 10.1162/opmi_a_00121 · Open Mind : Discoveries in Cognitive Science · 2024-03-01

## TL;DR

This paper introduces methods to measure bias in hierarchical category systems, using library classifications to show biases favoring western concepts and male authors.

## Contribution

The paper presents general methods for quantifying bias in hierarchical category systems, demonstrated through library classification analysis.

## Key findings

- Library categories related to religion show more western bias than those for literature or history.
- Books by men are more broadly distributed in library systems than those by women.
- Dewey Decimal Classification shows more bias than the Library of Congress Classification.

## Abstract

Categorization is ubiquitous in human cognition and society, and shapes how we perceive and understand the world. Because categories reflect the needs and perspectives of their creators, no category system is entirely objective, and inbuilt biases can have harmful social consequences. Here we propose methods for measuring biases in hierarchical systems of categories, a common form of category organization with multiple levels of abstraction. We illustrate these methods by quantifying the extent to which library classification systems are biased in favour of western concepts and male authors. We analyze a large library data set including more than 3 million books organized into thousands of categories, and find that categories related to religion show greater western bias than do categories related to literature or history, and that books written by men are distributed more broadly across library classification systems than are books written by women. We also find that the Dewey Decimal Classification shows a greater level of bias than does the Library of Congress Classification. Although we focus on library classification as a case study, our methods are general, and can be used to measure biases in both natural and institutional category systems across a range of domains.1

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10898782/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10898782/full.md

## References

72 references — full list in the complete paper: https://tomesphere.com/paper/PMC10898782/full.md

---
Source: https://tomesphere.com/paper/PMC10898782