Leveraging Ontologies to Document Bias in Data

Mayra Russo; Maria-Esther Vidal

arXiv:2407.00509·cs.AI·December 1, 2025

Leveraging Ontologies to Document Bias in Data

Mayra Russo, Maria-Esther Vidal

PDF

Open Access

TL;DR

This paper introduces the Doc-BiasO ontology, a formal resource to describe and measure biases in machine learning pipelines, aiming to improve understanding and documentation of bias in AI systems.

Contribution

The paper presents the development of the Doc-BiasO ontology, integrating bias definitions, measures, and related terminology to enhance bias documentation and interoperability in AI research.

Findings

01

Created an ontology for bias in ML pipelines

02

Reused existing vocabularies for interoperability

03

Aims to clarify bias terminology in AI

Abstract

Machine Learning (ML) systems are capable of reproducing and often amplifying undesired biases. This puts emphasis on the importance of operating under practices that enable the study and understanding of the intrinsic characteristics of ML pipelines, prompting the emergence of documentation frameworks with the idea that ``any remedy for bias starts with awareness of its existence''. However, a resource that can formally describe these pipelines in terms of biases detected is still amiss. To fill this gap, we present the Doc-BiasO ontology, a resource that aims to create an integrated vocabulary of biases defined in the \textit{fair-ML} literature and their measures, as well as to incorporate relevant terminology and the relationships between them. Overseeing ontology engineering best practices, we re-use existing vocabulary on machine learning and AI, to foster knowledge sharing and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSemantic Web and Ontologies

MethodsOntology