Leveraging Ontologies to Document Bias in Data
Mayra Russo, Maria-Esther Vidal

TL;DR
This paper introduces the Doc-BiasO ontology, a formal resource to describe and measure biases in machine learning pipelines, aiming to improve understanding and documentation of bias in AI systems.
Contribution
The paper presents the development of the Doc-BiasO ontology, integrating bias definitions, measures, and related terminology to enhance bias documentation and interoperability in AI research.
Findings
Created an ontology for bias in ML pipelines
Reused existing vocabularies for interoperability
Aims to clarify bias terminology in AI
Abstract
Machine Learning (ML) systems are capable of reproducing and often amplifying undesired biases. This puts emphasis on the importance of operating under practices that enable the study and understanding of the intrinsic characteristics of ML pipelines, prompting the emergence of documentation frameworks with the idea that ``any remedy for bias starts with awareness of its existence''. However, a resource that can formally describe these pipelines in terms of biases detected is still amiss. To fill this gap, we present the Doc-BiasO ontology, a resource that aims to create an integrated vocabulary of biases defined in the \textit{fair-ML} literature and their measures, as well as to incorporate relevant terminology and the relationships between them. Overseeing ontology engineering best practices, we re-use existing vocabulary on machine learning and AI, to foster knowledge sharing and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies
MethodsOntology
