Assessing Trustworthiness of AI Training Dataset using Subjective Logic -- A Use Case on Bias

Koffi Ismael Ouattara; Ioannis Krontiris; Theo Dimitrakos; Frank Kargl

arXiv:2508.13813·cs.LG·May 12, 2026

Assessing Trustworthiness of AI Training Dataset using Subjective Logic -- A Use Case on Bias

Koffi Ismael Ouattara, Ioannis Krontiris, Theo Dimitrakos, Frank Kargl

PDF

TL;DR

This paper introduces a formal framework using Subjective Logic to evaluate the trustworthiness of AI training datasets, focusing on properties like bias with uncertainty quantification.

Contribution

It is the first to formalize dataset trustworthiness assessment at the dataset level using Subjective Logic, supporting uncertainty-aware evaluations of properties like bias.

Findings

01

Framework captures class imbalance effectively

02

Method remains interpretable and robust in federated settings

03

Experimental evaluation on traffic sign dataset demonstrates effectiveness

Abstract

As AI systems increasingly rely on training data, assessing dataset trustworthiness has become critical, particularly for properties like fairness or bias that emerge at the dataset level. Prior work has used Subjective Logic to assess trustworthiness of individual data, but not to evaluate trustworthiness properties that emerge only at the level of the dataset as a whole. This paper introduces the first formal framework for assessing the trustworthiness of AI training datasets, enabling uncertainty-aware evaluations of global properties such as bias. Built on Subjective Logic, our approach supports trust propositions and quantifies uncertainty in scenarios where evidence is incomplete, distributed, and/or conflicting. We instantiate this framework on the trustworthiness property of bias, and we experimentally evaluate it based on a traffic sign recognition dataset. The results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.