Robust ML Auditing using Prior Knowledge

Jade Garcia Bourr\'ee; Augustin Godinot; Martijn De Vos; Milos Vujasinovic; Sayan Biswas; Gilles Tredan; Erwan Le Merrer; Anne-Marie Kermarrec

arXiv:2505.04796·cs.LG·May 26, 2025

Robust ML Auditing using Prior Knowledge

Jade Garcia Bourr\'ee, Augustin Godinot, Martijn De Vos, Milos Vujasinovic, Sayan Biswas, Gilles Tredan, Erwan Le Merrer, Anne-Marie Kermarrec

PDF

Open Access 1 Video

TL;DR

This paper proposes a new method for AI regulation audits that leverages prior knowledge to prevent manipulation by platforms, ensuring more reliable and tamper-proof evaluations.

Contribution

It introduces a formal framework for manipulation-proof auditing using prior knowledge, highlighting limitations of public priors and demonstrating conditions for effective detection.

Findings

01

Platforms can hide unfairness up to a certain level before detection

02

Public priors are insufficient for preventing manipulation

03

Formal conditions enable more robust audits

Abstract

Among the many technical challenges to enforcing AI regulations, one crucial yet underexplored problem is the risk of audit manipulation. This manipulation occurs when a platform deliberately alters its answers to a regulator to pass an audit without modifying its answers to other users. In this paper, we introduce a novel approach to manipulation-proof auditing by taking into account the auditor's prior knowledge of the task solved by the platform. We first demonstrate that regulators must not rely on public priors (e.g. a public dataset), as platforms could easily fool the auditor in such cases. We then formally establish the conditions under which an auditor can prevent audit manipulations using prior knowledge about the ground truth. Finally, our experiments with two standard datasets illustrate the maximum level of unfairness a platform can hide before being detected as malicious.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Robust ML Auditing using Prior Knowledge· slideslive

Taxonomy

TopicsNeural Networks and Applications

MethodsSparse Evolutionary Training