Can We Trust AI to Govern AI? Benchmarking LLM Performance on Privacy and AI Governance Exams

Zane Witherspoon; Thet Mon Aye; YingYing Hao

arXiv:2508.09036·cs.CY·August 13, 2025

Can We Trust AI to Govern AI? Benchmarking LLM Performance on Privacy and AI Governance Exams

Zane Witherspoon, Thet Mon Aye, YingYing Hao

PDF

Open Access

TL;DR

This paper benchmarks leading large language models on privacy and AI governance exams, revealing that some models can surpass human certification standards, thus informing their potential use in high-stakes data governance roles.

Contribution

It introduces a new benchmark for evaluating LLMs on privacy and AI governance exams, highlighting their strengths and gaps in regulatory and technical knowledge.

Findings

01

Some models exceed human certification passing scores

02

Models show domain-specific strengths in privacy law and governance

03

Results inform AI readiness for high-stakes data governance

Abstract

The rapid emergence of large language models (LLMs) has raised urgent questions across the modern workforce about this new technology's strengths, weaknesses, and capabilities. For privacy professionals, the question is whether these AI systems can provide reliable support on regulatory compliance, privacy program management, and AI governance. In this study, we evaluate ten leading open and closed LLMs, including models from OpenAI, Anthropic, Google DeepMind, Meta, and DeepSeek, by benchmarking their performance on industry-standard certification exams: CIPP/US, CIPM, CIPT, and AIGP from the International Association of Privacy Professionals (IAPP). Each model was tested using official sample exams in a closed-book setting and compared to IAPP's passing thresholds. Our findings show that several frontier models such as Gemini 2.5 Pro and OpenAI's GPT-5 consistently achieve scores…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Artificial Intelligence in Healthcare and Education · Privacy, Security, and Data Protection