Can We Trust AI to Govern AI? Benchmarking LLM Performance on Privacy and AI Governance Exams
Zane Witherspoon, Thet Mon Aye, YingYing Hao

TL;DR
This paper benchmarks leading large language models on privacy and AI governance exams, revealing that some models can surpass human certification standards, thus informing their potential use in high-stakes data governance roles.
Contribution
It introduces a new benchmark for evaluating LLMs on privacy and AI governance exams, highlighting their strengths and gaps in regulatory and technical knowledge.
Findings
Some models exceed human certification passing scores
Models show domain-specific strengths in privacy law and governance
Results inform AI readiness for high-stakes data governance
Abstract
The rapid emergence of large language models (LLMs) has raised urgent questions across the modern workforce about this new technology's strengths, weaknesses, and capabilities. For privacy professionals, the question is whether these AI systems can provide reliable support on regulatory compliance, privacy program management, and AI governance. In this study, we evaluate ten leading open and closed LLMs, including models from OpenAI, Anthropic, Google DeepMind, Meta, and DeepSeek, by benchmarking their performance on industry-standard certification exams: CIPP/US, CIPM, CIPT, and AIGP from the International Association of Privacy Professionals (IAPP). Each model was tested using official sample exams in a closed-book setting and compared to IAPP's passing thresholds. Our findings show that several frontier models such as Gemini 2.5 Pro and OpenAI's GPT-5 consistently achieve scores…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Artificial Intelligence in Healthcare and Education · Privacy, Security, and Data Protection
