Supervision policies can shape long-term risk management in general-purpose AI models

Manuel Cebrian; Emilia Gomez; David Fernandez Llorca

arXiv:2501.06137 · cs.AI · June 12, 2025

TL;DR

This paper explores how different supervision policies influence long-term risk management in general-purpose AI models, revealing trade-offs between risk mitigation effectiveness and comprehensive risk coverage.

Contribution

It introduces a simulation framework to evaluate supervision policies and demonstrates their impact on risk reporting and management in diverse AI ecosystems.

Findings

1. Priority-based policies effectively mitigate high-impact risks.
2. Diversity-prioritized policies promote comprehensive risk coverage.
3. Certain policies may create feedback loops that skew risk perception.

Abstract

The rapid proliferation and deployment of General-Purpose AI (GPAI) models, including large language models (LLMs), present unprecedented challenges for AI supervisory entities. We hypothesize that these entities will need to navigate an emergent ecosystem of risk and incident reporting, likely to exceed their supervision capacity. To investigate this, we develop a simulation framework parameterized by features extracted from the diverse landscape of risk, incident, or hazard reporting ecosystems, including community-driven platforms, crowdsourcing initiatives, and expert assessments. We evaluate four supervision policies: non-prioritized (first-come, first-served), random selection, priority-based (addressing the highest-priority risks first), and diversity-prioritized (balancing high-priority risks with comprehensive coverage across risk types). Our results indicate that while…
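The four supervision policies named in the abstract can be illustrated with a minimal Python sketch. This is not the authors' simulation framework (their official code is in the repository listed under Code & Models); it is an illustration under stated assumptions. The `Report` fields, the `capacity` parameter, the priority scale, and in particular the round-robin rule used for the diversity-prioritized policy are all hypothetical choices made here for clarity.

```python
import random
from dataclasses import dataclass


@dataclass(frozen=True)
class Report:
    """Hypothetical risk report; field names are assumptions, not from the paper."""
    report_id: int
    arrival: int      # arrival order, lower means earlier
    priority: float   # higher means more urgent (assumed scale)
    risk_type: str


def first_come_first_served(queue, capacity):
    # Non-prioritized: handle reports strictly in arrival order.
    return sorted(queue, key=lambda r: r.arrival)[:capacity]


def random_selection(queue, capacity, rng):
    # Random: pick reports uniformly at random, without replacement.
    return rng.sample(list(queue), min(capacity, len(queue)))


def priority_based(queue, capacity):
    # Priority-based: handle the highest-priority reports first.
    return sorted(queue, key=lambda r: -r.priority)[:capacity]


def diversity_prioritized(queue, capacity):
    # Assumed rule: round-robin over risk types, taking the best remaining
    # report of each type, visiting types in order of their best report.
    by_type = {}
    for r in sorted(queue, key=lambda r: -r.priority):
        by_type.setdefault(r.risk_type, []).append(r)
    selected = []
    while len(selected) < capacity and any(by_type.values()):
        open_types = [t for t in by_type if by_type[t]]
        for t in sorted(open_types, key=lambda t: -by_type[t][0].priority):
            if len(selected) == capacity:
                break
            selected.append(by_type[t].pop(0))
    return selected


# Toy queue: five reports, supervisor can process three per step.
queue = [
    Report(1, 0, 0.2, "bias"),
    Report(2, 1, 0.9, "security"),
    Report(3, 2, 0.8, "security"),
    Report(4, 3, 0.5, "privacy"),
    Report(5, 4, 0.7, "security"),
]

def ids(reports):
    return [r.report_id for r in reports]

print(ids(first_come_first_served(queue, 3)))  # [1, 2, 3]
print(ids(priority_based(queue, 3)))           # [2, 3, 5]
print(ids(diversity_prioritized(queue, 3)))    # [2, 4, 1]
print(len(random_selection(queue, 3, random.Random(0))))  # 3
```

In this toy queue, the priority-based selection consists entirely of "security" reports, while the diversity-prioritized selection covers all three risk types at the cost of processing lower-priority items. This mirrors, in miniature, the trade-off between mitigation effectiveness and coverage that the summary describes.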

Code & Models

Repositories

manuelcebrianramos/llm_supervision_policies (Official)

Taxonomy

Topics: Artificial Intelligence in Healthcare and Education · Machine Learning in Healthcare