Supervision policies can shape long-term risk management in general-purpose AI models
Manuel Cebrian, Emilia Gomez, David Fernandez Llorca

TL;DR
This paper explores how different supervision policies influence long-term risk management in general-purpose AI models, revealing trade-offs between risk mitigation effectiveness and comprehensive risk coverage.
Contribution
It introduces a simulation framework to evaluate supervision policies and demonstrates their impact on risk reporting and management in diverse AI ecosystems.
Findings
Priority-based policies effectively mitigate high-impact risks.
Diversity-prioritized policies promote comprehensive risk coverage.
Certain policies may create feedback loops that skew risk perception.
Abstract
The rapid proliferation and deployment of General-Purpose AI (GPAI) models, including large language models (LLMs), present unprecedented challenges for AI supervisory entities. We hypothesize that these entities will need to navigate an emergent ecosystem of risk and incident reporting, likely to exceed their supervision capacity. To investigate this, we develop a simulation framework parameterized by features extracted from the diverse landscape of risk, incident, or hazard reporting ecosystems, including community-driven platforms, crowdsourcing initiatives, and expert assessments. We evaluate four supervision policies: non-prioritized (first-come, first-served), random selection, priority-based (addressing the highest-priority risks first), and diversity-prioritized (balancing high-priority risks with comprehensive coverage across risk types). Our results indicate that while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Machine Learning in Healthcare
