AI Safety Frameworks Should Include Procedures for Model Access   Decisions

Edward Kembery; Tom Reed

arXiv:2411.10547·cs.CY·December 3, 2024

AI Safety Frameworks Should Include Procedures for Model Access Decisions

Edward Kembery, Tom Reed

PDF

Open Access

TL;DR

This paper advocates for AI safety frameworks to incorporate transparent, empirical procedures for making model access decisions, aiming to better manage risks associated with different user groups and access levels.

Contribution

It introduces Responsible Access Policies (RAPs), a structured approach for AI companies to evaluate and regulate model access based on empirical capability assessments and risk profiles.

Findings

01

Proposes RAPs as a new framework for model access decisions

02

Highlights the need for empirical evaluation of model capabilities

03

Emphasizes risk assessment for different user categories

Abstract

The downstream use cases, benefits, and risks of AI models depend significantly on what sort of access is provided to the model, and who it is provided to. Though existing safety frameworks and AI developer usage policies recognise that the risk posed by a given model depends on the level of access provided to a given audience, the procedures they use to make decisions about model access are ad hoc, opaque, and lacking in empirical substantiation. This paper consequently proposes that frontier AI companies build on existing safety frameworks by outlining transparent procedures for making decisions about model access, which we term Responsible Access Policies (RAPs). We recommend that, at a minimum, RAPs should include the following: i) processes for empirically evaluating model capabilities given different styles of access, ii) processes for assessing the risk profiles of different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSafety Systems Engineering in Autonomy · Artificial Intelligence in Healthcare and Education