The Elephant in the Room -- Why AI Safety Demands Diverse Teams

David Rostcheck; Lara Scheibling

arXiv:2407.10254·cs.CY·July 16, 2024

The Elephant in the Room -- Why AI Safety Demands Diverse Teams

David Rostcheck, Lara Scheibling

PDF

Open Access

TL;DR

This paper proposes treating AI safety and alignment as a social science problem, emphasizing the importance of diverse teams and social science tools to better understand and address AI alignment challenges.

Contribution

It introduces a novel approach to AI alignment that leverages social science methodologies and advocates for diverse teams to improve problem-solving effectiveness.

Findings

01

Social science tools can be repurposed for AI alignment.

02

Diverse teams enhance understanding of alignment challenges.

03

A three-step framework for social science-informed AI alignment.

Abstract

We consider that existing approaches to AI "safety" and "alignment" may not be using the most effective tools, teams, or approaches. We suggest that an alternative and better approach to the problem may be to treat alignment as a social science problem, since the social sciences enjoy a rich toolkit of models for understanding and aligning motivation and behavior, much of which could be repurposed to problems involving AI models, and enumerate reasons why this is so. We introduce an alternate alignment approach informed by social science tools and characterized by three steps: 1. defining a positive desired social outcome for human/AI collaboration as the goal or "North Star," 2. properly framing knowns and unknowns, and 3. forming diverse teams to investigate, observe, and navigate emerging challenges in alignment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOccupational Health and Safety Research · Ethics and Social Impacts of AI