Identifying, Evaluating, and Mitigating Risks of AI Thought Partnerships

Kerem Oktar; Katherine M. Collins; Jose Hernandez-Orallo; Diane Coyle; Stephen Cave; Adrian Weller; Ilia Sucholutsky

arXiv:2505.16899·cs.AI·May 23, 2025

Identifying, Evaluating, and Mitigating Risks of AI Thought Partnerships

Kerem Oktar, Katherine M. Collins, Jose Hernandez-Orallo, Diane Coyle, Stephen Cave, Adrian Weller, Ilia Sucholutsky

PDF

Open Access

TL;DR

This paper introduces a comprehensive framework for identifying, evaluating, and mitigating the unique risks posed by AI systems that collaborate with humans in complex reasoning tasks, emphasizing safety and societal impact.

Contribution

It presents a novel multi-level RISc framework for risk analysis of AI thought partners, along with metrics and mitigation strategies for responsible development.

Findings

01

RISc framework effectively categorizes AI collaboration risks

02

Concrete metrics enable systematic risk evaluation

03

Proposed mitigation strategies aim to prevent major harms

Abstract

Artificial Intelligence (AI) systems have historically been used as tools that execute narrowly defined tasks. Yet recent advances in AI have unlocked possibilities for a new class of models that genuinely collaborate with humans in complex reasoning, from conceptualizing problems to brainstorming solutions. Such AI thought partners enable novel forms of collaboration and extended cognition, yet they also pose major risks-including and beyond risks of typical AI tools and agents. In this commentary, we systematically identify risks of AI thought partners through a novel framework that identifies risks at multiple levels of analysis, including Real-time, Individual, and Societal risks arising from collaborative cognition (RISc). We leverage this framework to propose concrete metrics for risk evaluation, and finally suggest specific mitigation strategies for developers and policymakers.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Embodied and Extended Cognition