Mind the Gap! Pathways Towards Unifying AI Safety and Ethics Research
Dani Roytburg, Beck Miller

TL;DR
This paper analyzes the structural divide between AI safety and ethics research, revealing limited cross-disciplinary collaboration and emphasizing the need for integrated approaches to develop AI systems that are both safe and just.
Contribution
It provides a large-scale bibliometric analysis highlighting the institutional and conceptual split between AI safety and ethics communities, and advocates for integrated research strategies.
Findings
Over 80% of collaborations occur within either safety or ethics communities.
A small number of bridging papers account for most cross-field links.
Removing key bridging papers increases segregation, showing reliance on few actors for cross-disciplinary exchange.
Abstract
While much research in artificial intelligence (AI) has focused on scaling capabilities, the accelerating pace of development makes countervailing work on producing harmless, "aligned" systems increasingly urgent. Yet research on alignment has diverged along two largely parallel tracks: safety--centered on scaled intelligence, deceptive or scheming behaviors, and existential risk--and ethics--focused on present harms, the reproduction of social bias, and flaws in production pipelines. Although both communities warn of insufficient investment in alignment, they disagree on what alignment means or ought to mean. As a result, their efforts have evolved in relative isolation, shaped by distinct methodologies, institutional homes, and disciplinary genealogies. We present a large-scale, quantitative study showing the structural split between AI safety and AI ethics. Using a bibliometric and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Artificial Intelligence in Healthcare and Education · Explainable Artificial Intelligence (XAI)
