Unethical Research: How to Create a Malevolent Artificial Intelligence
Federico Pistono, Roman V. Yampolskiy

TL;DR
This paper argues that understanding how a malevolent AI could be created helps prevent its development, and it offers general guidelines for designing dangerous AI systems as an aid to safety research.
Contribution
It introduces what the authors believe to be the first published guidelines for intentionally designing malevolent AI, filling a gap in the AI safety literature.
Findings
Provides general guidelines for creating malevolent AI.
Highlights the importance of studying malicious AI for safety.
Addresses a previously unexplored aspect of AI safety research.
Abstract
Cybersecurity research involves publishing papers about malicious exploits as much as publishing information on how to design tools to protect cyber-infrastructure. It is this information exchange between ethical hackers and security experts that results in a well-balanced cyber-ecosystem. In the blooming domain of AI Safety Engineering, hundreds of papers have been published on different proposals geared at the creation of a safe machine, yet nothing, to our knowledge, has been published on how to design a malevolent machine. Availability of such information would be of great value particularly to computer scientists, mathematicians, and others who have an interest in AI safety, and who are attempting to avoid the spontaneous emergence or the deliberate creation of a dangerous AI, which can negatively affect human activities and in the worst case cause the complete obliteration of…
Taxonomy
Topics: Ethics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Neuroethics, Human Enhancement, Biomedical Innovations
