Goal Conflict in Designing an Autonomous Artificial System
Mark Muraven

TL;DR
This paper explores how artificial autonomous systems can manage conflicting goals by drawing insights from human self-regulation, emphasizing the importance for safety, stability, and alignment with human values.
Contribution
It analyzes human self-regulation mechanisms and discusses their potential application to improve goal conflict management in autonomous systems.
Findings
Understanding human goal conflict can inform autonomous system design.
Managing goal conflict is crucial for system safety and alignment.
Insights can enhance long-term stability of autonomous systems.
Abstract
Research on human self-regulation has shown that people hold many goals simultaneously and have complex self-regulation mechanisms to deal with this goal conflict. Artificial autonomous systems may also need to find ways to cope with conflicting goals. Indeed, the intricate interplay among different goals may be critical to the design as well as long-term safety and stability of artificial autonomous systems. I discuss some of the critical features of the human self-regulation system and how it might be applied to an artificial system. Furthermore, the implications of goal conflict for the reliability and stability of artificial autonomous systems and ensuring their alignment with human goals and ethics is examined.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPsychology of Moral and Emotional Judgment · Ethics and Social Impacts of AI · Behavioral Health and Interventions
