From Algorithmic Black Boxes to Adaptive White Boxes: Declarative   Decision-Theoretic Ethical Programs as Codes of Ethics

Martijn van Otterlo

arXiv:1711.06035·cs.AI·November 17, 2017·6 cites

From Algorithmic Black Boxes to Adaptive White Boxes: Declarative Decision-Theoretic Ethical Programs as Codes of Ethics

Martijn van Otterlo

PDF

Open Access

TL;DR

This paper proposes using declarative decision-theoretic ethical programs to formalize and implement codes of ethics in AI, enhancing transparency and accountability in machine ethical reasoning.

Contribution

It introduces a novel approach to formalize professional codes of ethics as declarative decision-theoretic programs for AI systems.

Findings

01

Proof-of-concept examples demonstrate the approach on toy dilemmas.

02

Application to gatekeeping domains shows potential for real-world use.

03

Enhances transparency and accountability in machine ethics.

Abstract

Ethics of algorithms is an emerging topic in various disciplines such as social science, law, and philosophy, but also artificial intelligence (AI). The value alignment problem expresses the challenge of (machine) learning values that are, in some way, aligned with human requirements or values. In this paper I argue for looking at how humans have formalized and communicated values, in professional codes of ethics, and for exploring declarative decision-theoretic ethical programs (DDTEP) to formalize codes of ethics. This renders machine ethical reasoning and decision-making, as well as learning, more transparent and hopefully more accountable. The paper includes proof-of-concept examples of known toy dilemmas and gatekeeping domains such as archives and libraries.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning