Ethics Testing: Proactive Identification of Generative AI System Harms

Shin Hwei Tan; Haibo Wang; Heng Li

arXiv:2604.22089·cs.SE·April 27, 2026

Ethics Testing: Proactive Identification of Generative AI System Harms

Shin Hwei Tan, Haibo Wang, Heng Li

PDF

TL;DR

This paper introduces ethics testing as a systematic approach to identify potential harms in content generated by AI systems, addressing a gap in current testing methodologies.

Contribution

It proposes the novel concept of ethics testing to detect unethical behaviors and harms in generative AI outputs, supported by five case studies.

Findings

01

Ethics testing can systematically identify software harms in AI-generated content.

02

Five case studies demonstrate the practical application of ethics testing.

03

The approach addresses limitations of existing fairness testing methods.

Abstract

Generative Artificial Intelligence (GAI) systems that can automatically generate content in the form of source code or other contents (e.g., images) has seen increasing popularity due to the emergence of tools such as ChatGPT which rely on Large Language Models (LLMs). Misuse of the automatically generated content can incur serious consequences due to potential harms in the generated content. Despite the importance of ensuring the quality of automatically generated content, there is little to no approach that can systematically generate tests for identifying software harms in the content generated by these GAI systems. In this article, we introduce the novel concept of ethics testing which aims to systematically generate tests for identifying software harms. Different from existing testing methodologies (e.g., fairness testing that aims to identifying software discrimination), ethics…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.