Can We Trust AI Agents? A Case Study of an LLM-Based Multi-Agent System for Ethical AI

Jos\'e Antonio Siqueira de Cerqueira; Mamia Agbese; Rebekah Rousi; Nannan Xi; Juho Hamari; and Pekka Abrahamsson

arXiv:2411.08881·cs.CY·May 19, 2025·3 cites

Can We Trust AI Agents? A Case Study of an LLM-Based Multi-Agent System for Ethical AI

Jos\'e Antonio Siqueira de Cerqueira, Mamia Agbese, Rebekah Rousi, Nannan Xi, Juho Hamari, and Pekka Abrahamsson

PDF

Open Access

TL;DR

This paper explores the use of multi-agent LLM systems to improve AI ethics trustworthiness, demonstrating their ability to generate extensive ethical code and documentation through structured multi-agent debates.

Contribution

It introduces a novel multi-agent LLM-based prototype for addressing AI ethics issues, employing structured communication and debate techniques to enhance trustworthiness in AI systems.

Findings

01

Prototype generates significantly more source code than baseline.

02

Structured multi-agent debates improve ethical code quality.

03

Practical challenges remain in code integration and dependency management.

Abstract

AI-based systems, including Large Language Models (LLM), impact millions by supporting diverse tasks but face issues like misinformation, bias, and misuse. AI ethics is crucial as new technologies and concerns emerge, but objective, practical guidance remains debated. This study examines the use of LLMs for AI ethics in practice, assessing how LLM trustworthiness-enhancing techniques affect software development in this context. Using the Design Science Research (DSR) method, we identify techniques for LLM trustworthiness: multi-agents, distinct roles, structured communication, and multiple rounds of debate. We design a multi-agent prototype LLM-MAS, where agents engage in structured discussions on real-world AI ethics issues from the AI Incident Database. We evaluate the prototype across three case scenarios using thematic analysis, hierarchical clustering, comparative (baseline)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Law, Economics, and Judicial Systems