FAIRTOPIA: Envisioning Multi-Agent Guardianship for Disrupting Unfair AI Pipelines
Athena Vakali, Ilias Dimitriadis

TL;DR
FAIRTOPIA proposes a multi-agent framework to embed adaptive fairness guardianship throughout AI pipelines, aiming to systematically address AI unfairness with human-centric, flexible, and self-refining mechanisms.
Contribution
It introduces a novel multi-layered, agent-based architecture for continuous fairness monitoring and intervention across all AI development stages.
Findings
Framework enables fairness watch in all pipeline stages
Multi-agent workflows foster systematic fairness improvements
Positioning AI fairness as a human-centric, interdisciplinary challenge
Abstract
AI models have become active decision makers, often acting without human supervision. The rapid advancement of AI technology has already caused harmful incidents that have hurt individuals and societies and AI unfairness in heavily criticized. It is urgent to disrupt AI pipelines which largely neglect human principles and focus on computational biases exploration at the data (pre), model(in), and deployment (post) processing stages. We claim that by exploiting the advances of agents technology, we will introduce cautious, prompt, and ongoing fairness watch schemes, under realistic, systematic, and human-centric fairness expectations. We envision agents as fairness guardians, since agents learn from their environment, adapt to new information, and solve complex problems by interacting with external tools and other systems. To set the proper fairness guardrails in the overall AI pipeline,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEthics and Social Impacts of AI · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
MethodsFocus · Sparse Evolutionary Training
