CIRCLE: A Framework for Evaluating AI from a Real-World Lens

Reva Schwartz; Carina Westling; Morgan Briggs; Marzieh Fadaee; Isar Nejadgholi; Matthew Holmes; Fariza Rashid; Maya Carlyle; Afaf Ta\"ik; Kyra Wilson; Peter Douglas; Theodora Skeadas; Gabriella Waters; Rumman Chowdhury; Thiago Lacerda

arXiv:2602.24055·cs.AI·March 26, 2026·2 cites

CIRCLE: A Framework for Evaluating AI from a Real-World Lens

Reva Schwartz, Carina Westling, Morgan Briggs, Marzieh Fadaee, Isar Nejadgholi, Matthew Holmes, Fariza Rashid, Maya Carlyle, Afaf Ta\"ik, Kyra Wilson, Peter Douglas, Theodora Skeadas, Gabriella Waters, Rumman Chowdhury, Thiago Lacerda

PDF

Open Access

TL;DR

CIRCLE is a comprehensive framework designed to evaluate AI systems in real-world settings by linking stakeholder concerns to measurable outcomes through a structured, lifecycle-based approach.

Contribution

It introduces a six-stage lifecycle framework that operationalizes validation, integrating qualitative insights with quantitative metrics for real-world AI evaluation.

Findings

01

Provides a systematic protocol for real-world AI assessment

02

Integrates field testing, red teaming, and longitudinal studies

03

Enables governance based on downstream effects

Abstract

This paper proposes CIRCLE, a six-stage, lifecycle-based framework to bridge the reality gap between model-centric performance metrics and AI's materialized outcomes in deployment. Current approaches such as MLOps frameworks and AI model benchmarks offer detailed insights into system stability and model capabilities, but they do not provide decision-makers outside the AI stack with systematic evidence of how these systems actually behave in real-world contexts or affect their organizations over time. CIRCLE operationalizes the Validation phase of TEVV (Test, Evaluation, Verification, and Validation) by formalizing the translation of stakeholder concerns outside the stack into measurable signals. Unlike participatory design, which often remains localized, or algorithmic audits, which are often retrospective, CIRCLE provides a structured, prospective protocol for linking context-sensitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education