CPEMH: An Agentic Framework for Prompt-Driven Behavior Evaluation and Assurance in Foundation-Model Systems for Mental Health Screening

Giuliano Lorenzoni; Ivens Portugal; Paulo Alencar; Donald Cowan (University of Waterloo)

arXiv:2605.11341·cs.AI·May 13, 2026

CPEMH: An Agentic Framework for Prompt-Driven Behavior Evaluation and Assurance in Foundation-Model Systems for Mental Health Screening

Giuliano Lorenzoni, Ivens Portugal, Paulo Alencar, Donald Cowan (University of Waterloo)

PDF

TL;DR

CPEMH is a modular agentic framework that systematically evaluates and assures prompt-driven behavior in foundation models for mental health screening, emphasizing stability, traceability, and robustness.

Contribution

It introduces an orchestrated, modular architecture for behavioral assurance in large-scale language systems applied to mental health screening tasks.

Findings

01

Demonstrated capacity to stabilize and audit model behavior in depression screening.

02

Highlighted importance of modular orchestration for behavioral assurance.

03

Emphasized stability over architectural complexity in system design.

Abstract

This paper presents CPEMH, an agentic framework designed to evaluate prompt-driven behavior in foundation-model systems operating on transcript-based datasets for mental-health screening. CPEMH serves as an engineering methodology for behavioral assurance in large-scale language systems, introducing an orchestrated architecture that autonomously performs the design, evaluation, and selection of prompt strategies, enabling systematic control of behavioral variability across contexts. Its modular agentic design, combining orchestrator, inference, and evaluation agents, ensures traceability, reproducibility, and robustness throughout the prompting lifecycle. A case study on automated depression screening from interview transcripts demonstrates the framework's capacity to stabilize and audit foundation-model behavior in conversational and clinically sensitive domains. Lessons learned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.