Improving Methodologies for Agentic Evaluations Across Domains: Leakage of Sensitive Information, Fraud and Cybersecurity Threats

Ee Wei Seah; Yongsen Zheng; Naga Nikshith; Mahran Morsidi; Gabriel Waikin Loh Matienzo; Nigel Gay; Akriti Vij; Benjamin Chua; En Qi Ng; Sharmini Johnson; Vanessa Wilfred; Wan Sie Lee; Anna Davidson; Catherine Devine; Erin Zorer; Gareth Holvey; Harry Coppock; James Walpole; Jerome Wynee; Magda Dubois; Michael Schmatz; Patrick Keane; Sam Deverett; Bill Black; Bo Yan; Bushra Sabir; Frank Sun; Hao Zhang; Harriet Farlow; Helen Zhou; Lingming Dong; Qinghua Lu; Seung Jang; Sharif Abuadbba; Simon O'Callaghan; Suyu Ma; Tom Howroyd; Cyrus Fung; Fatemeh Azadi; Isar Nejadgholi; Krishnapriya Vishnubhotla; Pulei Xiong; Saeedeh Lohrasbi; Scott Buffett; Shahrear Iqbal; Sowmya Vajjala; Anna Safont-Andreu; Luca Massarelli; Oskar van der Wal; Simon Möller; Agnes Delaborde; Joris Duguépéroux; Nicolas Rolin; Romane Gallienne; Sarah Behanzin; Tom Seimandi; Akiko Murakami; Takayuki Semitsu; Teresa Tsukiji; Angela Kinuthia; Michael Michie; Stephanie Kasaon; Jean Wangari; Hankyul Baek; Jaewon Noh; Kihyuk Nam; Sang Seo; Sungpil Shin; Taewhi Lee; Yongsu Kim

arXiv:2601.15679·cs.AI·January 23, 2026

TL;DR

This paper discusses the development of standardized methodologies for evaluating autonomous AI agents across different domains, focusing on risks like information leakage, fraud, and cybersecurity, to improve safety and reliability.

Contribution

It presents a collaborative international effort to refine best practices for agentic evaluation methodologies, emphasizing risk assessment and testing procedures for advanced AI systems.

Findings

01 Identified key methodological challenges in agentic testing

02 Developed preliminary best practices for evaluating AI risks

03 Facilitated international collaboration to advance evaluation science

Abstract

The rapid rise of autonomous AI systems and advancements in agent capabilities are introducing new risks due to reduced oversight of real-world interactions. Yet agent testing is still a nascent science. As AI agents begin to be deployed globally, it is important that they handle different languages and cultures accurately and securely. To address this, participants from the International Network for Advanced AI Measurement, Evaluation and Science, including representatives from Singapore, Japan, Australia, Canada, the European Commission, France, Kenya, South Korea, and the United Kingdom, have come together to align approaches to agentic evaluations. This is the third exercise, building on insights from two earlier joint testing exercises conducted by the Network in November 2024 and February 2025. The objective is to further refine best practices for testing…

Taxonomy

Topics: Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning