Conformal Agent Error Attribution

Naihe Feng; Yi Sui; Shiyi Hou; Ga Wu; Jesse C. Cresswell

arXiv:2605.06788·cs.LG·May 11, 2026

Conformal Agent Error Attribution

Naihe Feng, Yi Sui, Shiyi Hou, Ga Wu, Jesse C. Cresswell

PDF

1 Repo

TL;DR

This paper introduces a conformal prediction-based framework for error attribution in multi-agent systems, enabling precise error localization and automated recovery with theoretical guarantees.

Contribution

It develops new filtration-based conformal prediction algorithms tailored for sequential agent data, facilitating efficient debugging and recovery in MAS.

Findings

01

Errors can be precisely isolated using the proposed method.

02

Prediction sets enable effective rollback and error correction.

03

The approach is model-agnostic and theoretically guaranteed.

Abstract

When multi-agent systems (MAS) fail, identifying where the decisive error occurred is the first step for automated recovery to an earlier state. Error attribution remains a fundamental challenge due to the long interaction traces that large language model-based MAS generate. This paper presents a framework for error attribution based on conformal prediction (CP) which provides finite-sample, distribution-free coverage guarantees. We introduce new algorithms for filtration-based CP designed for sequential data such as agent trajectories. Unlike existing CP algorithms, our approach predicts sets that are contiguous sequences to enable efficient recovery and debugging. We verify our theoretical guarantees on a variety of agents and datasets, show that errors can be precisely isolated, then use prediction sets to rollback MAS to correct their own errors. Our overall approach is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

layer6ai-labs/conformal-agent-error-attribution
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.