CANTANTE: Optimizing Agentic Systems via Contrastive Credit Attribution
Tom Zehle

TL;DR
CANTANTE is a novel framework that improves multi-agent system performance by decomposing system rewards into per-agent signals using contrastive methods, enhancing prompt optimization across various tasks.
Contribution
The paper introduces CANTANTE, a contrastive credit attribution framework that effectively decomposes system-level rewards into local agent signals for better optimization.
Findings
CANTANTE outperforms existing optimizers on programming and reasoning benchmarks.
It achieves significant performance improvements, e.g., +18.9% on MBPP and +12.5% on GSM8K.
The credit attribution produces meaningful per-agent signals, confirmed by correlation analysis.
Abstract
LLM-based multi-agent systems have demonstrated strong performance across complex real-world tasks, such as software engineering, predictive modeling, and retrieval-augmented generation. Yet automating their configuration remains a structural challenge, as scores are available only at the system level, whereas the parameters governing agent behavior are local. We argue that optimizing these systems is fundamentally a credit-assignment problem. We therefore introduce CANTANTE, a framework that decomposes system-level rewards into per-agent update signals by contrasting rollouts of multiple joint configurations on the same query. We instantiate it for prompt optimization, treating agent prompts as learnable system parameters. We evaluate CANTANTE against GEPA and MIPROv2 on programming (MBPP), mathematical reasoning (GSM8K), and multi-hop question answering (HotpotQA). Across these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
