Should I Have Expressed a Different Intent? Counterfactual Generation for LLM-Based Autonomous Control

Amirmohammad Farzaneh; Salvatore D'Oro; Osvaldo Simeone

arXiv:2601.20090·cs.AI·January 30, 2026

Should I Have Expressed a Different Intent? Counterfactual Generation for LLM-Based Autonomous Control

Amirmohammad Farzaneh, Salvatore D'Oro, Osvaldo Simeone

PDF

Open Access

TL;DR

This paper presents a formal framework for counterfactual reasoning in LLM-based autonomous agents, enabling users to explore alternative outcomes of their high-level intents with reliability guarantees.

Contribution

It introduces conformal counterfactual generation (CCG), a novel method that models agent interactions as a structural causal model and provides probabilistic guarantees on counterfactual outcomes.

Findings

01

CCG effectively generates high-probability sets of counterfactual outcomes.

02

Demonstrated advantages over naive re-execution baselines in wireless network control.

03

Provides formal reliability guarantees for counterfactual reasoning.

Abstract

Large language model (LLM)-powered agents can translate high-level user intents into plans and actions in an environment. Yet after observing an outcome, users may wonder: What if I had phrased my intent differently? We introduce a framework that enables such counterfactual reasoning in agentic LLM-driven control scenarios, while providing formal reliability guarantees. Our approach models the closed-loop interaction between a user, an LLM-based agent, and an environment as a structural causal model (SCM), and leverages test-time scaling to generate multiple candidate counterfactual outcomes via probabilistic abduction. Through an offline calibration phase, the proposed conformal counterfactual generation (CCG) yields sets of counterfactual outcomes that are guaranteed to contain the true counterfactual outcome with high probability. We showcase the performance of CCG on a wireless…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Multimodal Machine Learning Applications · Topic Modeling