InconLens: Interactive Visual Diagnosis of Behavioral Inconsistencies in LLM-based Agentic Systems

Shuo Yan; Xiaolin Wen; Shaolun Ruan; Yanjie Zhang; Jiaming Mi; Yushi Sun; Huamin Qu; Rui Sheng

arXiv:2603.28106·cs.HC·March 31, 2026

InconLens: Interactive Visual Diagnosis of Behavioral Inconsistencies in LLM-based Agentic Systems

Shuo Yan, Xiaolin Wen, Shaolun Ruan, Yanjie Zhang, Jiaming Mi, Yushi Sun, Huamin Qu, Rui Sheng

PDF

TL;DR

InconLens is a visual analytics tool that helps developers diagnose and understand behavioral inconsistencies in LLM-based agent systems across multiple runs, improving debugging efficiency.

Contribution

The paper introduces InconLens, a novel interactive system that facilitates cross-run behavioral analysis of LLM agents using semantic alignment of informational milestones.

Findings

01

InconLens enables efficient identification of divergence points in agent behavior.

02

Expert interviews confirm InconLens's usability and analytical value.

03

Case study demonstrates improved diagnosis of failure modes.

Abstract

Large Language Model (LLM)-based agentic systems have shown growing promise in tackling complex, multi-step tasks through autonomous planning, reasoning, and interaction with external environments. However, the stochastic nature of LLM generation introduces intrinsic behavioral inconsistency: the same agent may succeed in one execution but fail in another under identical inputs. Diagnosing such inconsistencies remains a major challenge for developers, as agent execution logs are often lengthy, unstructured, and difficult to compare across runs. Existing debugging and evaluation tools primarily focus on inspecting single executions, offering limited support for understanding how and why agent behaviors diverge across repeated runs. To address this challenge, we introduce InconLens, a visual analytics system designed to support interactive diagnosis of LLM-based agentic systems with a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.