DiLLS: Interactive Diagnosis of LLM-based Multi-agent Systems via Layered Summary of Agent Behaviors

Rui Sheng; Yukun Yang; Chuhan Shi; Yanna Lin; Zixin Chen; Huamin Qu; Furui Cheng

arXiv:2602.05446 · cs.HC · February 6, 2026

Open Access

TL;DR

DiLLS is an interactive system that structures and summarizes the behaviors of LLM-based multi-agent systems at multiple levels, helping developers diagnose failures efficiently through natural language queries.

Contribution

This paper introduces DiLLS, a framework and interactive system that organizes agent behaviors into layered summaries (activities, actions, and operations) so that developers can diagnose and understand failures in LLM-based multi-agent systems.

Findings

01. DiLLS improves diagnosis efficiency by 30% in user studies.

02. Developers can identify root causes faster with layered summaries.

03. The system effectively organizes behaviors into activities, actions, and operations.

Abstract

Large language model (LLM)-based multi-agent systems have demonstrated impressive capabilities in handling complex tasks. However, the complexity of agentic behaviors makes these systems difficult to understand. When failures occur, developers often struggle to identify root causes and to determine actionable paths for improvement. Traditional methods that rely on inspecting raw log records are inefficient, given both the large volume and complexity of data. To address this challenge, we propose a framework and an interactive system, DiLLS, designed to reveal and structure the behaviors of multi-agent systems. The key idea is to organize information across three levels of query completion: activities, actions, and operations. By probing the multi-agent system through natural language, DiLLS derives and organizes information about planning and execution into a structured, multi-layered…
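The three levels named in the abstract can be pictured as a nested data structure. The sketch below is only an illustration and not the DiLLS implementation: it assumes that activities contain actions and actions contain operations, and every class name, field, and the `failure_paths` helper is hypothetical.

```python
from dataclasses import dataclass, field


@dataclass
class Operation:
    """Lowest layer: one concrete step, e.g. a single tool call (hypothetical fields)."""
    agent: str
    description: str
    ok: bool = True


@dataclass
class Action:
    """Middle layer: assumed here to group related operations under one goal."""
    goal: str
    operations: list[Operation] = field(default_factory=list)

    @property
    def ok(self) -> bool:
        return all(op.ok for op in self.operations)


@dataclass
class Activity:
    """Top layer: assumed here to group actions into a high-level phase."""
    name: str
    actions: list[Action] = field(default_factory=list)

    @property
    def ok(self) -> bool:
        return all(action.ok for action in self.actions)


def failure_paths(activities: list[Activity]) -> list[tuple[str, str, str]]:
    """Drill down from activities to the individual operations that failed."""
    paths = []
    for activity in activities:
        if activity.ok:
            continue  # skip subtrees with no failed operation
        for action in activity.actions:
            for op in action.operations:
                if not op.ok:
                    paths.append((activity.name, action.goal, op.description))
    return paths


# Made-up trace for illustration only.
trace = [
    Activity("plan trip", [
        Action("pick dates", [Operation("planner", "read calendar")]),
    ]),
    Activity("book trip", [
        Action("reserve flight", [
            Operation("booker", "search flights"),
            Operation("booker", "submit payment", ok=False),
        ]),
    ]),
]

print(failure_paths(trace))
# prints [('book trip', 'reserve flight', 'submit payment')]
```

The point of the layering in this sketch is that a reader can start at the coarsest level, discard activities that completed cleanly, and inspect raw operations only where something went wrong, rather than scanning the full log.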

Taxonomy

Topics: Software System Performance and Reliability · Multi-Agent Systems and Negotiation · AI-based Problem Solving and Planning