KnowThyself: An Agentic Assistant for LLM Interpretability
Suraj Prasai, Mengnan Du, Ying Zhang, Fan Yang

TL;DR
KnowThyself is a chat-based tool that simplifies LLM interpretability by integrating model inspection, natural language querying, and visual explanations into an accessible, conversational platform.
Contribution
It introduces a unified, extensible system that consolidates interpretability tools into a user-friendly chat interface with an orchestrator and specialized modules.
Findings
Enhanced accessibility for LLM interpretability
Interactive visualizations and explanations in a chat format
Lowered technical barriers for users
Abstract
We develop KnowThyself, an agentic assistant that advances large language model (LLM) interpretability. Existing tools provide useful insights but remain fragmented and code-intensive. KnowThyself consolidates these capabilities into a chat-based interface, where users can upload models, pose natural language questions, and obtain interactive visualizations with guided explanations. At its core, an orchestrator LLM first reformulates user queries, an agent router further directs them to specialized modules, and the outputs are finally contextualized into coherent explanations. This design lowers technical barriers and provides an extensible platform for LLM inspection. By embedding the whole process into a conversational workflow, KnowThyself offers a robust foundation for accessible LLM interpretability.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Artificial Intelligence in Healthcare and Education · Topic Modeling
