CentaurTA Studio: A Self-Improving Human-Agent Collaboration System for Thematic Analysis
Lei Wang, Min Huang, Eduard Dragut

TL;DR
CentaurTA Studio is a web-based system that enhances human-agent collaboration for thematic analysis through self-improvement, feedback, and process control, achieving high accuracy and efficiency across multiple domains.
Contribution
It introduces a novel self-improving framework combining feedback, prompt optimization, and evaluation to improve thematic analysis performance.
Findings
Achieves up to 92.12% accuracy in thematic analysis tasks.
Substantial agreement (κ=0.68) between system and human annotators.
Performance drops from 90% to 81% without feedback loop.
Abstract
Thematic analysis is difficult to scale: manual workflows are labor-intensive, while fully automated pipelines often lack controllability and transparent evaluation. We present \textbf{CentaurTA Studio}, a web-based system for self-improving human--agent collaboration in open coding and theme construction. The system integrates (1) a two-stage human feedback pipeline separating simulator drafting and expert validation, (2) persistent prompt optimization that distills validated feedback into reusable alignment principles, and (3) rubric-based evaluation with early stopping for process control. Across three domains, CentaurTA achieves the strongest performance in both Open Coding and Theme Construction, reaching up to 92.12\% accuracy and consistently outperforming baseline systems. Agreement between the rubric-based LLM judge and human annotators reaches substantial reliability…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
