Efficiency with Rigor! A Trustworthy LLM-powered Workflow for Qualitative Data Analysis

Jie Gao; Zhiyao Shu; Shun Yi Yeo; Alok Prakash; Chien-Ming Huang; Mark Dredze; Ziang Xiao

arXiv:2501.00775·cs.HC·October 28, 2025

Efficiency with Rigor! A Trustworthy LLM-powered Workflow for Qualitative Data Analysis

Jie Gao, Zhiyao Shu, Shun Yi Yeo, Alok Prakash, Chien-Ming Huang, Mark Dredze, Ziang Xiao

PDF

Open Access

TL;DR

This paper introduces MindCoder, a transparent, LLM-powered workflow for qualitative data analysis that enhances trustworthiness by combining automation with human interpretive control and detailed logging.

Contribution

It presents a novel workflow, MindCoder, that balances automation and human involvement in QDA, ensuring transparency and trustworthiness.

Findings

01

MindCoder supports active interpretation and flexible control.

02

It produces more trustworthy codebooks.

03

The workflow maintains comprehensive logs for transparency.

Abstract

Qualitative data analysis (QDA) emphasizes trustworthiness, requiring sustained human engagement and reflexivity. Recently, large language models (LLMs) have been applied in QDA to improve efficiency. However, their use raises concerns about unvalidated automation and displaced sensemaking, which can undermine trustworthiness. To address these issues, we employed two strategies: transparency and human involvement. Through a literature review and formative interviews, we identified six design requirements for transparent automation and meaningful human involvement. Guided by these requirements, we developed MindCoder, an LLM-powered workflow that delegates mechanical tasks, such as grouping and validation, to the system, while enabling humans to conduct meaningful interpretation. MindCoder also maintains comprehensive logs of users' step-by-step interactions to ensure transparency and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics · Computational and Text Analysis Methods