Self-reflection in Automated Qualitative Coding: Improving Text Annotation through Secondary LLM Critique
Zackary Okun Dunivin, Mobina Noori, Seth Frey, Curtis Atkinson

TL;DR
This paper introduces a two-stage LLM workflow with self-reflection to improve qualitative coding accuracy, significantly reducing errors and enhancing annotation quality in large datasets.
Contribution
It presents a novel self-reflection approach for LLM-based qualitative coding that improves precision and reduces false positives through secondary critique.
Findings
Self-reflection reduces false-positive rates from 8-54% to improve F1 scores.
Secondary critique enhances code interpretation accuracy and reduces errors.
The approach is effective across multiple codes and datasets.
Abstract
Large language models (LLMs) allow for sophisticated qualitative coding of large datasets, but zero- and few-shot classifiers can produce an intolerable number of errors, even with careful, validated prompting. We present a simple, generalizable two-stage workflow: an LLM applies a human-designed, LLM-adapted codebook; a secondary LLM critic performs self-reflection on each positive label by re-reading the source text alongside the first model's rationale and issuing a final decision. We evaluate this approach on six qualitative codes over 3,000 high-content emails from Apache Software Foundation project evaluation discussions. Our human-derived audit of 360 positive annotations (60 passages by six codes) found that the first-line LLM had a false-positive rate of 8% to 54%, despite F1 scores of 0.74 and 1.00 in testing. Subsequent recoding of all stage-one annotations via a second…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Artificial Intelligence in Healthcare and Education · Qualitative Research Methods and Applications
