Co-Refine: AI-Powered Tool Supporting Qualitative Analysis
Athikash Jeyaganthan, Kai Xu, Franziska Becker, and Steffen Koch

TL;DR
Co-Refine is an AI-powered tool that provides real-time feedback on coding consistency in qualitative analysis, addressing temporal drift issues during large dataset coding.
Contribution
It introduces a novel three-stage audit pipeline combining deterministic metrics and LLM verdicts to ensure reliable, real-time quality control in qualitative coding workflows.
Findings
Deterministic embedding metrics effectively measure coding consistency.
LLM verdicts grounded within a small score range improve reliability.
The system enables continuous, real-time detection of interpretative drift.
Abstract
Qualitative coding relies on a researcher's application of codes to textual data. As coding proceeds across large datasets, interpretations of codes often shift (temporal drift), reducing the credibility of the analysis. Existing Computer-Assisted Qualitative Data Analysis (CAQDAS) tools provide support for data management but offer no workflow for real-time detection of these drifts. We present Co-Refine, an AI-augmented qualitative coding platform that delivers continuous, grounded feedback on coding consistency without disrupting the researcher's workflow. The system employs a three-stage audit pipeline: Stage 1 computes deterministic embedding-based metrics for mathematical consistency; Stage 2 grounds LLM verdicts within of the deterministic scores; and Stage 3 produces code definitions from previous patterns to create a deepening feedback loop. Co-Refine demonstrates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
