Think Together and Work Better: Combining Humans' and LLMs' Think-Aloud Outcomes for Effective Text Evaluation
SeongYeub Chu, JongWoo Kim, and MunYong Yi

TL;DR
This paper presents InteractEval, a framework combining human expertise and LLMs using the Think-Aloud method to improve checklist-based text evaluation across multiple quality dimensions.
Contribution
It introduces a novel framework that integrates human and LLM reasoning with the Think-Aloud method, enhancing attribute generation and evaluation performance.
Findings
Humans excel at internal quality attributes like Coherence and Fluency.
LLMs perform better on external alignment attributes like Consistency and Relevance.
Combining humans and LLMs yields the best evaluation outcomes.
Abstract
This study introduces \textbf{InteractEval}, a framework that integrates human expertise and Large Language Models (LLMs) using the Think-Aloud (TA) method to generate attributes for checklist-based text evaluation. By combining human flexibility and reasoning with LLM consistency, InteractEval outperforms traditional non-LLM-based and LLM-based baselines across four distinct dimensions, consisting of Coherence, Fluency, Consistency, and Relevance. The experiment also investigates the effectiveness of the TA method, showing that it promotes divergent thinking in both humans and LLMs, leading to the generation of a wider range of relevant attributes and enhance text evaluation performance. Comparative analysis reveals that humans excel at identifying attributes related to internal quality (Coherence and Fluency), but LLMs perform better at those attributes related to external alignment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Research and Analysis · Advanced Text Analysis Techniques · Education Practices and Evaluation
