CLAWS:Creativity detection for LLM-generated solutions using Attention Window of Sections
Keuntae Kim, Eunhye Jeong, Sehyeon Lee, Seohee Yoon, Yong Suk Choi

TL;DR
This paper introduces CLAWS, a novel method for automatically detecting creativity in LLM-generated solutions by analyzing attention weights, addressing the challenge of assessing creativity without human evaluation in reasoning tasks.
Contribution
CLAWS is the first approach to classify mathematical solutions into typical, creative, and hallucinated categories without human input, using attention weights across prompt sections.
Findings
CLAWS outperforms five existing detection methods.
Validated on 4545 math problems from 181 contests.
Effective across multiple 7-8B RL models.
Abstract
Recent advances in enhancing the reasoning ability of large language models (LLMs) have been remarkably successful. LLMs trained with reinforcement learning (RL) for reasoning demonstrate strong performance in challenging tasks such as mathematics and coding, even with relatively small model sizes. However, despite these improvements in task accuracy, the assessment of creativity in LLM generations has been largely overlooked in reasoning tasks, in contrast to writing tasks. The lack of research on creativity assessment in reasoning primarily stems from two challenges: (1) the difficulty of defining the range of creativity, and (2) the necessity of human evaluation in the assessment process. To address these challenges, we propose CLAWS, a method that defines and classifies mathematical solutions into typical, creative, and hallucinated categories without human evaluation, by leveraging…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsArtificial Intelligence in Games · Creativity in Education and Neuroscience · Machine Learning in Materials Science
