Steering Large Language Models to Evaluate and Amplify Creativity

Matthew Lyle Olson; Neale Ratzlaff; Musashi Hinck; Shao-yen Tseng,; Vasudev Lal

arXiv:2412.06060·cs.CL·December 10, 2024

Steering Large Language Models to Evaluate and Amplify Creativity

Matthew Lyle Olson, Neale Ratzlaff, Musashi Hinck, Shao-yen Tseng,, Vasudev Lal

PDF

Open Access

TL;DR

This paper presents a method to evaluate and enhance the creativity of large language models by analyzing their internal states when generating creative versus boring responses, aligning better with human judgment.

Contribution

It introduces a mechanistic approach to measure and improve LLM creativity by leveraging internal state differences during response generation.

Findings

01

Internal state differences correlate with human judgments of creativity.

02

The method improves the creativity of generated text at inference time.

03

The approach provides a robust measure of creativity for LLMs.

Abstract

Although capable of generating creative text, Large Language Models (LLMs) are poor judges of what constitutes "creativity". In this work, we show that we can leverage this knowledge of how to write creatively in order to better judge what is creative. We take a mechanistic approach that extracts differences in the internal states of an LLM when prompted to respond "boringly" or "creatively" to provide a robust measure of creativity that corresponds strongly with human judgment. We also show these internal state differences can be applied to enhance the creativity of generated text at inference time.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOnline Learning and Analytics · Creativity in Education and Neuroscience