Predictive Prompt Analysis

Jae Yong Lee; Sungmin Kang; Shin Yoo

arXiv:2501.18883·cs.SE·December 22, 2025

Predictive Prompt Analysis

Jae Yong Lee, Sungmin Kang, Shin Yoo

PDF

Open Access

TL;DR

This paper introduces a predictive prompt analysis method called SPA that estimates LLM responses to prompts efficiently, aiding prompt design without extensive trial-and-error.

Contribution

It presents SPA, a novel autoencoder-based approach for predicting LLM behavior, significantly reducing computational costs in prompt analysis.

Findings

01

SPA achieved up to 0.994 Pearson correlation with actual LLM responses.

02

SPA required only 0.4% of the time needed for running the LLM.

03

Demonstrated effectiveness in predicting syntactic structures during code synthesis.

Abstract

Large Language Models (LLMs) are machine learning models that have seen widespread adoption due to their capability of handling previously difficult tasks. LLMs, due to their training, are sensitive to how exactly a question is presented, also known as prompting. However, prompting well is challenging, as it has been difficult to uncover principles behind prompting -- generally, trial-and-error is the most common way of improving prompts, despite its significant computational cost. In this context, we argue it would be useful to perform `predictive prompt analysis', in which an automated technique would perform a quick analysis of a prompt and predict how the LLM would react to it, relative to a goal provided by the user. As a demonstration of the concept, we present Syntactic Prevalence Analyzer (SPA), a predictive prompt analysis approach based on sparse autoencoders (SAEs). SPA…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEducational and Psychological Assessments