PROMPT2BOX: Uncovering Entailment Structure among LLM Prompts

Neeladri Bhuiya; Shib Sankar Dasgupta; Andrew McCallum; Haw-Shiuan Chang

arXiv:2603.21438·cs.CL·March 24, 2026

PROMPT2BOX: Uncovering Entailment Structure among LLM Prompts

Neeladri Bhuiya, Shib Sankar Dasgupta, Andrew McCallum, Haw-Shiuan Chang

PDF

Open Access

TL;DR

PROMPT2BOX introduces a novel box embedding method for prompts that captures both semantic similarity and specificity, improving the analysis of LLM weaknesses and hierarchical prompt structure.

Contribution

It proposes a new box embedding space and a dimension reduction technique to better represent prompt specificity and facilitate visualization, outperforming vector baselines.

Findings

01

Box embeddings better capture prompt specificity than vector embeddings.

02

PROMPT2BOX identifies 8.9% more LLM weaknesses.

03

Hierarchical depth correlates 33% more with instruction specificity.

Abstract

To discover the weaknesses of LLMs, researchers often embed prompts into a vector space and cluster them to extract insightful patterns. However, vector embeddings primarily capture topical similarity. As a result, prompts that share a topic but differ in specificity, and consequently in difficulty, are often represented similarly, making fine-grained weakness analysis difficult. To address this limitation, we propose PROMPT2BOX, which embeds prompts into a box embedding space using a trained encoder. The encoder, trained on existing and synthesized datasets, outputs box embeddings that capture not only semantic similarity but also specificity relations between prompts (e.g., "writing an adventure story" is more specific than "writing a story"). We further develop a novel dimension reduction technique for box embeddings to facilitate dataset visualization and comparison. Our experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Data Visualization and Analytics