One Word Is Not Enough: Simple Prompts Improve Word Embeddings

Rajeev Ranjan

arXiv:2512.06744·cs.CL·December 9, 2025

One Word Is Not Enough: Simple Prompts Improve Word Embeddings

Rajeev Ranjan

PDF

Open Access

TL;DR

Prepending semantic prompts to words before embedding significantly enhances word similarity correlations across multiple models and benchmarks, establishing a new state-of-the-art for embedding methods without additional training.

Contribution

Demonstrates that simple semantic prompts improve word embedding quality on similarity tasks, outperforming static embeddings and establishing a zero-shot enhancement technique.

Findings

01

Prompts like "meaning: {word}" improve correlations by up to +0.29.

02

Some models recover from zero correlation with prompts.

03

Achieves new state-of-the-art correlations on standard benchmarks.

Abstract

Text embedding models are designed for sentence-level applications like retrieval and semantic similarity, and are primarily evaluated on sentence-level benchmarks. Their behavior on isolated words is less understood. We show that simply prepending semantic prompts to words before embedding substantially improves word similarity correlations. Testing 7 text embedding models, including text-embedding-3-large (OpenAI), embed-english-v3.0 (Cohere), voyage-3(Voyage AI), all-mpnet-base-v2, and Qwen3-Embedding-8B, on 3 standard benchmarks (SimLex-999, WordSim-353, MEN-3000), we find that prompts like "meaning: {word}" or "Represent the semantic concept: {word}" improve Spearman correlations by up to +0.29 on SimLex-999. Some models fail completely on bare words (correlation = 0) but recover with prompts (+0.73 improvement). Our best results achieve correlation = 0.692 on SimLex-999 with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Graph Neural Networks · Text Readability and Simplification