EmbedGrad: Gradient-Based Prompt Optimization in Embedding Space for Large Language Models

Xiaoming Hou; Jiquan Zhang; Zibin Lin; DaCheng Tao; Shengli Zhang

arXiv:2508.03533·cs.CL·August 6, 2025


TL;DR

EmbedGrad introduces a gradient-based method to optimize prompt embeddings, enabling precise, interpretable, and effective task adaptation for large language models without altering their architecture.

Contribution

This work presents EmbedGrad, a novel framework that refines prompt embeddings via gradient-based optimization, bridging the gap between prompt engineering and parameter tuning.

Findings

1. Significant accuracy improvements on mathematical reasoning tasks.

2. Consistent performance gains across various model sizes and tasks.

3. Enhanced reasoning capabilities through embedding refinement.

Abstract

Effectively adapting powerful pretrained foundation models to diverse tasks remains a key challenge in AI deployment. Current approaches primarily follow two paradigms: discrete optimization of text prompts through prompt engineering, or continuous adaptation via additional trainable parameters. Both exhibit limitations: discrete methods lack refinement precision, while parameter-based techniques increase complexity and reduce interpretability. To address these constraints, we propose EmbedGrad, a novel framework that optimizes text prompt embeddings through gradient-based refinement. Our approach uniquely decouples training from deployment: during optimization, labeled examples guide precise embedding adjustments while preserving semantic meaning; during inference, only optimized embeddings integrate with user queries. This enables fine-grained calibration impossible in text space, such as…
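The split the abstract describes, gradient updates to the prompt embeddings during optimization and reuse of the optimized embeddings at inference, can be illustrated with a toy example. This is a minimal sketch under strong assumptions, not the authors' implementation: the frozen "model" is a hypothetical three-dimensional linear scorer `W` standing in for an LLM, the labeled data and initial prompt vectors are invented, and the names `forward`, `loss`, and `optimize_prompt` are illustrative only. It also does not model the paper's constraint of preserving semantic meaning.

```python
import math

# Frozen toy "model": a fixed linear scorer over the mean of all input
# embeddings. Hypothetical stand-in for an LLM; W is never updated.
W = [0.5, -1.0, 0.25]

def forward(prompt_emb, query_emb):
    """Return P(label = 1) for prompt vectors followed by one query vector."""
    rows = prompt_emb + [query_emb]
    n = len(rows)
    pooled = [sum(r[j] for r in rows) / n for j in range(len(W))]
    logit = sum(w * x for w, x in zip(W, pooled))
    return 1.0 / (1.0 + math.exp(-logit))

def loss(prompt_emb, data):
    """Mean binary cross-entropy over labeled (query, label) pairs."""
    total = 0.0
    for q, y in data:
        p = forward(prompt_emb, q)
        total -= y * math.log(p) + (1 - y) * math.log(1 - p)
    return total / len(data)

def optimize_prompt(prompt_emb, data, lr=0.5, steps=200):
    """Gradient descent on the prompt embeddings only; W stays frozen."""
    prompt = [row[:] for row in prompt_emb]  # copy, so the input is untouched
    n = len(prompt) + 1                      # prompt rows plus one query row
    for _ in range(steps):
        grad_row = [0.0] * len(W)
        for q, y in data:
            p = forward(prompt, q)
            # d(BCE)/d(logit) = p - y and d(logit)/d(prompt[i][j]) = W[j] / n,
            # so every prompt row receives the same gradient in this toy model.
            for j in range(len(W)):
                grad_row[j] += (p - y) * W[j] / n / len(data)
        for row in prompt:
            for j in range(len(W)):
                row[j] -= lr * grad_row[j]
    return prompt

# Invented labeled examples and an invented two-vector initial prompt.
data = [([1.0, 0.0, 0.0], 1), ([0.0, 1.0, 0.0], 1), ([0.0, 0.0, 1.0], 0)]
initial = [[0.1, 0.2, -0.1], [0.0, -0.3, 0.2]]

# Optimization phase: labeled examples adjust only the prompt embeddings.
tuned = optimize_prompt(initial, data)
print("loss before:", loss(initial, data))
print("loss after: ", loss(tuned, data))

# Inference phase: the optimized embeddings are combined with a new query.
print("prediction:", forward(tuned, [0.5, 0.5, 0.0]))
```

In a real setting the gradient would come from automatic differentiation through a frozen language model, and the initial vectors would be the embeddings of a human-written text prompt; the hand-derived gradient here only works because the toy scorer is linear.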
