Embedding-based In-Context Prompt Training for Enhancing LLMs as Text Encoders

Ailiang Lin; Zhuoyun Li; Keyu Mao; Kotaro Funakoshi; Manabu Okumura

arXiv:2605.01372·cs.CL·May 5, 2026

Embedding-based In-Context Prompt Training for Enhancing LLMs as Text Encoders

Ailiang Lin, Zhuoyun Li, Keyu Mao, Kotaro Funakoshi, Manabu Okumura

PDF

TL;DR

EPIC introduces an embedding-based in-context prompt training method that improves LLM-generated embeddings, reduces computational costs, and achieves state-of-the-art results on the MTEB benchmark.

Contribution

The paper presents a novel embedding-based prompt training strategy that enhances embedding quality and efficiency, surpassing existing models on key benchmarks.

Findings

01

EPIC achieves state-of-the-art results on the MTEB benchmark.

02

The method reduces token overhead compared to traditional in-context learning.

03

EPIC-trained models perform well with or without in-context prompts.

Abstract

Large language models (LLMs) have been widely explored for embedding generation. While recent studies show that in-context learning (ICL) effectively enhances the representational capability of LLMs by prepending a few task-related demonstrations, it causes substantial token overhead due to the increased sequence length. In this work, we propose EPIC, a novel embedding-based in-context prompt training strategy that leverages ICL to generate high-quality embeddings while reducing computational burden during both training and inference. This approach replaces discrete text demonstrations with their corresponding continuous embeddings, which not only encourages the LLM to align semantically-related text pairs during contrastive learning, but also requires the model to interpret demonstration embeddings as part of the in-context prompt. Consequently, EPIC-trained models achieve excellent…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.