Meta-Task Prompting Elicits Embeddings from Large Language Models

Yibin Lei; Di Wu; Tianyi Zhou; Tao Shen; Yu Cao; Chongyang Tao; Andrew; Yates

arXiv:2402.18458·cs.CL·July 23, 2024·1 cites

Meta-Task Prompting Elicits Embeddings from Large Language Models

Yibin Lei, Di Wu, Tianyi Zhou, Tao Shen, Yu Cao, Chongyang Tao, Andrew, Yates

PDF

Open Access 1 Repo 1 Video

TL;DR

MetaEOL introduces an unsupervised, prompt-based method for generating high-quality sentence embeddings from LLMs, outperforming contrastive models on various benchmarks without fine-tuning.

Contribution

It presents a novel meta-task prompting approach that guides LLMs to produce versatile embeddings, establishing a new scaling law for resource-efficient embedding generation.

Findings

01

Embeddings from MetaEOL outperform contrastive-trained models on STS benchmarks.

02

MetaEOL embeddings excel in downstream NLP tasks.

03

A new scaling law for embedding quality and resource efficiency.

Abstract

We introduce a new unsupervised text embedding method, Meta-Task Prompting with Explicit One-Word Limitation (MetaEOL), for generating high-quality sentence embeddings from Large Language Models (LLMs) without the need for model fine-tuning. Leveraging meta-task prompting, MetaEOL guides LLMs to produce embeddings through a series of carefully designed prompts that address multiple representational aspects. Our comprehensive experiments demonstrate that embeddings averaged from various meta-tasks are versatile embeddings that yield competitive performance on Semantic Textual Similarity (STS) benchmarks and excel in downstream tasks, surpassing contrastive-trained models. Our findings suggest a new scaling law, offering a versatile and resource-efficient approach for embedding generation across diverse scenarios.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yibin-lei/metaeol
pytorchOfficial

Videos

Meta-Task Prompting Elicits Embeddings from Large Language Models· underline

Taxonomy

TopicsTopic Modeling