ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning
Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Thien Huu Nguyen

TL;DR
ULLME introduces a versatile framework for large language model embeddings, combining bidirectional attention support and a novel fine-tuning method called GRL, significantly improving passage embedding quality across diverse LLM architectures.
Contribution
The paper presents ULLME, a flexible, plug-and-play framework supporting various LLMs and fine-tuning strategies, along with GRL, a new method leveraging generative abilities for better embeddings.
Findings
Strong performance on the Massive Text Embedding Benchmark
Supports multiple LLM architectures from 1.5B to 8B parameters
Open-source implementation available for practical use
Abstract
Large Language Models (LLMs) excel in various natural language processing tasks, but leveraging them for dense passage embedding remains challenging. This is due to their causal attention mechanism and the misalignment between their pre-training objectives and the text ranking tasks. Despite some recent efforts to address these issues, existing frameworks for LLM-based text embeddings have been limited by their support for only a limited range of LLM architectures and fine-tuning strategies, limiting their practical application and versatility. In this work, we introduce the Unified framework for Large Language Model Embedding (ULLME), a flexible, plug-and-play implementation that enables bidirectional attention across various LLMs and supports a range of fine-tuning strategies. We also propose Generation-augmented Representation Learning (GRL), a novel fine-tuning method to boost LLMs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsSoftmax · Attention Is All You Need
