ULLME: A Unified Framework for Large Language Model Embeddings with   Generation-Augmented Learning

Hieu Man; Nghia Trung Ngo; Franck Dernoncourt; Thien Huu Nguyen

arXiv:2408.03402·cs.CL·August 8, 2024

ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning

Hieu Man, Nghia Trung Ngo, Franck Dernoncourt, Thien Huu Nguyen

PDF

Open Access 1 Repo 1 Video

TL;DR

ULLME introduces a versatile framework for large language model embeddings, combining bidirectional attention support and a novel fine-tuning method called GRL, significantly improving passage embedding quality across diverse LLM architectures.

Contribution

The paper presents ULLME, a flexible, plug-and-play framework supporting various LLMs and fine-tuning strategies, along with GRL, a new method leveraging generative abilities for better embeddings.

Findings

01

Strong performance on the Massive Text Embedding Benchmark

02

Supports multiple LLM architectures from 1.5B to 8B parameters

03

Open-source implementation available for practical use

Abstract

Large Language Models (LLMs) excel in various natural language processing tasks, but leveraging them for dense passage embedding remains challenging. This is due to their causal attention mechanism and the misalignment between their pre-training objectives and the text ranking tasks. Despite some recent efforts to address these issues, existing frameworks for LLM-based text embeddings have been limited by their support for only a limited range of LLM architectures and fine-tuning strategies, limiting their practical application and versatility. In this work, we introduce the Unified framework for Large Language Model Embedding (ULLME), a flexible, plug-and-play implementation that enables bidirectional attention across various LLMs and supports a range of fine-tuning strategies. We also propose Generation-augmented Representation Learning (GRL), a novel fine-tuning method to boost LLMs…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nlp-uoregon/ullme
pytorchOfficial

Videos

ULLME: A Unified Framework for Large Language Model Embeddings with Generation-Augmented Learning· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsSoftmax · Attention Is All You Need