Embedding-Aligned Language Models

Guy Tennenholtz; Yinlam Chow; Chih-Wei Hsu; Lior Shani; Ethan Liang,; Craig Boutilier

arXiv:2406.00024·cs.CL·October 29, 2024

Embedding-Aligned Language Models

Guy Tennenholtz, Yinlam Chow, Chih-Wei Hsu, Lior Shani, Ethan Liang,, Craig Boutilier

PDF

Open Access 1 Video

TL;DR

This paper introduces EAGLE, a reinforcement learning-based method for aligning large language models with objectives in a latent embedding space, enhancing controlled and domain-specific text generation.

Contribution

It presents a novel RL approach that guides LLMs in adhering to embedding space objectives, improving content relevance and domain alignment.

Findings

01

EAGLE effectively surfaces content gaps in user demand datasets.

02

Optimal design of state-dependent actions improves efficiency.

03

Demonstrates controlled text generation aligned with domain knowledge.

Abstract

We propose a novel approach for training large language models (LLMs) to adhere to objectives defined within a latent embedding space. Our method leverages reinforcement learning (RL), treating a pre-trained LLM as an environment. Our embedding-aligned guided language (EAGLE) agent is trained to iteratively steer the LLM's generation towards optimal regions of the latent embedding space, w.r.t. some predefined criterion. We demonstrate the effectiveness of the EAGLE agent using the MovieLens 25M and Amazon Review datasets to surface content gaps that satisfy latent user demand. We also demonstrate the benefit of using an optimal design of a state-dependent action set to improve EAGLE's efficiency. Our work paves the way for controlled and grounded text generation using LLMs, ensuring consistency with domain-specific knowledge and data representations.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Embedding-Aligned Language Models· slideslive

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques

MethodsSparse Evolutionary Training