BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval   Augmented Long-Context Large Language Models

Kun Luo; Zheng Liu; Shitao Xiao; Kang Liu

arXiv:2402.11573·cs.CL·February 20, 2024·1 cites

BGE Landmark Embedding: A Chunking-Free Embedding Method For Retrieval Augmented Long-Context Large Language Models

Kun Luo, Zheng Liu, Shitao Xiao, Kang Liu

PDF

Open Access

TL;DR

This paper introduces Extensible Embedding, a novel method for extending the context of large language models efficiently and flexibly, enabling access to larger context scopes without significant cost or quality loss.

Contribution

The paper presents a chunking-free, high-density embedding technique that enhances context extension in LLMs, improving flexibility, sample efficiency, and compatibility.

Findings

01

Effective long-context modeling demonstrated in experiments

02

Supports diverse context lengths flexibly

03

Cost-efficient training process

Abstract

Large language models (LLMs) call for extension of context to handle many critical applications. However, the existing approaches are prone to expensive costs and inferior quality of context extension. In this work, we proposeExtensible Embedding, which realizes high-quality extension of LLM's context with strong flexibility and cost-effectiveness. Extensible embedding stand as an enhancement of typical token embedding, which represents the information for an extensible scope of context instead of a single token. By leveraging such compact input units of higher information density, the LLM can access to a vast scope of context even with a small context window. Extensible embedding is systematically optimized in architecture and training method, which leads to multiple advantages. 1) High flexibility of context extension, which flexibly supports ad-hoc extension of diverse context…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Recommender Systems and Techniques · Natural Language Processing Techniques