Extensible Embedding: A Flexible Multipler For LLM's Context Length
Ninglu Shao, Shitao Xiao, Zheng Liu, Peitian Zhang

TL;DR
This paper introduces Extensible Embedding, a novel approach that enhances large language models by enabling flexible, high-quality context extension through a compact, information-dense embedding method that is cost-effective and compatible with existing models.
Contribution
The paper proposes Extensible Embedding, a new method that improves context extension in LLMs by representing multiple tokens as a single, information-rich embedding, enhancing flexibility and efficiency.
Findings
Supports diverse context lengths flexibly
Achieves high sample efficiency in training
Demonstrates superior performance in long-context tasks
Abstract
Large language models (LLMs) call for extension of context to handle many critical applications. However, the existing approaches are prone to expensive costs and inferior quality of context extension. In this work, we propose Extensible Embedding, which realizes high-quality extension of LLM's context with strong flexibility and cost-effectiveness. Extensible embedding stand as an enhancement of typical token embedding, which represents the information for an extensible scope of context instead of a single token. By leveraging such compact input units of higher information density, the LLM can access to a vast scope of context even with a small context window. Extensible embedding is systematically optimized in architecture and training method, which leads to multiple advantages. 1) High flexibility of context extension, which flexibly supports ad-hoc extension of diverse context…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Data Mining Algorithms and Applications · Semantic Web and Ontologies
