E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning

Zihan Liao; Jun Wang; Hang Yu; Lingxiao Wei; Jianguo Li; Jun Wang; Wei Zhang

arXiv:2409.06679·cs.CL·April 15, 2026

E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning

Zihan Liao, Jun Wang, Hang Yu, Lingxiao Wei, Jianguo Li, Jun Wang, Wei Zhang

PDF

1 Repo 1 Video

TL;DR

E2LLM introduces a novel method for long-context understanding in LLMs by compressing chunks of text into soft prompts, improving performance and efficiency in tasks like summarization and question answering.

Contribution

The paper proposes E2LLM, a new approach that effectively balances long-context performance, computational efficiency, and compatibility with pretrained models.

Findings

01

E2LLM outperforms 8 SOTA methods in effectiveness and efficiency.

02

Achieves top performance on LongBench v2 among comparable models.

03

Uses soft prompts and training objectives to enhance reasoning with long contexts.

Abstract

Processing long contexts is increasingly important for Large Language Models (LLMs) in tasks like multi-turn dialogues, code generation, and document summarization. This paper addresses the challenges of achieving high long-context performance, low computational complexity, and compatibility with pretrained models -- collectively termed the ``impossible triangle''. We introduce E2LLM (Encoder Elongated Large Language Models), a novel approach that effectively navigates this paradox. E2LLM divides long contexts into chunks, compresses each into soft prompts using a pretrained text encoder, and aligns these representations with a decoder-only LLM via an adapter. To enhance the LLM's reasoning with these soft prompts, we employ two training objectives: encoder output reconstruction and long-context instruction fine-tuning. Extensive experiments reveal that E2LLM not only outperforms 8…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

codefuse-ai/E2LLM
github

Videos

E2LLM: Encoder Elongated Large Language Models for Long-Context Understanding and Reasoning· underline