END: Early Noise Dropping for Efficient and Effective Context Denoising

Hongye Jin; Pei Chen; Jingfeng Yang; Zhengyang Wang; Fangran Mo; Jinghan Zhang; Meng Jiang; Yifan Gao; Binxuan Huang; Xinyang Zhang; Zheng Li; Tianyi Liu; Huasheng Li; Bing Yin

arXiv:2502.18915·cs.CL·May 21, 2026

END: Early Noise Dropping for Efficient and Effective Context Denoising

Hongye Jin, Pei Chen, Jingfeng Yang, Zhengyang Wang, Fangran Mo, Jinghan Zhang, Meng Jiang, Yifan Gao, Binxuan Huang, Xinyang Zhang, Zheng Li, Tianyi Liu, Huasheng Li, Bing Yin

PDF

TL;DR

END introduces a method to improve LLM performance by early noise removal from input sequences, enhancing efficiency and accuracy without fine-tuning.

Contribution

The paper presents a novel early noise dropping technique that leverages LLMs' implicit understanding to filter noisy input chunks before token generation.

Findings

01

END improves LLM performance across multiple datasets.

02

END reduces computational overhead during inference.

03

The approach enhances understanding of LLM internal reasoning processes.

Abstract

Large Language Models (LLMs) have demonstrated remarkable performance across a wide range of natural language processing tasks. However, they are often distracted by irrelevant or noisy context in input sequences that degrades output quality. This problem affects both long- and short-context scenarios, such as retrieval-augmented generation, table question-answering, and in-context learning. We reveal that LLMs can implicitly identify whether input sequences contain useful information at early layers, prior to token generation. Leveraging this insight, we introduce Early Noise Dropping (\textsc{END}), a novel approach to mitigate this issue without requiring fine-tuning the LLMs. \textsc{END} segments input sequences into chunks and employs a linear prober on the early layers of LLMs to differentiate between informative and noisy chunks. By discarding noisy chunks early in the process,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Natural Language Processing Techniques