FltLM: An Intergrated Long-Context Large Language Model for Effective Context Filtering and Understanding
Jingyang Deng, Zhengyang Shen, Boyang Wang, Lixin Su, Suqi Cheng, Ying, Nie, Junfeng Wang, Dawei Yin, Jinwen Ma

TL;DR
FltLM is a novel long-context language model that uses dynamic context filtering to improve understanding and reasoning in multi-document question-answering tasks, addressing key challenges of lost information and distraction.
Contribution
The paper introduces FltLM, an integrated long-context LLM with a context filter mechanism that enhances focus and comprehension in multi-document QA tasks.
Findings
FltLM outperforms fine-tuning and retrieval-based methods in complex QA.
The model effectively mitigates lost context and distraction issues.
FltLM operates efficiently in a single forward pass.
Abstract
The development of Long-Context Large Language Models (LLMs) has markedly advanced natural language processing by facilitating the process of textual data across long documents and multiple corpora. However, Long-Context LLMs still face two critical challenges: The lost in the middle phenomenon, where crucial middle-context information is likely to be missed, and the distraction issue that the models lose focus due to overly extended contexts. To address these challenges, we propose the Context Filtering Language Model (FltLM), a novel integrated Long-Context LLM which enhances the ability of the model on multi-document question-answering (QA) tasks. Specifically, FltLM innovatively incorporates a context filter with a soft mask mechanism, identifying and dynamically excluding irrelevant content to concentrate on pertinent information for better comprehension and reasoning. Our approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsContext-Aware Activity Recognition Systems · Recommender Systems and Techniques · Human Mobility and Location-Based Analysis
MethodsFocus
