Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding

Yanming Liu; Xinyue Peng; Jiannan Cao; Yanxin Shen; Tianyu Du; Sheng Cheng; Xun Wang; Jianwei Yin; Xuhong Zhang

arXiv:2410.01671·cs.CL·August 18, 2025·2 cites

Bridging Context Gaps: Leveraging Coreference Resolution for Long Contextual Understanding

Yanming Liu, Xinyue Peng, Jiannan Cao, Yanxin Shen, Tianyu Du, Sheng Cheng, Xun Wang, Jianwei Yin, Xuhong Zhang

PDF

Open Access 3 Reviews

TL;DR

This paper introduces LQCA, a coreference resolution framework designed to improve long context understanding in large language models, significantly enhancing question answering performance on lengthy texts.

Contribution

The paper proposes a novel coreference-based method tailored for long contexts, systematically improving LLMs' ability to understand and process lengthy documents.

Findings

01

Improved question answering accuracy on long texts.

02

Significant performance gains on GPT-4o and OpenAI-o1-mini models.

03

Effective handling of coreferences enhances long-context comprehension.

Abstract

Large language models (LLMs) have shown remarkable capabilities in natural language processing; however, they still face difficulties when tasked with understanding lengthy contexts and executing effective question answering. These challenges often arise due to the complexity and ambiguity present in longer texts. To enhance the performance of LLMs in such scenarios, we introduce the Long Question Coreference Adaptation (LQCA) method. This innovative framework focuses on coreference resolution tailored to long contexts, allowing the model to identify and manage references effectively. The LQCA method encompasses four key steps: resolving coreferences within sub-documents, computing the distances between mentions, defining a representative mention for coreference, and answering questions through mention replacement. By processing information systematically, the framework provides…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 8Confidence 3

Strengths

- A simple method to resolve ambiguity of mentions in a document so that an LLM can easily leverage the contextual information to generate a response. The use of a BERT-based model is a limitation in that it cannot handle longer contexts, but this work proposes a method to combine coreference resolution results for sub-documents using a simple statistics, i.e., whether a mention is within a cluster or not, to compute a distance metric. - Experiments show gains when compared with other methods c

Weaknesses

- It is not clear how the errors in coreference resolution might have an impact to the end performance since coreference resolution itself is a rather challenging task.

Reviewer 02Rating 6Confidence 4

Strengths

- The idea of performing coreference resolution before attempting to do the task at hand, makes sense and is corroborated by empirical results showing improvements across models and datasets. - The paper is well-written and the proposed method easy to follow. - The proposed coreference approach seems intuitive and could be also used during LLM pertaining or fine-tuning.

Weaknesses

- The technical contribution is somewhat limited. Coreference merging is interesting, however, without empirical evaluation it is not possible to tell whether it actually works (only down-stream evaluation results are presented, see my questions below). - What is the main contribution of this paper, is it the algorithm in Section 3.2? -

Reviewer 03Rating 6Confidence 3

Strengths

The paper addresses an important issue in natural language processing, which is the difficulty LLMs face in dealing with long contexts. The proposed LQCA method is a novel approach that combines coreference resolution with question answering in long contexts.

Weaknesses

While the paper demonstrates improvements in performance, it does not provide a comprehensive comparison with alternative methods that also aim to improve long-context understanding. The paper could benefit from a more detailed discussion on how the LQCA method handles cases with highly ambiguous references, which could be a common occurrence in real-world applications. There is a lack of analysis on the computational overhead introduced by the LQCA method, which is crucial for understanding i

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Topic Modeling · Context-Aware Activity Recognition Systems