LongCite: Enabling LLMs to Generate Fine-grained Citations in Long-context QA
Jiajie Zhang, Yushi Bai, Xin Lv, Wanjun Gu, Danqing Liu, Minhao Zou, Shulin Cao, Lei Hou, Yuxiao Dong, Ling Feng, Juanzi Li

TL;DR
This paper introduces LongCite, a method for training large language models to generate responses with precise, sentence-level citations in long-context question answering, enhancing trustworthiness and verifiability.
Contribution
The paper presents a novel pipeline and dataset for training LLMs to produce fine-grained citations, significantly improving citation accuracy in long-context QA.
Findings
LongCite models achieve state-of-the-art citation quality.
LongCite models outperform GPT-4o in citation accuracy.
The pipeline enables automatic generation of high-quality citation datasets.
Abstract
Though current long-context large language models (LLMs) have demonstrated impressive capacities in answering user questions based on extensive text, the lack of citations in their responses makes user verification difficult, leading to concerns about their trustworthiness due to their potential hallucinations. In this work, we aim to enable long-context LLMs to generate responses with fine-grained sentence-level citations, improving their faithfulness and verifiability. We first introduce LongBench-Cite, an automated benchmark for assessing current LLMs' performance in Long-Context Question Answering with Citations (LQAC), revealing considerable room for improvement. To this end, we propose CoF (Coarse to Fine), a novel pipeline that utilizes off-the-shelf LLMs to automatically generate long-context QA instances with precise sentence-level citations, and leverage this pipeline to…
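To make "sentence-level citation" concrete, here is a minimal sketch of how a cited statement could be mapped back to the context sentences that support it. The `[start-end]` marker format, the regex-based sentence splitter, and the function names `index_sentences` and `resolve_citations` are illustrative assumptions, not the format or code used by the paper.

```python
import re

# Hypothetical marker format: "[start-end]" refers to an inclusive range
# of 0-based sentence indices in the context. This is an assumption.
CITE = re.compile(r"\[(\d+)-(\d+)\]")


def index_sentences(context: str) -> list[str]:
    """Naively split a context into sentences on ., ! or ? followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", context.strip())
    return [p for p in parts if p]


def resolve_citations(statement: str, sentences: list[str]) -> list[str]:
    """Return the context sentences referenced by the markers in a statement."""
    cited = []
    for match in CITE.finditer(statement):
        start, end = int(match.group(1)), int(match.group(2))
        cited.extend(sentences[start:end + 1])  # inclusive range
    return cited


context = (
    "Paris is the capital of France. It lies on the Seine. "
    "Berlin is the capital of Germany."
)
sentences = index_sentences(context)
answer = "Paris is on the Seine [0-1]."
evidence = resolve_citations(answer, sentences)
```

With citations resolved this way, a reader (or an automated checker) only needs to compare each statement against the few sentences in `evidence` rather than rereading the whole context.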
Code & Models
- 🤗 ibm-granite/granite-3.2-8b-lora-rag-citation-generation · model · 11 dl · ♡ 4
- 🤗 zai-org/LongCite-glm4-9b · model · 108 dl · ♡ 33
- 🤗 zai-org/LongCite-llama3.1-8b · model · 353 dl · ♡ 30
- 🤗 voidism/SelfCite-8B · model · 34 dl · ♡ 2
- 🤗 voidism/SelfCite-8B-from-CC · model · 38 dl · ♡ 2
- 🤗 MuXodious/LongCite-llama3.1-8b-absolute-heresy · model
Taxonomy
Topics: Natural Language Processing Techniques · Topic Modeling · Mathematics, Computing, and Information Processing
Methods: Shrink and Fine-Tune
