C$^2$-Cite: Contextual-Aware Citation Generation for Attributed Large Language Models
Yue Yu, Ting Bai, HengZhi Lan, Li Qian, Li Peng, Jie Wu, Wei Liu, Jian Luan, Chuan Shi

TL;DR
C$^2$-Cite is a framework that improves attribution in large language models by explicitly aligning citation markers with their referenced content, enhancing the integration and reliability of generated citations.
Contribution
The paper introduces C$^2$-Cite, a novel contextual-aware citation generation method that explicitly encodes and aligns citation markers with source content, addressing limitations of existing attribution techniques.
Findings
Outperforms SOTA by 5.8% in citation quality
Achieves 17.4% improvement in response correctness
Validated on ALCE benchmark across three datasets
Abstract
The attribution technique enhances the credibility of LLMs by adding citations to the generated sentences, enabling users to trace back to the original sources and verify the reliability of the output. However, existing instruction-tuned attributed LLMs often fail to properly interpret the contextual semantics of citation symbols (e.g., [i]) during text generation. This shortcoming arises from their insufficient awareness of the context information surrounding citation markers, which in turn leads to disjointed references and poor integration of retrieved knowledge into the generated content. To address this issue, we propose a novel \textbf{C}ontextual-aware \textbf{C}itation generation framework (\textbf{C}-\textbf{Cite}) that explicitly integrates the semantic relationships between citation markers and their referenced content. Specifically, a contextual citation alignment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
