QUILL: Quotation Generation Enhancement of Large Language Models
Jin Xiao, Bowei Zhang, Qianyu He, Jiaqing Liang, Feng Wei, Jinglei, Chen, Zujie Liang, Deqing Yang, Yanghua Xiao

TL;DR
This paper introduces QUILL, a system that enhances large language models' ability to generate accurate and relevant quotations by establishing a new evaluation framework, creating a comprehensive bilingual quote database, and developing a reranking metric.
Contribution
The paper presents a novel evaluation system, a large bilingual quote knowledge base, and a quotation-specific reranking metric to improve LLMs' quotation generation performance.
Findings
Metrics strongly correlate with human preferences.
The knowledge base and reranking improve quote relevance.
Existing LLMs struggle with quote generation, but our methods help.
Abstract
While Large language models (LLMs) have become excellent writing assistants, they still struggle with quotation generation. This is because they either hallucinate when providing factual quotations or fail to provide quotes that exceed human expectations. To bridge the gap, we systematically study how to evaluate and improve LLMs' performance in quotation generation tasks. We first establish a holistic and automatic evaluation system for quotation generation task, which consists of five criteria each with corresponding automatic metric. To improve the LLMs' quotation generation abilities, we construct a bilingual knowledge base that is broad in scope and rich in dimensions, containing up to 32,022 quotes. Moreover, guided by our critiria, we further design a quotation-specific metric to rerank the retrieved quotations from the knowledge base. Extensive experiments show that our metrics…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques
MethodsBalanced Selection
