Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules
Yueqi Zhang, Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Jiayi Shi, Chuyi Tan, Boyuan Pan, Yao Hu, Kan Li

TL;DR
This paper introduces a new framework and method for enabling quotation-aware dialogue in large language models, allowing them to locate and utilize quoted spans effectively without retraining the entire model.
Contribution
The paper formalizes span-conditioned generation, creates a benchmark for quotation-aware dialogue, and proposes QuAda, a lightweight plug-and-play module for improved quotation handling in LLMs.
Findings
QuAda improves quotation span utilization across various models.
The benchmark covers five diverse dialogue scenarios.
QuAda requires less than 2.8% of model weights to be updated.
Abstract
Human-AI conversation frequently relies on quoting earlier text-"check it with the formula I just highlighted"-yet today's large language models (LLMs) lack an explicit mechanism for locating and exploiting such spans. We formalise the challenge as span-conditioned generation, decomposing each turn into the dialogue history, a set of token-offset quotation spans, and an intent utterance. Building on this abstraction, we introduce a quotation-centric data pipeline that automatically synthesises task-specific dialogues, verifies answer correctness through multi-stage consistency checks, and yields both a heterogeneous training corpus and the first benchmark covering five representative scenarios. To meet the benchmark's zero-overhead and parameter-efficiency requirements, we propose QuAda, a lightweight training-based method that attaches two bottleneck projections to every attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsService-Oriented Architecture and Web Services · Semantic Web and Ontologies · Multi-Agent Systems and Negotiation
MethodsSparse Evolutionary Training
