Diagnosing and Repairing Citation Failures in Generative Engine Optimization
Zhihua Tian, Yuhan Chen, Yao Tang, Jian Liu, Ruoxi Jia

TL;DR
This paper presents a diagnostic framework for improving citation success in Generative Engine Optimization, addressing why documents are not cited and applying targeted repairs to enhance content visibility.
Contribution
It introduces a taxonomy of citation failure modes, an agentic system called AgentGEO for diagnosis and repair, and a benchmark for evaluating generalization across queries.
Findings
AgentGEO improves citation rates by over 40%
Modifies only 5% of content on average
Generic optimization can harm long-tail content
Abstract
Generative Engine Optimization (GEO) aims to improve content visibility in AI-generated responses. However, existing methods measure contribution-how much a document influences a response-rather than citation, the mechanism that actually drives traffic back to creators. Also, these methods apply generic rewriting rules uniformly, failing to diagnose why individual document are not cited. This paper introduces a diagnostic approach to GEO that asks why a document fails to be cited and intervenes accordingly. We develop a unified framework comprising: (1) the first taxonomy of citation failure modes spanning different stages of a citation pipeline; (2) AgentGEO, an agentic system that diagnoses failures using this taxonomy, selects targeted repairs from a corresponding tool library, and iterates until citation is achieved; and (3) a document-centric benchmark evaluating whether…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Digital Humanities and Scholarship · Data Visualization and Analytics
