SAGEO Arena: A Realistic Environment for Evaluating Search-Augmented Generative Engine Optimization
Sunghwan Kim, Wooseok Jeong, Serin Kim, Sangam Lee, Dongha Lee

TL;DR
This paper introduces SAGEO Arena, a comprehensive environment for evaluating search-augmented generative engine optimization, addressing current gaps by incorporating realistic web document structures and full pipeline analysis.
Contribution
We develop SAGEO Arena, a realistic benchmark environment that enables stage-level analysis of SAGEO, integrating structural web data and full search pipelines for better evaluation.
Findings
Existing approaches are often impractical in realistic settings.
Structural information improves SAGEO effectiveness.
Tailoring optimization to each pipeline stage is essential.
Abstract
Search-Augmented Generative Engines (SAGE) have emerged as a new paradigm for information access, bridging web-scale retrieval with generative capabilities to deliver synthesized answers. This shift has fundamentally reshaped how web content gains exposure online, giving rise to Search-Augmented Generative Engine Optimization (SAGEO), the practice of optimizing web documents to improve their visibility in AI-generated responses. Despite growing interest, no evaluation environment currently supports comprehensive investigation of SAGEO. Specifically, existing benchmarks lack end-to-end visibility evaluation of optimization strategies, operating on pre-determined candidate documents that abstract away retrieval and reranking preceding generation. Moreover, existing benchmarks discard structural information (e.g., schema markup) present in real web documents, overlooking the rich signals…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInformation Retrieval and Search Behavior · Web Data Mining and Analysis · Spreadsheets and End-User Computing
