Vector RAG vs LLM-Compiled Wiki: A Preregistered Comparison on a Small Multi-Domain Research
Theodore O. Cochran

TL;DR
This study compares Vector RAG and LLM-compiled wiki systems for answering research questions, revealing trade-offs in evidence organization, citation support, and operational costs.
Contribution
It provides a preregistered, empirical comparison of two research synthesis methods, highlighting their distinct strengths and limitations.
Findings
Wiki scored better at connecting findings across papers.
RAG was better at single-fact lookup questions.
Decomposition-based RAG improved cross-paper synthesis at lower token cost.
Abstract
We preregistered a comparison of two ways to help an LLM answer questions over a small research corpus: a single-round Vector RAG system and an LLM-compiled markdown wiki. Both systems answered the same 13 questions over 24 papers using the same answer-generating model, and their answers were scored by blinded LLM judges. The wiki scored much better at connecting findings across papers, but its advantage in answer organization was not strong after judge adjustment. RAG met the preregistered test for single-fact lookup questions. The clean query-side cost result went against the expected wiki advantage: under the tested setup, the wiki used far more query tokens than RAG, so it could not recover any upfront build cost through cheaper queries. Two exploratory analyses changed how we interpret the result. First, claim-level citation checking favored the wiki: its cited pages more often…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
