TL;DR
MaxShapley is an efficient algorithm for fair credit attribution in generative search engines, enabling sustainable compensation for content providers with reduced computational costs.
Contribution
It introduces MaxShapley, a polynomial-time algorithm for fair attribution in generative search, improving efficiency over traditional Shapley value calculations.
Findings
MaxShapley achieves comparable attribution quality to exact Shapley values.
It reduces resource consumption by up to 9x compared to prior methods.
Evaluations on three multi-hop QA datasets demonstrate its effectiveness.
Abstract
Generative search engines based on large language models (LLMs) are replacing traditional search, fundamentally changing how information providers are compensated. To sustain this ecosystem, we need fair mechanisms to attribute and compensate content providers based on their contributions to generated answers. We introduce MaxShapley, an efficient algorithm for fair credit attribution in generative search pipelines that retrieve external sources before generation. MaxShapley is a special case of the celebrated Shapley value; it leverages a de-composable max-sum utility function to compute attributions with polynomial-time computation in the number of documents, as opposed to the exponential cost of Shapley values. We evaluate MaxShapley on three multi-hop QA datasets (HotPotQA, MuSiQUE, MS MARCO); MaxShapley achieves comparable attribution quality to exact Shapley computation, while…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Information Retrieval and Search Behavior · Expert finding and Q&A systems
