Collaborative prediction with expert advice
Paul Christiano

TL;DR
This paper introduces the first robust collaborative algorithm for prediction with expert advice, enabling multiple users to share experiences for faster learning while resisting manipulation by dishonest users.
Contribution
It presents a novel robust collaborative algorithm that maintains performance in the presence of adversarial users, extending traditional single-user prediction frameworks.
Findings
Algorithm matches pooling data performance with honest users
Performance remains stable even with many adversarial users
Highly robust to user variation and manipulation
Abstract
Many practical learning systems aggregate data across many users, while learning theory traditionally considers a single learner who trusts all of their observations. A case in point is the foundational learning problem of prediction with expert advice. To date, there has been no theoretical study of the general collaborative version of prediction with expert advice, in which many users face a similar problem and would like to share their experiences in order to learn faster. A key issue in this collaborative framework is robustness: generally algorithms that aggregate data are vulnerable to manipulation by even a small number of dishonest users. We exhibit the first robust collaborative algorithm for prediction with expert advice. When all users are honest and have similar tastes our algorithm matches the performance of pooling data and using a traditional algorithm. But our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Data Stream Mining Techniques · Stochastic Gradient Optimization Techniques
