Test-time Collective Prediction
Celestine Mendler-D\"unner, Wenshuo Guo, Stephen Bates, Michael I., Jordan

TL;DR
This paper introduces a decentralized test-time prediction method for multiple agents with private data, inspired by social consensus, which improves accuracy over traditional averaging techniques without data sharing.
Contribution
It proposes a novel decentralized mechanism for collective predictions that converges to inverse MSE weighting and includes a Jackknife-based error estimation method.
Findings
Outperforms classical model averaging in accuracy.
Effectively combines models with varying quality.
Achieves better results than weighted averaging with validation data.
Abstract
An increasingly common setting in machine learning involves multiple parties, each with their own data, who want to jointly make predictions on future test points. Agents wish to benefit from the collective expertise of the full set of agents to make better predictions than they would individually, but may not be willing to release their data or model parameters. In this work, we explore a decentralized mechanism to make collective predictions at test time, leveraging each agent's pre-trained model without relying on external validation, model retraining, or data pooling. Our approach takes inspiration from the literature in social science on human consensus-making. We analyze our mechanism theoretically, showing that it converges to inverse meansquared-error (MSE) weighting in the large-sample limit. To compute error bars on the collective predictions we propose a decentralized…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsOpinion Dynamics and Social Influence · Mobile Crowdsensing and Crowdsourcing · Complex Network Analysis Techniques
