Statistical Collusion by Collectives on Learning Platforms

Etienne Gauthier; Francis Bach; Michael I. Jordan

arXiv:2502.04879·stat.ML·May 27, 2025

Statistical Collusion by Collectives on Learning Platforms

Etienne Gauthier, Francis Bach, Michael I. Jordan

PDF

Open Access 2 Repos

TL;DR

This paper develops a theoretical and algorithmic framework to understand how collectives can coordinate data manipulation to influence learning platforms, highlighting the risks and strategies involved.

Contribution

It introduces a novel framework for analyzing collective data manipulation, including assessments and coordination algorithms, with experimental validation in product evaluation.

Findings

01

Collectives can strategically influence learning platforms through coordinated data submission.

02

The framework enables assessment of impact and development of implementable coordination algorithms.

03

Experimental results demonstrate the effectiveness of the proposed methods.

Abstract

As platforms increasingly rely on learning algorithms, collectives may form and seek ways to influence these platforms to align with their own interests. This can be achieved by coordinated submission of altered data. To evaluate the potential impact of such behavior, it is essential to understand the computations that collectives must perform to impact platforms in this way. In particular, collectives need to make a priori assessments of the effect of the collective before taking action, as they may face potential risks when modifying their data. Moreover they need to develop implementable coordination algorithms based on quantities that can be inferred from observed data. We develop a framework that provides a theoretical and algorithmic treatment of these issues and present experimental results in a product evaluation domain.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical and Computational Modeling

MethodsALIGN