Cooperate to Compete: Strategic Data Generation and Incentivization Framework for Coopetitive Cross-Silo Federated Learning
Thanh Linh Nguyen, Nguyen Van Huynh, and Quoc-Viet Pham

TL;DR
This paper introduces CoCoGen+, a strategic framework for data generation and incentives in coopetitive cross-silo federated learning, balancing collaboration benefits with competitive risks.
Contribution
It models data generation as a strategic game considering non-IID data, competition, and costs, and proposes an incentive mechanism to promote collaboration.
Findings
CoCoGen+ effectively balances data generation costs and competitive utility losses.
The framework improves social welfare and learning efficiency over baselines.
Non-IID data and competition intensity significantly influence organizational strategies.
Abstract
In data-sensitive domains such as healthcare, cross-silo federated learning (CFL) allows organizations to collaboratively train AI models without sharing raw data. However, practical CFL deployments are inherently coopetitive, in which organizations cooperate during model training while competing in downstream markets. In such settings, training contributions, including data volume, quality, and diversity, can improve the global model yet inadvertently strengthen rivals. This dilemma is amplified by non-IID data, which leads to asymmetric learning gains and undermines sustained participation. While existing competition-aware CFL and incentive-design approaches reward organizations based on marginal training contributions, they fail to account for the costs of strengthening competitors. In this paper, we introduce CoCoGen+, a coopetition-compatible data generation and incentivization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
