The Data Hydration Gap: A Formal Model of Underinvestment in General-Purpose Data Products Under Decentralized Governance
Gaston Besanson

TL;DR
This paper models how decentralized data ownership leads to underinvestment in cross-domain data products, causing organizational inefficiencies and a potential 'data mesh trap' where reusable data layers fail to develop.
Contribution
It introduces a formal game-theoretic model of decentralized data governance, analyzing the conditions leading to underinvestment and organizational welfare losses.
Findings
Generalized externalities increase with more domains and analytics value.
Under certain parameters, no reusable data layer emerges, trapping organizations.
Organizational welfare can be significantly reduced due to underinvestment.
Abstract
When organizations decentralize data product ownership, as in the data mesh paradigm, each domain team optimizes for its immediate analytical needs, underinvesting in the cross-domain generality that enables organization-wide reuse. We formalize this as a simultaneous-move game in which N domains choose quality (q) and generality (g). Generality creates positive externalities but is privately costly. The Nash equilibrium generality gap is increasing in the number of domains and the value of cross-domain analytics. Under plausible parameter configurations, a corner solution obtains in which no reusable silver layer emerges organically, a condition we term the data mesh trap. Technical debt from narrow products grows quadratically in N. An illustrative calibration suggests non-trivial organizational welfare losses under plausible enterprise parameters. We derive within-model conditions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
