Inference from Selectively Disclosed Data
Ying Gao

TL;DR
This paper models how a sender with large data can strategically disclose or withhold information to persuade a receiver, revealing optimal imitation strategies and their impact on welfare and disclosure outcomes.
Contribution
It introduces a model for selective data disclosure using imitation strategies and characterizes the resulting partial-pooling equilibria.
Findings
Selective disclosure benefits large data holders.
Voluntary disclosure can reduce sender welfare with limited data.
Imitation strategies maximize distinguishability of higher states.
Abstract
We consider the disclosure problem of a sender with a large data set of hard evidence who wants to persuade a receiver to take higher actions. Because the receiver will make inferences based on the distribution of the data they see, the sender has an incentive to drop observations to mimic the distributions that would be observed under better states. We predict which observations the sender discloses using a model that approximates large datasets with a continuum of data. It is optimal for the sender to play an imitation strategy, under which they submit evidence that imitates the natural distribution under some more desirable target state. We characterize the partial-pooling outcomes under these imitation strategies, and show that they are supported by data on the outcomes that maximally distinguish higher states. Relative to full information, the equilibrium with voluntary disclosure…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExperimental Behavioral Economics Studies · Game Theory and Applications · Auction Theory and Applications
