A Closed-Form EVSI Expression for a Multinomial Data-Generating Process
Adam Fleischhacker, Pak-Wing Fok, Mokshay Madiman, and Nan Wu

TL;DR
This paper provides a closed-form analytical expression for EVSI in multinomial data scenarios, enabling efficient valuation of data without prior distribution knowledge or extensive simulations.
Contribution
It introduces a novel closed-form EVSI expression for multinomial data with Dirichlet priors, generalizing beta-Binomial results and allowing pre-data computation.
Findings
Derives a tractable EVSI formula for multinomial data.
Generalizes beta-Binomial computations to multinomial cases.
Enables EVSI calculation without prior distribution knowledge.
Abstract
This paper derives analytic expressions for the expected value of sample information (EVSI), the expected value of distribution information (EVDI), and the optimal sample size when data consists of independent draws from a bounded sequence of integers. Due to challenges of creating tractable EVSI expressions, most existing work valuing data does so in one of three ways: 1) analytically through closed-form expressions on the upper bound of the value of data, 2) calculating the expected value of data using numerical comparisons of decisions made using simulated data to optimal decisions where the underlying data distribution is known, or 3) using variance reduction as proxy for the uncertainty reduction that accompanies more data. For the very flexible case of modelling integer-valued observations using a multinomial data-generating process with Dirichlet prior, this paper develops…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
