TL;DR
This paper studies optimal strategies for purchasing noisy data samples over time to maximize cumulative payoff in dynamic decision-making scenarios, providing solutions and approximation guarantees for various cost structures.
Contribution
It introduces a model for sequential data purchase decisions, characterizes the optimal sampling policy, and offers approximation results when fixed costs are involved.
Findings
Batch sampling is shown to be optimal in the basic model.
A batching policy achieves a 2-approximation with fixed costs.
Optimal strategies depend on the cost structure and sampling flexibility.
Abstract
We consider a model where an agent has a repeated decision to make and wishes to maximize their total payoff. Payoffs are influenced by an action taken by the agent, but also an unknown state of the world that evolves over time. Before choosing an action each round, the agent can purchase noisy samples about the state of the world. The agent has a budget to spend on these samples, and has flexibility in deciding how to spread that budget across rounds. We investigate the problem of choosing a sampling algorithm that optimizes total expected payoff. For example: is it better to buy samples steadily over time, or to buy samples in batches? We solve for the optimal policy, and show that it is a natural instantiation of the latter. Under a more general model that includes per-round fixed costs, we prove that a variation on this batching policy is a 2-approximation.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Buying Data Over Time: Approximately Optimal Strategies for Dynamic Data-Driven Decisions· youtube
