Design of Experiments with Imputable Feature Data: An Entropy-Based Approach
Raj K. Velicheti, Amber Srivastava, Srinivasa M. Salapaka

TL;DR
This paper introduces a novel entropy-based framework that jointly optimizes experiment design and missing data imputation, significantly improving efficiency over traditional sequential methods especially in complex, data-scarce scenarios.
Contribution
It proposes a unified maximum-entropy approach that simultaneously addresses experiment selection and data imputation, overcoming limitations of sequential methods and NP-hard complexity.
Findings
Achieves over 60% improvement in cost value compared to benchmarks.
Effectively handles missing data in experiment design scenarios.
Flexible framework adaptable to various application constraints.
Abstract
Tactical selection of experiments to estimate an underlying model is an innate task across various fields. Since each experiment has costs associated with it, selecting statistically significant experiments becomes necessary. Classic linear experimental design deals with experiment selection so as to minimize (functions of) variance in estimation of regression parameter. Typically, standard algorithms for solving this problem assume that data associated with each experiment is fully known. This isn't often true since missing data is a common problem. For instance, remote sensors often miss data due to poor connection. Hence experiment selection under such scenarios is a widespread but challenging task. Though decoupling the tasks and using standard data imputation methods like matrix completion followed by experiment selection might seem a way forward, they perform sub-optimally since…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGaussian Processes and Bayesian Inference · Machine Learning and Data Classification · Machine Learning and Algorithms
