METRIK: Measurement-Efficient Randomized Controlled Trials using Transformers with Input Masking
Sayeri Lala (1), Niraj K. Jha (1) ((1) Princeton University,, Princeton, USA)

TL;DR
METRIK introduces a novel Transformer-based framework to optimize planned missing data designs in clinical RCTs, reducing data collection costs while maintaining or improving imputation accuracy and sampling efficiency.
Contribution
It presents the first method to compute a task-specific PMD from limited prior data using a learnable masking layer optimized with a Transformer imputer.
Findings
Increases sampling efficiency in real-world RCT datasets.
Enhances imputation performance over standard methods.
Reduces need for manual metric removal in trials.
Abstract
Clinical randomized controlled trials (RCTs) collect hundreds of measurements spanning various metric types (e.g., laboratory tests, cognitive/motor assessments, etc.) across 100s-1000s of subjects to evaluate the effect of a treatment, but do so at the cost of significant trial expense. To reduce the number of measurements, trial protocols can be revised to remove metrics extraneous to the study's objective, but doing so requires additional human labor and limits the set of hypotheses that can be studied with the collected data. In contrast, a planned missing design (PMD) can reduce the amount of data collected without removing any metric by imputing the unsampled data. Standard PMDs randomly sample data to leverage statistical properties of imputation algorithms, but are ad hoc, hence suboptimal. Methods that learn PMDs produce more sample-efficient PMDs, but are not suitable for RCTs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInnovative Microfluidic and Catalytic Techniques Innovation · Cell Image Analysis Techniques · 3D Printing in Biomedical Research
MethodsAttention Is All You Need · Sparse Evolutionary Training · Softmax · Layer Normalization · Byte Pair Encoding · Label Smoothing · Position-Wise Feed-Forward Layer · Dropout · Adam · Linear Layer
