Data Fusion with Distributional Equivalence Test-then-pool

Linying Yang; Xing Liu; Robin J. Evans

arXiv:2603.11867·stat.ME·March 17, 2026

Data Fusion with Distributional Equivalence Test-then-pool

Linying Yang, Xing Liu, Robin J. Evans

PDF

Open Access

TL;DR

This paper introduces a new test-then-pool framework for data fusion in clinical trials that uses kernel two-sample testing and equivalence testing to control Type-I error and improve power when combining historical and current control data.

Contribution

It develops a novel TTP method employing MMD and equivalence testing with bootstrap and permutation procedures, ensuring valid inference and higher power in data fusion.

Findings

01

Achieves higher power than standard TTP methods.

02

Maintains nominal Type-I error rate.

03

Provides a flexible criterion for pooling controls.

Abstract

Randomized controlled trials (RCTs) are the gold standard for causal inference, yet practical constraints often limit the size of the concurrent control arm. Borrowing control data from previous trials offers a potential efficiency gain, but naive borrowing can induce bias when historical and current populations differ. Existing test-then-pool (TTP) procedures address this concern by testing for equality of control outcomes between historical and concurrent trials before borrowing; however, standard implementations may suffer from reduced power or inadequate control of the Type-I error rate. We develop a new TTP framework that fuses control arms while rigorously controlling the Type-I error rate of the final treatment effect test. Our method employs kernel two-sample testing via maximum mean discrepancy (MMD) to capture distributional differences, and equivalence testing to avoid…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Statistical Methods in Clinical Trials · Bayesian Modeling and Causal Inference