Overstuffed sandwiches and separation anxiety: finite-sample variance estimation for penalized GEE with near-separated binary data

Awan Afiaz; M. Shafiqur Rahman

arXiv:2604.18863·stat.ME·April 22, 2026

TL;DR

This paper develops a new finite-sample variance estimator for penalized GEE in near-separated binary data, improving inference accuracy in small samples.

Contribution

It introduces a novel variance correction method, $\tilde{V}_{AR}$, that accounts for finite-sample bias and leverage effects, outperforming existing corrections.

Findings

01

$\tilde{V}_{AR}$ provides conservative or near-nominal error control in small samples.

02

Standard corrections often overadjust or are anti-conservative in low-event, small-$N$ settings.

03

The proposed method is effective even with $N=10$ and unbalanced designs.

Abstract

Penalized generalized estimating equations (PGEE) stabilize point estimation for longitudinal binary data under near-separation, but inference still depends on how the sandwich variance is corrected. Existing corrections for PGEE can overadjust in high-leverage directions, require restrictive pooling assumptions, or add global regularization without explaining the bias. We establish first-order asymptotics for PGEE along convergent interior-root sequences and derive a matrix characterization of the parameter-specific overcorrection induced by full leverage adjustment. Finite-sample calibration is limited by both mean bias and the variability of leverage-corrected variance estimates. We propose $\hat{V}_{AR}$, which keeps the score-level leverage correction and adds a finite-sample upward translation dominated at first order by the finite-population factor, with a smaller centering term.…
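To make the terms "sandwich variance" and "full leverage adjustment" concrete, the following is a minimal sketch and not the estimator proposed in the paper, whose formula is not given in the abstract. It assumes an unpenalized logistic GEE with an independence working correlation, simulated data, and the full cluster-leverage adjustment of residuals commonly attributed to Mancl and DeRouen. The names `fit_logistic`, `sandwich`, and the simulation settings are hypothetical and chosen only for illustration.

```python
import numpy as np

def fit_logistic(X, y, n_iter=25):
    """Newton-Raphson for logistic regression (GEE with independence working correlation)."""
    beta = np.zeros(X.shape[1])
    for _ in range(n_iter):
        mu = 1.0 / (1.0 + np.exp(-X @ beta))
        w = mu * (1.0 - mu)
        beta = beta + np.linalg.solve(X.T @ (w[:, None] * X), X.T @ (y - mu))
    return beta

def sandwich(X, y, cluster, beta, leverage=False):
    """Cluster-robust sandwich variance; optionally apply the full leverage adjustment."""
    mu = 1.0 / (1.0 + np.exp(-X @ beta))
    w = mu * (1.0 - mu)
    resid = y - mu
    bread_inv = np.linalg.inv(X.T @ (w[:, None] * X))
    meat = np.zeros((X.shape[1], X.shape[1]))
    for g in np.unique(cluster):
        idx = cluster == g
        Xi, ri, wi = X[idx], resid[idx], w[idx]
        if leverage:
            # Cluster leverage block H_ii = W_i X_i B^{-1} X_i^T (logit link, independence)
            Hi = (wi[:, None] * Xi) @ bread_inv @ Xi.T
            # Adjusted residuals (I - H_ii)^{-1} r_i
            ri = np.linalg.solve(np.eye(len(ri)) - Hi, ri)
        ui = Xi.T @ ri  # cluster-level score contribution
        meat += np.outer(ui, ui)
    return bread_inv @ meat @ bread_inv

# Simulated illustration (assumed settings): 40 clusters of 4 binary outcomes each.
rng = np.random.default_rng(0)
n_clusters, n_per = 40, 4
cluster = np.repeat(np.arange(n_clusters), n_per)
x = rng.normal(size=n_clusters * n_per)
X = np.column_stack([np.ones_like(x), x])
p_true = 1.0 / (1.0 + np.exp(-(-0.5 + 0.8 * x)))
y = rng.binomial(1, p_true).astype(float)

beta_hat = fit_logistic(X, y)
V_plain = sandwich(X, y, cluster, beta_hat, leverage=False)
V_lev = sandwich(X, y, cluster, beta_hat, leverage=True)
print("uncorrected SE:      ", np.sqrt(np.diag(V_plain)))
print("leverage-adjusted SE:", np.sqrt(np.diag(V_lev)))
```

Comparing the two sets of standard errors on data with few clusters or few events gives a feel for how strongly the leverage adjustment can move the variance estimate, which is the behavior the abstract describes as potential overadjustment in high-leverage directions.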
