A-Optimal Split Questionnaire Designs for Multivariate Continuous Variables
Dae-Gyu Jang, Zhengyuan Zhu, and Cindy Yu

TL;DR
This paper introduces a methodology for designing optimal split questionnaire designs for multivariate continuous variables, leveraging survey data and optimality criteria to minimize information loss and improve survey efficiency.
Contribution
It develops a probabilistic and optimality-based approach to find optimal split questionnaire designs, including theoretical insights and practical algorithms.
Findings
Local and global OSQDs outperform baseline designs in simulations.
The method effectively utilizes prior survey data to optimize questionnaire splits.
Theoretical analysis links correlation structures to optimal designs.
Abstract
A split questionnaire design (SQD), an alternative to full questionnaires, can reduce the response burden and improve survey quality. One can design a split questionnaire to reduce the information loss from missing data induced by the split questionnaire. This study develops a methodology for finding optimal SQD (OSQD) for multivariate continuous variables, applying a probabilistic design and optimality criterion approach. Our method employs previous survey data to compute the Fisher information matrix and A-optimality criterion to find OSQD for the current survey study. We derive theoretical findings on the relationship between the correlation structure and OSQD and the robustness of local OSQD. We conduct simulation studies to compare local and two global OSQDs; mini-max OSQD and Bayes OSQD) to baselines. We also apply our method to the 2016 Pet Demographic Survey (PDS) data. In both…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurvey Sampling and Estimation Techniques · Survey Methodology and Nonresponse · Statistical Methods and Bayesian Inference
