State-of-the-art in selection of variables and functional forms in multivariable analysis -- outstanding issues
Willi Sauerbrei, Aris Perperoglou, Matthias Schmid, Michal, Abrahamowicz, Heiko Becher, Harald Binder, Daniela Dunkler, Frank E. Harrell, Jr, Patrick Royston, Georg Heinze (for TG2 of the STRATOS initiative)

TL;DR
This paper reviews current methods for selecting variables and functional forms in multivariable analysis, highlighting gaps in evidence and proposing directions for future research to improve modeling practices.
Contribution
It provides a comprehensive overview of existing approaches, identifies key gaps in knowledge, and suggests priorities for future comparative studies in variable and functional form selection.
Findings
Lack of sufficient evidence to recommend specific methods
Identification of seven key topics needing further research
Illustration of modeling issues through medical literature examples
Abstract
How to select variables and identify functional forms for continuous variables is a key concern when creating a multivariable model. Ad hoc 'traditional' approaches to variable selection have been in use for at least 50 years. Similarly, methods for determining functional forms for continuous variables were first suggested many years ago. More recently, many alternative approaches to address these two challenges have been proposed, but knowledge of their properties and meaningful comparisons between them are scarce. To define a state-of-the-art and to provide evidence-supported guidance to researchers who have only a basic level of statistical knowledge many outstanding issues in multivariable modelling remain. Our main aims are to identify and illustrate such gaps in the literature and present them at a moderate technical level to the wide community of practitioners, researchers and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods and Inference · Advanced Statistical Methods and Models · Statistical Methods and Bayesian Inference
