Rethinking the handling of method failure in comparison studies
Milena W\"unsch, Moritz Herrmann, Elisa Noltenius, Mattia Mohr, Tim P. Morris, Anne-Laure Boulesteix

TL;DR
This paper critically examines how method failures are handled in comparison studies, highlighting common pitfalls and proposing practical, realistic strategies to improve the reliability and interpretability of such research.
Contribution
It offers a comprehensive analysis of existing handling approaches and introduces new guidelines and fallback strategies for managing method failures in comparison studies.
Findings
Discarding failure data sets is often inappropriate.
Imputation methods can be misleading in this context.
Proposed fallback strategies better reflect real-world scenarios.
Abstract
Comparison studies in methodological research are intended to compare methods in an evidence-based manner to help data analysts select a suitable method for their application. To provide trustworthy evidence, they must be carefully designed, implemented, and reported, especially given the many decisions made in planning and running. A common challenge in comparison studies is to handle the "failure" of one or more methods to produce a result for some (real or simulated) data sets, such that their performances cannot be measured in those instances. Despite an increasing emphasis on this topic in recent literature (focusing on non-convergence as a common manifestation), there is little guidance on proper handling and interpretation, and reporting of the chosen approach is often neglected. This paper aims to fill this gap and offers practical guidance on handling method failure in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Methods in Clinical Trials · Pesticide Residue Analysis and Safety
