ZiMM: a deep learning model for long term and blurry relapses with non-clinical claims data
Anastasiia Kabeshova, Yiyang Yu, Bertrand Lukacs, Emmanuel Bacry,, St\'ephane Ga\"iffas

TL;DR
This paper introduces ZiMM, a deep learning model designed to predict long-term, blurry relapses from non-clinical claims data, demonstrating improved accuracy over existing methods in a large population-based study.
Contribution
The paper presents ZiMM, a novel zero-inflated mixture model combined with an end-to-end deep learning architecture for modeling complex relapse patterns from claims data.
Findings
ZiMM ED outperforms baseline models in relapse prediction accuracy.
The approach effectively handles sparse, irregular, and heterogeneous health event data.
Minimal preprocessing is required for large-scale claims datasets.
Abstract
This paper considers the problems of modeling and predicting a long-term and ``blurry'' relapse that occurs after a medical act, such as a surgery. The relapse is observed only indirectly, in a ``blurry'' fashion, through longitudinal prescriptions of drugs over a long period of time after the medical act. We introduce a new model, called ZiMM (Zero-inflated Mixture of Multinomial distributions) in order to capture long-term and blurry relapses. On top of it, we build an end-to-end deep-learning architecture called ZiMM Encoder-Decoder (ZiMM ED) that can learn from the complex, irregular, highly heterogeneous and sparse patterns of health events that are observed through a claims-only database. ZiMM ED is applied on a ``non-clinical'' claims database, that contains only timestamped reimbursement codes for drug purchases, medical procedures and hospital diagnoses, the only available…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
