A comparative analysis of several multivariate zero-inflated and zero-modified models with applications in insurance
Pengcheng Zhang, David Pitt, Xueyuan Wu

TL;DR
This paper compares multivariate zero-inflated and zero-modified models for insurance claim data, addressing excess and absence of zeros, and demonstrates their application with real insurance datasets.
Contribution
It introduces and compares two approaches for modeling multivariate claim data with zero-inflation and zero-deflation features, including a novel zero-modified model.
Findings
Multivariate zero-inflated models effectively handle excess zeros in insurance data.
Zero-modified models are suitable for data with missing common zeros due to incomplete records.
The proposed models perform well on real insurance datasets from Spain.
Abstract
Claim frequency data in insurance records the number of claims on insurance policies during a finite period of time. Given that insurance companies operate with multiple lines of insurance business where the claim frequencies on different lines of business are often correlated, multivariate count modeling with dependence for claim frequency is therefore essential. Due in part to the operation of bonus-malus systems, claims data in automobile insurance are often characterized by an excess of common zeros. This feature is referred to as multivariate zero-inflation. In this paper, we establish two ways of dealing with this feature. The first is to use a multivariate zero-inflated model, where we artificially augment the probability of common zeros based on standard multivariate count distributions. The other is to apply a multivariate zero-modified model, which deals with the common zeros…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Risk Models · Statistical Distribution Estimation and Applications · Statistical Methods and Bayesian Inference
