Observed and estimated prevalence of Covid-19 in Italy: Is it possible to estimate the total cases from medical swabs data?
Francesca Bassi (Department of Statistical Sciences, University of, Padova, Italy), Giuseppe Arbia (Department of Statistical Sciences, Catholic, University of the Sacred Hearth, Milano, Italy), Pietro Demetrio Falorsi, (Italian National Statistical Institute)

TL;DR
This paper proposes a method to estimate the true prevalence of Covid-19 in Italy by reweighting official testing data to account for sampling biases, especially the underrepresentation of asymptomatic cases.
Contribution
It introduces a reweighting approach using post-stratification by age and gender to improve prevalence and lethality estimates from official Covid-19 data.
Findings
More accurate prevalence estimates achieved
Adjusted lethality rate calculations provided
Method enhances representativeness of sample data
Abstract
During the current Covid-19 pandemic in Italy, official data are collected with medical swabs following a pure convenience criterion which, at least in an early phase, has privileged the exam of patients showing evident symptoms. However, there are evidences of a very high proportion of asymptomatic patients (e. g. Aguilar et al., 2020; Chugthai et al, 2020; Li, et al., 2020; Mizumoto et al., 2020a, 2020b and Yelin et al., 2020). In this situation, in order to estimate the real number of infected (and to estimate the lethality rate), it should be necessary to run a properly designed sample survey through which it would be possible to calculate the probability of inclusion and hence draw sound probabilistic inference. Some researchers proposed estimates of the total prevalence based on various approaches, including epidemiologic models, time series and the analysis of data collected in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 epidemiological studies · Data-Driven Disease Surveillance · COVID-19 diagnosis using AI
