Can Self Reported Symptoms Predict Daily COVID-19 Cases?
Parth Patwa, Viswanatha Reddy, Rohan Sukumaran, Sethuraman TV, and Eptehal Nashnoush, Sheshank Shankar, Rishemjit Kaur, Abhishek, Singh, Ramesh Raskar

TL;DR
This study develops machine learning models using self-reported symptoms from online surveys to estimate daily COVID-19 cases at state levels, showing potential for cost-effective epidemiological monitoring.
Contribution
It introduces localized and global machine learning models that predict COVID-19 cases from symptom data, highlighting the importance of state-specific symptom features.
Findings
Local models outperform global models in accuracy.
Self-reported symptoms can predict daily cases with a mean absolute error of 226.30.
Important symptoms vary significantly across states.
Abstract
The COVID-19 pandemic has impacted lives and economies across the globe, leading to many deaths. While vaccination is an important intervention, its roll-out is slow and unequal across the globe. Therefore, extensive testing still remains one of the key methods to monitor and contain the virus. Testing on a large scale is expensive and arduous. Hence, we need alternate methods to estimate the number of cases. Online surveys have been shown to be an effective method for data collection amidst the pandemic. In this work, we develop machine learning models to estimate the prevalence of COVID-19 using self-reported symptoms. Our best model predicts the daily cases with a mean absolute error (MAE) of 226.30 (normalized MAE of 27.09%) per state, which demonstrates the possibility of predicting the actual number of confirmed cases by utilizing self-reported symptoms. The models are developed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Data-Driven Disease Surveillance
