Efficient Analysis of COVID-19 Clinical Data using Machine Learning Models
Sarwan Ali, Yijing Zhou, Murray Patterson

TL;DR
This paper presents a machine learning approach for analyzing COVID-19 clinical data, using efficient feature selection and encoding to achieve over 90% prediction accuracy, aiding real-time decision-making.
Contribution
It introduces a simple encoding and feature selection method for COVID-19 clinical data, improving prediction accuracy and attribute importance analysis.
Findings
Achieved over 90% prediction accuracy in most cases
Identified key attributes influencing patient outcomes
Demonstrated efficient data encoding and feature selection methods
Abstract
Because of the rapid spread of COVID-19 to almost every part of the globe, huge volumes of data and case studies have been made available, providing researchers with a unique opportunity to find trends and make discoveries like never before, by leveraging such big data. This data is of many different varieties, and can be of different levels of veracity e.g., precise, imprecise, uncertain, and missing, making it challenging to extract important information from such data. Yet, efficient analyses of this continuously growing and evolving COVID-19 data is crucial to inform -- often in real-time -- the relevant measures needed for controlling, mitigating, and ultimately avoiding viral spread. Applying machine learning based algorithms to this big data is a natural approach to take to this aim, since they can quickly scale to such data, and extract the relevant information in the presence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Artificial Intelligence in Healthcare
MethodsFeature Selection
