Unsupervised machine learning framework for discriminating major variants of concern during COVID-19
Rohitash Chandra, Chaarvi Bansal, Mingyue Kang, Tom Blau, Vinti, Agarwal, Pranjal Singh, Laurence O. W. Wilson, Seshadri Vasan

TL;DR
This paper introduces an unsupervised machine learning framework that effectively discriminates and visualizes major COVID-19 variants based on genome sequences, aiding in understanding their mutational differences and potential emergence.
Contribution
The paper presents a novel combination of dimensionality reduction and clustering techniques for analyzing COVID-19 variants without labeled data.
Findings
Framework successfully distinguishes major variants
Effective visualization of mutational differences
Potential to identify emerging variants
Abstract
Due to the high mutation rate of the virus, the COVID-19 pandemic evolved rapidly. Certain variants of the virus, such as Delta and Omicron, emerged with altered viral properties leading to severe transmission and death rates. These variants burdened the medical systems worldwide with a major impact to travel, productivity, and the world economy. Unsupervised machine learning methods have the ability to compress, characterize, and visualize unlabelled data. This paper presents a framework that utilizes unsupervised machine learning methods to discriminate and visualize the associations between major COVID-19 variants based on their genome sequences. These methods comprise a combination of selected dimensionality reduction and clustering techniques. The framework processes the RNA sequences by performing a k-mer analysis on the data and further visualises and compares the results using…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicsvaccines and immunoinformatics approaches · COVID-19 diagnosis using AI · Machine Learning in Bioinformatics
