SARS-Cov-2 RNA Sequence Classification Based on Territory Information
Jingwei Liu

TL;DR
This study develops a sequence SVM framework to classify SARS-CoV-2 RNA sequences based on geographic origin, achieving high accuracy for regional classification and demonstrating the potential for territory-based virus analysis.
Contribution
It introduces a novel sequence SVM model that projects RNA sequences into different dimensional spaces for territory-based classification of SARS-CoV-2.
Findings
3-class China classification accuracy: 82.45%
2-class China classification accuracy: 97.35%
worldwide 6-class classification accuracy: 30.30%
Abstract
CovID-19 genetics analysis is critical to determine virus type,virus variant and evaluate vaccines. In this paper, SARS-Cov-2 RNA sequence analysis relative to region or territory is investigated. A uniform framework of sequence SVM model with various genetics length from short to long and mixed-bases is developed by projecting SARS-Cov-2 RNA sequence to different dimensional space, then scoring it according to the output probability of pre-trained SVM models to explore the territory or origin information of SARS-Cov-2. Different sample size ratio of training set and test set is also discussed in the data analysis. Two SARS-Cov-2 RNA classification tasks are constructed based on GISAID database, one is for mainland, Hongkong and Taiwan of China, and the other is a 6-class classification task (Africa, Asia, Europe, North American, South American\& Central American, Ocean) of 7…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Image Retrieval and Classification Techniques · QR Code Applications and Technologies
Methods7 Fastest Ways to Call American Airlines Reservations Number (USA Guide) · Support Vector Machine
