# SARS-CoV-2-induced phosphorylation and its pharmacotherapy backed by artificial intelligence and machine learning

**Authors:** Fouzia Qamar, Zubair Sharif, Jawaria Idrees, Asif Wasim, Sana Haider, Saad Salman

PMC · DOI: 10.2144/fsoa-2023-0112 · Future Science OA · 2024-05-15

## TL;DR

This study uses AI and machine learning to explore how SARS-CoV-2 manipulates phosphorylation processes and identifies potential drugs to target harmful viral mutations.

## Contribution

The paper introduces a novel integration of data mining and machine learning tools to identify SARS-CoV-2 phosphorylation targets and drug candidates.

## Key findings

- Eight missense variants were identified in SARS-CoV-2 viral sequences, with four predicted as potentially damaging.
- Promising therapeutic candidates like gilteritinib and sorafenib were identified to target phosphorylation pathways.
- Multiple predictive tools were used to enhance the robustness of variant and drug predictions.

## Abstract

Aims: To investigate the role of phosphorylation in SARS-CoV-2 infection, potential therapeutic targets and its harmful genetic sequences. Materials & Methods: Data mining techniques were employed to identify upregulated kinases responsible for proteomic changes induced by SARS-CoV-2. Spike and nucleocapsid proteins' sequences were analyzed using predictive tools, including SNAP2, MutPred2, PhD-SNP, SNPs&Go, MetaSNP, Predict-SNP and PolyPhen-2. Missense variants were identified using ensemble-based algorithms and homology/structure-based models like SIFT, PROVEAN, Predict-SNP and MutPred-2. Results: Eight missense variants were identified in viral sequences. Four damaging variants were found, with SNPs&Go and PolyPhen-2. Promising therapeutic candidates, including gilteritinib, pictilisib, sorafenib, RO5126766 and omipalisib, were identified. Conclusion: This research offers insights into SARS-CoV-2 pathogenicity, highlighting potential treatments and harmful variants in viral proteins.

This study explores the process called phosphorylation, which involves adding phosphate groups to certain proteins, influences the way the SARS-CoV-2 virus causes disease. The virus manipulates host enzymes to help it spread and survive. Researchers used data analysis techniques to identify the proteins that play a role in this process, aiming to find potential targets for treatments. They analyzed genetic sequences of key virus proteins and used various tools to predict harmful mutations. The study found several promising compounds that could be used to target the virus. Further research and experiments are needed to confirm their effectiveness as COVID-19 treatments.

This research explored the process called phosphorylation, which involves adding certain molecules to proteins, affects how the SARS-CoV-2 virus makes people sick. The virus uses our own cell's machinery to help it spread. Researchers used computer analysis to find out which proteins are involved in this process, hoping to find new ways to treat COVID-19. They studied the genetic code of important parts of the virus and used computer programs to predict if there were harmful changes in the code. They found some potential medicines that could be used to fight the virus and reduce its harm, but more research and testing are needed to be sure.

Phosphorylation, involving phosphate group addition to viral and host proteins, influences SARS-CoV-2 pathogenicity.

Study uses data mining to identify upregulated kinases responsible for proteomic changes induced by SARS-CoV-2.

Spike and nucleocapsid protein sequences of SARS-CoV-2 were analyzed using predictive and structure-based tools.

The tools categorized into groups such as protein structure analysis, consensus-based, homology-based, and supervised methods.

Eight missense variants identified in viral sequences, with one potentially damaging variant.

SNPs&Go and PolyPhen-2 predicted four distinct possibly damaging variants of SARS-COV-2.

Promising therapeutic candidates include gilteritinib, pictilisib, sorafenib, RO5126766 and omipalisib.

These compounds could mitigate SARS-CoV-2 virulence and pathogenicity by modulating phosphorylation pathways and addressing viral protein variants.

The current study is not but a comprehensive analysis crucial to understanding disruptions in spike and nucleocapsid protein's structure and function.

MutPred2 uses a grouping-based machine learning model trained on positive-unlabeled data.

MutPred2 demonstrates probabilities to interpret pathogenicity and mutation scores.

Enhances understanding of genetic variations' impact on protein structure and function.

SNAP2 employs a neural network-based classifier with 848 input nodes, 25 hidden nodes and two output nodes (neutral and deleterious).

The prediction analysis cover structural, functional, secondary structure, localization, transmembrane domains, catalytic functions, interactions, post-translational modifications, metal-binding and allosteric properties.

Drugs and kinase inhibitors mapped to specific phosphorylation sites.

Kinases and phosphatases are phospho-transferases, regulating cellular protein form and function.

Human genome contains approximately 156 phosphatases and 568 protein kinases, involved in biological regulation.

Dysregulation of kinases and phosphatases can have serious health consequences.

Mutations in these enzymes can lead to disorders, and inhibitors are used as pharmacological tools.

Protein Kinase Inhibitors (PKIs) and Phosphatase Inhibitors (PIs) are in experimental and clinical development, with various attributes and applications.

The use of multiple methods increases prediction robustness in the research.

Further investigation and experimental validation required to assess their efficacy as COVID-19 treatments.

## Linked entities

- **Proteins:** CHMP5 (charged multivesicular body protein 5)
- **Chemicals:** gilteritinib (PubChem CID 49803313), pictilisib (PubChem CID 17755052), sorafenib (PubChem CID 216239), RO5126766 (PubChem CID 16719221), omipalisib (PubChem CID 25167777)
- **Diseases:** SARS-CoV-2 (MONDO:0100096), COVID-19 (MONDO:0100096)

## Full-text entities

- **Genes:** S (surface glycoprotein) [NCBI Gene 43740568] {aka spike glycoprotein}, N (nucleocapsid phosphoprotein) [NCBI Gene 43740575]
- **Diseases:** SARS-CoV-2 (MESH:D000086382)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11140666/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC11140666/full.md

## References

55 references — full list in the complete paper: https://tomesphere.com/paper/PMC11140666/full.md

---
Source: https://tomesphere.com/paper/PMC11140666