Predicting the hosts of prokaryotic viruses using GCN-based semi-supervised learning
Jiayu Shang, Yanni Sun

TL;DR
This paper introduces HostG, a semi-supervised GCN-based model that improves virus host prediction accuracy by leveraging virus-virus and virus-host similarities, outperforming existing methods on simulated and real data.
Contribution
The work presents a novel semi-supervised GCN model, HostG, that effectively predicts hosts for novel viruses using a knowledge graph and minimizes calibration error during training.
Findings
HostG outperforms existing methods in host prediction accuracy.
It effectively predicts hosts from new taxa.
The model demonstrates robustness on both simulated and real datasets.
Abstract
Background: Prokaryotic viruses, which infect bacteria and archaea, are the most abundant and diverse biological entities in the biosphere. To understand their regulatory roles in various ecosystems and to harness the potential of bacteriophages for use in therapy, more knowledge of viral-host relationships is required. High-throughput sequencing and its application to the microbiome have offered new opportunities for computational approaches for predicting which hosts particular viruses can infect. However, there are two main challenges for computational host prediction. First, the empirically known virus-host relationships are very limited. Second, although sequence similarity between viruses and their prokaryote hosts have been used as a major feature for host prediction, the alignment is either missing or ambiguous in many cases. Thus, there is still a need to improve the accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBacteriophages and microbial interactions · Genomics and Phylogenetic Studies · RNA and protein synthesis mechanisms
MethodsGraph Convolutional Network
