GiantHunter: Accurate detection of giant virus in metagenomic data using reinforcement-learning and Monte Carlo tree search
Fuchuan Qu, Cheng Peng, Jiaojiao Guan, Donglin Wang, Yanni, Sun, Jiayu Shang

TL;DR
GiantHunter is a novel reinforcement learning tool that accurately detects giant viruses in metagenomic data, improving precision and reducing computational costs, and reveals ecological insights from environmental samples.
Contribution
This work introduces GiantHunter, a reinforcement learning-based method utilizing Monte Carlo tree search for improved NCLDV detection in metagenomic datasets, outperforming existing approaches.
Findings
GiantHunter achieves 10% higher F1-score than previous methods.
It reduces computational costs by 90%.
Application to environmental samples reveals NCLDV diversity differences near the Three Gorges Dam.
Abstract
Motivation: Nucleocytoplasmic large DNA viruses (NCLDVs) are notable for their large genomes and extensive gene repertoires, which contribute to their widespread environmental presence and critical roles in processes such as host metabolic reprogramming and nutrient cycling. Metagenomic sequencing has emerged as a powerful tool for uncovering novel NCLDVs in environmental samples. However, identifying NCLDV sequences in metagenomic data remains challenging due to their high genomic diversity, limited reference genomes, and shared regions with other microbes. Existing alignment-based and machine learning methods struggle with achieving optimal trade-offs between sensitivity and precision. Results: In this work, we present GiantHunter, a reinforcement learning-based tool for identifying NCLDVs from metagenomic data. By employing a Monte Carlo tree search strategy, GiantHunter dynamically…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAnimal Virus Infections Studies · Machine Learning in Bioinformatics
