TaoChongBao: a large-scale C. elegans missense variant database bridging worm and human genomes
Ming Li, Shimin Wang, Yongping Chai, Zhengyang Guo, Zi Wang, Zhe Chen, Kexin Lei, Jingyi Ke, Xingshen Huang, Guanghan Chen, Peng Huang, Kaiming Xu, Zijie Shen, Wei Li, Guangshuo Ou

TL;DR
TaoChongBao is a large database of C. elegans missense mutations that connects worm and human genomes for functional and comparative studies.
Contribution
The paper introduces TaoChongBao, a 20-fold expanded C. elegans missense variant database integrating AlphaMissense and ClinVar for comparative genomics.
Findings
Generated 12,069 viable C. elegans strains with 541,102 unique missense mutations.
TaoChongBao integrates mutation data with AlphaMissense and ClinVar for functional and clinical insights.
Database enables comparative analysis of conserved residues between C. elegans and human pathogenic sites.
Abstract
Using EMS mutagenesis, we generated and sequenced 12,069 viable C. elegans strains, identifying 541,102 missense mutations. We developed TaoChongBao, an open-access database integrating mutation data, AlphaMissense, and ClinVar, greatly expanding functional variant resources for functional residuomics and comparative analyses. We generated and sequenced 12,069 viable Caenorhabditis elegans strains produced by ethyl methanesulfonate mutagenesis, identifying 20,315,536 variants, including 541,102 unique missense mutations across 20,914 genes. Most strains exhibit resistance to the anti-nematode drug ivermectin, whereas some others display phenotypes like dumpy morphology, uncoordinated movement, multivulva formation, and blistered cuticle. To organize and visualize this resource, we developed TaoChongBao, an open-access database and strain repository that integrates C. elegans mutation…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenetics, Aging, and Longevity in Model Organisms · Evolution and Genetic Dynamics · Genomics and Rare Diseases
