Studying RNA homology and conservation with Infernal: from single sequences to RNA families
Lars Barquist, Sarah W. Burge, Paul P. Gardner

TL;DR
This paper presents a protocol for identifying ncRNA homologs and constructing covariance models using Infernal, enabling systematic characterization of ncRNA families from single sequences to broader classifications.
Contribution
It introduces a step-by-step iterative method for ncRNA homology detection and family construction using covariance models in the Infernal software.
Findings
Discovered a new family of MicA homologs in Xenorhabdus.
Demonstrated the protocol's effectiveness on bacterial small RNA MicA.
Provided a practical guide for ncRNA homology analysis.
Abstract
Emerging high-throughput technologies have led to a deluge of putative non-coding RNA (ncRNA) sequences identified in a wide variety of organisms. Systematic characterization of these transcripts will be a tremendous challenge. Homology detection is critical to making maximal use of functional information gathered about ncRNAs: identifying homologous sequence allows us to transfer information gathered in one organism to another quickly and with a high degree of confidence. ncRNA presents a challenge for homology detection, as the primary sequence is often poorly conserved and de novo secondary structure prediction and search remains difficult. This protocol introduces methods developed by the Rfam database for identifying "families" of homologous ncRNAs starting from single "seed" sequences using manually curated sequence alignments to build powerful statistical models of sequence and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · RNA and protein synthesis mechanisms · Microbial Community Ecology and Physiology
