Improved Core Genes Prediction for Constructing well-supported   Phylogenetic Trees in large sets of Plant Species

Bassam AlKindy; Huda Al-Nayyef; Christophe Guyeux; Jean-Fran\c{c}ois; Couchot; Michel Salomon; Jacques M. Bahi

arXiv:1504.06110·q-bio.GN·April 24, 2015

Improved Core Genes Prediction for Constructing well-supported Phylogenetic Trees in large sets of Plant Species

Bassam AlKindy, Huda Al-Nayyef, Christophe Guyeux, Jean-Fran\c{c}ois, Couchot, Michel Salomon, Jacques M. Bahi

PDF

TL;DR

This paper introduces an improved method for predicting core genes by combining multiple annotation tools, enhancing the accuracy of phylogenetic trees in plant species by producing well-supported, reliable core gene sets.

Contribution

It presents a novel quality test approach (QTA) that integrates NCBI and DOGMA annotations to improve core gene prediction for phylogenetic analysis.

Findings

01

Enhanced core gene quality leads to more accurate phylogenetic trees.

02

Well-defined core genes produce subtrees with strong bootstrap support.

03

Merged supertrees are more reliably supported in evolutionary studies.

Abstract

The way to infer well-supported phylogenetic trees that precisely reflect the evolutionary process is a challenging task that completely depends on the way the related core genes have been found. In previous computational biology studies, many similarity based algorithms, mainly dependent on calculating sequence alignment matrices, have been proposed to find them. In these kinds of approaches, a significantly high similarity score between two coding sequences extracted from a given annotation tool means that one has the same genes. In a previous work article, we presented a quality test approach (QTA) that improves the core genes quality by combining two annotation tools (namely NCBI, a partially human-curated database, and DOGMA, an efficient annotation algorithm for chloroplasts). This method takes the advantages from both sequence similarity and gene features to guarantee that the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.