A putative scenario of how de novo protein-coding genes originate in the Saccharomyces cerevisiae lineage
Tetsushi Yada, Takeaki Taniguchi

TL;DR
This paper outlines a possible scenario for how new protein-coding genes arise from non-genic DNA in yeast, using bioinformatic analysis of ancestral sequences.
Contribution
The first reported scenario for de novo origination of protein-coding genes, using ancestral sequence analysis to capture rapid evolutionary events.
Findings
De novo genes may originate in GC-rich regions with neutral mutations.
ORF extension and Kozak sequence recruitment mark later stages of gene birth.
Mutation specificity increases as the de novo gene scenario progresses.
Abstract
Novel protein-coding genes were considered to be born by re-organization of pre-existing genes, such as gene duplication and gene fusion. However, recent progress of genome research revealed that more protein-coding genes than expected were born de novo, that is, gene origination by accumulating mutations in non-genic DNA sequences. Nonetheless, the in-depth process (scenario) for de novo origination is not well understood. We have conceived bioinformatic analysis for sketching a scenario for de novo origination of protein-coding genes. For each de novo protein-coding gene, we firstly identified an edge of a given phylogenetic tree where the gene was born based on parsimony. Then, from a multiple sequence alignment of the de novo gene and its orthologous regions, we constructed ancestral DNA sequences of the gene corresponding to both end nodes of the edge. We finally revealed…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRNA and protein synthesis mechanisms · Fungal and yeast genetics research · Genomics and Phylogenetic Studies
