Gene model for the ortholog of mts in Drosophila mojavensis
Megan E. Lawson, Clairine I. S. Larsen, Madeline McAbee, Scott Tanner, Jeffrey S. Thompson, Chinmay P. Rele, Laura K Reed

TL;DR
This paper describes the gene model for an ortholog of the mts gene in Drosophila mojavensis, part of a study on the evolution of a signaling pathway.
Contribution
The paper provides a new gene model for the mts ortholog in Drosophila mojavensis for evolutionary studies.
Findings
A gene model for the mts ortholog in Drosophila mojavensis was characterized.
The model is part of a dataset for studying the evolution of the IIS pathway in Drosophila.
Abstract
Gene model for the ortholog of microtubule star ( mts ) in the D. mojavensis May 2011 (Agencourt dmoj_caf1/DmojCAF1) Genome Assembly (GenBank Accession: GCA_000005175.1 ) of Drosophila mojavensis . This ortholog was characterized as part of a developing dataset to study the evolution of the Insulin/insulin-like growth factor signaling pathway (IIS) across the genus Drosophila using the Genomics Education Partnership gene annotation protocol for Course-based Undergraduate Research Experiences.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1|
"In this GEP CURE protocol students use web-based tools to manually annotate genes in non-model
“The particular gene ortholog described here was characterized as part of a developing dataset to study the evolution of the Insulin/insulin-like growth factor signaling pathway (IIS) across the genus
“
“The
|
- —National Institutes of Health Clinical Center (United States)https://ror.org/04vfsmv21
- —National Science Foundation (United States)https://ror.org/021nxhr62
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Chromatin Dynamics · Evolutionary Algorithms and Applications · Plant Molecular Biology Research
Description
**: **
The model presented here is the ortholog of * mts * in the May 2011 (Agencourt dmoj_caf1/DmojCAF1) assembly of * D. mojavensis * ( GCA_000005175.1 ) and corresponds to the Gnomon Peptide ID ( XM_002001773 ) predicted model in * D. mojavensis * ( LOC6575805 ) . This gene model is based on RNA-Seq data from * D. mojavensis * (Chen et al . , 2014; SRP006203 * ) and the mts * in * D. melanogaster * from FB2023_03 ( GCA_000001215.4 ; Larkin et al., 2021; Gramates et al., 2022; Jenkins et al., 2022).
** Synteny **
mts * occurs on chromosome 2L in * D. melanogaster * and is flanked upstream by * Rack1 * and * CG7115 * and downstream by * CG14537 * and * CG14535 . * We determined that the putative ortholog of * mts * is found on scaffold 6500 ( CH933807.1 ) in * D. mojavensis * (CAF1 assembly GCA_000005175.1 ) with LOC6575805 ( XP_002001809.1 ) (via tblastn search with an e-value of 0.0 and percent identity of 99.68%). It is flanked upstream by LOC6575807 ( XP_002001811.1 ) and LOC6575806 ( XP_002001810.1 ), which correspond to * Nubp1 * FBgn0032597 and * Prosbeta4 * in * D. melanogaster * with e-values of 1e-78 and 1e-123 respectively and percent identities of 86.13% and 80.40% respectively, as determined by blastp . It is flanked downstream by LOC6575804 ( XP_002001808.1 ) and LOC6575803 ( XP_032584898.1 ), which correspond to * CG14537 * and * CG14535 * in * D. melanogaster * with e-values of 8e-61 and 0.0 respectively and percent identities of 50.47% and 75.04% respectively as determined by blastp ( Figure 1A, A ltschul et al., 1990). We believe this is the correct ortholog assignment for * mts * in D. mojavenis because the tblastn hit for * mts * is very high quality, as it has a 99.68% identity and an e-value of 0.0. Additionally, the order of the downstream genes is conserved, although it is important to note that * CG14535 * is on the same strand as * mts * in * D. melanogaster * but is on the opposite strand relative to * mts * in * D. mojavensis , * so local synteny is not completely conserved in the downstream region. This inversion is also present in the sister species, * D. arizonae . * The upstream region is also different between the two species.
** Protein Model **
mts * in * D. mojavensis * has one protein coding isoform (mts-PA, mts-PB, and mts-PC) ( Figure 1B ), encoded by mRNAs mts-RA, mts-RB, and mts-RC, which differ in their UTRs, that contain four CDSs. This is the same relative to the ortholog in * D. melanogaster * , which also has one protein coding isoform, encoded by four CDSs. The sequence of * mts * in * D. mojavensis * has 99.68% identity with the * mts * in * D. melanogaster * as determined by
- blastp* ( Figure 1C ). The coordinates of the curated gene models can be found in NCBI at GenBank/BankIt using the accessions BK063009 , BK063010 , and BK063011 . These data are also available in Extended Data files below, which are archived in CaltechData.
Methods
Detailed methods including algorithms, database versions, and citations for the complete annotation process can be found in Rele et al. (2023). Briefly, students use the GEP instance of the UCSC Genome Browser v.435 (https://gander.wustl.edu ; Kent WJ et al., 2002; Navarro Gonzalez et al., 2021) to examine the genomic neighborhood of their reference IIS gene in the * D. melanogaster * genome assembly (Aug. 2014; BDGP Release 6 + ISO1 MT/dm6). Students then retrieve the protein sequence for the * D. melanogaster * reference gene for a given isoform and run it using tblastn against their target * Drosophila * species genome assembly ( * D. mojavensis * ( GCA_000005175.1 )) on the NCBI BLAST server (https://blast.ncbi.nlm.nih.gov/Blast.cgi, Altschul et al., 1990) to identify potential orthologs. To validate the potential ortholog, students compare the local genomic neighborhood of their potential ortholog with the genomic neighborhood of their reference gene in * D. melanogaster * . This local synteny analysis includes at minimum the two upstream and downstream genes relative to their putative ortholog. They also explore other sets of genomic evidence using multiple alignment tracks in the Genome Browser, including BLAT alignments of RefSeq Genes, Spaln alignment of * D. melanogaster * proteins, multiple gene prediction tracks (e.g., GeMoMa, Geneid, Augustus), and modENCODE RNA-Seq from the target species. Detailed explanation of how these lines of genomic evidenced are leveraged by students in gene model development are described in Rele et al. (2023). Genomic structure information (e.g., CDSs, CDS number and boundaries, number of isoforms) for the * D. melanogaster * reference gene is retrieved through the Gene Record Finder (https://gander.wustl.edu/~wilson/dmelgenerecord/index.html; Rele et al., 2023). Approximate splice sites within the target gene are determined using tblastn using the CDSs from the * D. melanogaster * reference gene. Coordinates of CDSs are then refined by examining aligned modENCODE RNA-Seq data, and by applying paradigms of molecular biology such as identifying canonical splice site sequences and ensuring the maintenance of an open reading frame across hypothesized splice sites. Students then confirm the biological validity of their target gene model using the Gene Model Checker (https://gander.wustl.edu/~wilson/dmelgenerecord/index.html; Rele et al., 2023), which compares the structure and translated sequence from their hypothesized target gene model against the * D. melanogaster * reference gene model. At least two independent models for this gene were generated by students under mentorship of their faculty course instructors. These models were then reconciled by a third independent researcher mentored by the project leaders to produce the final model presented here. Note: comparison of 5' and 3' UTR sequence information is not included in this GEP CURE protocol.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Altschul SF Gish W Miller W Myers EW Lipman DJ 1990105 Basic local alignment search tool.J Mol Biol 21530022-283640341010.1016/S 0022-2836(05)80360-22231712 · doi ↗ · pubmed ↗
- 2Casso DJ Liu S Iwaki DD Ogden SK Kornberg TB 200823 A screen for modifiers of hedgehog signaling in Drosophila melanogaster identifies swm and mts.Genetics 17830016-67311399141310.1534/genetics.107.08163818245841 PMC 2278107 · doi ↗ · pubmed ↗
- 3Yuh Chew L He J Wong JJL Li S Yu F 2022714 AMPK activates the Nrf 2-Keap 1 pathway to govern dendrite pruning via the insulin pathway in Drosophila.Development 149140950-199110.1242/dev.20053635735111 · doi ↗ · pubmed ↗
- 4Congleton H Kiser CA Colom Diaz PA Schlichting E Walton DA Long LJ Reed LK Martinez-Cruzado JC Rele CP 20221115 Drosophila mojavensis - chico.Micro Publ Biol 202210.17912/micropub.biology.00067736468157 PMC 9709638 · doi ↗ · pubmed ↗
- 5Drosophila 12 Genomes Consortium. Clark AG Eisen MB Smith DR Bergman CM Oliver B Markow TA Kaufman TC Kellis M Gelbart W Iyer VN Pollard DA Sackton TB Larracuente AM Singh ND Abad JP Abt DN Adryan B Aguade M Akashi H Anderson WW Aquadro CF Ardell DH Arguello R Artieri CG Barbash DA Barker D Barsanti P Batterham P Batzoglou S Begun D Bhutkar A Blanco E Bosak SA Bradley RK Brand AD Brent MR Brooks AN Brown RH Butlin RK Caggese C Calvi BR Bernardo de Carvalho A Caspi A Castrezana S Celniker SE Chang JL Chapple C Chatterji S Chinwalla A Civetta A C · doi ↗ · pubmed ↗
- 6Durando CM Baker RH Etges WJ Heed WB Wasserman M De Salle R 200081 Phylogenetic analysis of the repleta species group of the genus Drosophila using multiple sources of characters.Mol Phylogenet Evol 1621055-790329630710.1006/mpev.2000.082410942616 · doi ↗ · pubmed ↗
- 7Gil-Ranedo J Gonzaga E Jaworek KJ Berger C Bossing T Barros CS 201964 STRIPAK Members Orchestrate Hippo and Insulin Receptor Signaling to Promote Neural Stem Cell Reactivation.Cell Rep 271029212933.e 510.1016/j.celrep.2019.05.02331167138 PMC 6581792 · doi ↗ · pubmed ↗
- 8Gramates LS Agapite J Attrill H Calvi BR Crosby MA Dos Santos G Goodman JL Goutte-Gattat D Jenkins VK Kaufman T Larkin A Matthews BB Millburn G Strelets VB the Fly Base Consortium. 202244 Fly Base: a guided tour of highlighted features.Genetics 22040016-673110.1093/genetics/iyac 03535266522 PMC 8982030 · doi ↗ · pubmed ↗
