Cornified Epithelial Teeth of Jawless Vertebrates Contain Proteins Similar to Keratin-Associated Proteins of Mammalian Skin Appendages
Attila Placido Sachslehner, David A. D. Parry, Leopold Eckhart

TL;DR
Jawless vertebrates like the sea lamprey have proteins in their teeth similar to those in mammalian skin appendages, suggesting a shared function despite independent evolution.
Contribution
Discovery of KRTAP-like proteins in lamprey teeth, revealing a parallel evolution to mammalian keratin-associated proteins.
Findings
KRTAP-like proteins in lamprey teeth share features like high cysteine and tyrosine content with mammalian KRTAPs.
Lamprey KRTAP-like genes are clustered and contain two exons, differing from the single exon structure in mammals.
KRTAP-like proteins are conserved across diverse lamprey species, indicating functional importance.
Abstract
Keratins and keratin-associated proteins (KRTAPs) are the main components of mammalian nails and hair. Comparative genomics and gene expression studies have revealed that keratins are conserved in all vertebrates, whereas KRTAPs exist only in mammals. Recently, we found hair keratin-like cysteine-rich keratins in jawless vertebrates with confirmed expression in the cornified epithelial teeth of the sea lamprey (Petromyzon marinus). Here, we report that KRTAP-like proteins are also present in the horny teeth of the lamprey. Mass spectrometry-based proteomics identified proteins that share features with KRTAPs, such as high contents of cysteine and tyrosine residues, which support intermolecular interactions, and abundant glycine residues, which endow the proteins with flexibility. Genes encoding KRTAP-like proteins are arranged in a cluster in P. marinus, and the presence of at least one…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5- —Austrian Science Fund (FWF)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSkin and Cellular Biology Research · Wnt/β-catenin signaling in development and cancer · Hair Growth and Disorders
1. Introduction
Skin appendages, such as hair, nails, feathers, and scales, are prominent traits of amniotes [1,2,3,4,5]. Hairs are characteristic skin appendages of mammals, and feathers are characteristic for birds. However, cornified skin structures are also found in other vertebrates. Among amphibians, several clades of frogs and salamanders have claws [6] and/or a cornified beak in the developmental stage of tadpoles [7]. Breeding tubercles develop on some teleost fish [8,9], but generally, jawed fish do not have cornified skin appendages. By contrast, all jawless vertebrates (cyclostomes), comprising lampreys and hagfish, form cornified epithelial teeth [10,11,12,13].
Cornified skin appendages consist of dead epithelial cells that are packed with specific proteins [4]. The main protein components of hair and nails are keratins, which form intermediate filaments, and keratin-associated proteins (KRTAPs) [14,15]. The latter are relatively small proteins that share neither a specific fold nor a specific sequence motif, but they fall into clearly defined families known as high-sulfur (HS), ultra-high-sulfur (UHS), and high-glycine–tyrosine (HGT) proteins [16,17,18]. Members of these KRTAP families are either rich in cysteine residues (HS and UHS KRTAPs) or rich in glycine and tyrosine residues (HGT KRTAPs). The common ancestry of KRTAPs is supported by the fact that they all share the same gene structure, that is, a single exon containing the entire coding sequence, and many of the gene family members are arranged in clusters that have apparently evolved by repeated gene duplications [19,20,21]. One cluster of KRTAP genes, localized on chromosome 17q21.2, comprises KRTAPs of the HS and UHS families, and another cluster, localized on chromosome 21q22.1, comprises HS KRTAPs and HGT KRTAPs [22]. Given that the KRTAP gene cluster on chromosome 17q21.2 is located within the type I keratin gene cluster, it is conceivable that the primordial KRTAP gene evolved from a keratin gene [23]. Specifically, KRTAPs may be derived from the first exon of a keratin gene because this exon encodes a protein segment which lacks the predominantly alpha-helical structure of the central rod domain and mediates protein–protein interactions in a way similar to KRTAPs [24,25]. Thus, while keratin intermediate filaments provide the core elements of the epithelial cytoskeleton [26,27], the matrix-forming KRTAPs bind them together (Figure 1). KRTAPs are absent from soft epithelia and present in hard mammalian epithelial structures, such as hair, quills, nails, bovine horn, rhino horn, baleen of whales, and filiform papillae on the dorsal tongue surface of many species [28,29,30,31].
Numerous publications have used the term “keratin-associated proteins” to generally refer to proteins that bind to keratins [32]. Examples of such proteins include corneous beta proteins (CBPs), also called beta-keratins, which form both 3.4 nm diameter filaments and the matrix that lies between them [33]. Other examples are proteins encoded by genes of the epidermal differentiation complex both in sauropsids and mammals [4]. Two of them are filaggrin, a keratin filament aggregating protein of the S100 fused-type protein family [34], and trichohyalin, which contains a specific sequence motif implicated in the binding to keratins [35].
Based on protein extraction and two-dimensional electrophoretic analysis, the horny teeth of the Lombardy lamprey (Lethenteron zanandreai) were reported to contain both keratins and keratin-associated proteins [12]. Recently, we could show by mass spectrometry-based proteomic analysis that the horny teeth of the sea lamprey (Petromyzon marinus) contain cysteine-rich keratins which have evolved independently from mammalian hair keratins and sauropsid-specific hard keratins [31,36]. Here, we searched for KRTAP-like proteins in the proteome of lamprey teeth and corresponding genes in the genomes of jawless vertebrates.
2. Materials and Methods
2.1. Comparative Genomics
The following genome sequences of jawless vertebrates were investigated: Petromyzon marinus (sea lamprey) genome assembly: GCF_010993605.1, assembly name: kPetMar1.pri, assembly provider: Vertebrate Genomes Project, annotation name: Petromyzon marinus annotation release 100, annotation provider: NCBI; Lethenteron reissneri (Far Eastern brook lamprey) genome assembly: GCF_015708825.1, assembly name: ASM1570882v1, assembly provider: Center for Ecological and Environmental Sciences [37], annotation name: GCF_015708825.1-RS_2023_12, annotation provider: NCBI RefSeq. Furthermore, we investigated the unannotated whole-genome shotgun sequences of Mordacia mordax (Australian lamprey), GenBank accession number JBCLOA000000000.1, and Geotria australis (pouched lamprey), GenBank accession number JBCLOB000000000.1 (both submitted by Hardy, C.M.; Court, L.; Rane, R.; Walsh, T.; and Pandey, G., H&B and Environment, CSIRO Applied Genomics Initiative, Parkville, Victoria, Australia), as well as river lamprey (Lampetra fluviatilis), GenBank accession number CAXMYT000000000.1, submitted by the Wellcome Sanger Tree of Life Programme, Wellcome Genome Campus, Hinxton, U.K.
KRTAP-like genes were first identified in the current genome annotation of the sea lamprey. The gene LOC116956410 was annotated as “keratin-associated protein 5-1-like [Petromyzon marinus (sea lamprey)]”. Using the encoded protein, XP_032833892.1, as a query in tBLASTn searches, we screened for similar genes adjacent to the locus of LOC116956410 and in the entire genome of the sea lamprey. Additional BLAST searches were performed on other cyclostome genome sequences available in GenBank. Amino acid sequences were aligned with MUltiple Sequence Comparison by Log-Expectation (MUSCLE) [38] and MultAlin [39].
2.2. Analysis of Proteome Data of Sea Lamprey
The proteome of horny teeth of the sea lamprey (Proteomics Identification database, PRIDE, accession number PXD048873) [40] was screened for proteins with sequence features similar to those of mammalian KRTAPs. Cornified teeth and the skin of a sea lamprey specimen (inventory number NMW-63577, kindly provided by the Natural History Museum Vienna) were lysed in a buffer containing urea, thiourea, 3-([3-Cholamidopropyl]dimethylammonio)-2-hydroxy-1-propanesulfonate, and dithiothreitol and analyzed by mass spectrometry-based proteomics, as reported previously [40]. Proteins were identified using the “NCBI_Petromyzon_marinus_tx7757_230919.fasta” dataset, which was downloaded from the Common Repository of Adventitious Proteins (https://www.thegpm.org/crap/, last accessed on 7 March 2025). To test whether the peptide sequences observed are present only in lamprey KRTAPLs or if they could have come from human contaminants, we used the sequences as queries in BLASTp searches in the human proteome. The absence of BLAST hits indicated that the peptides were not derived from contaminating human KRTAPs or other human proteins.
3. Results
3.1. Prediction of KRTAP-like Genes in Sea Lamprey
We compared lamprey tooth proteins identified by proteomic analysis [31,40] to proteins of other vertebrates and localized the genes that encode these proteins in the genome sequence of the sea lamprey [41,42]. A group of KRTAP-like (KRTAPL) genes, tentatively named KRTAPL1 through KRTAPL6, was identified in the form of a gene cluster on chromosome 65 of the sea lamprey (Figure 2a). A KRTAP-like gene is present at a syntenic locus on chromosome 68 of the Far Eastern brook lamprey (Lethenteron reissneri), which is the second species of lampreys for which a chromosome-level genome assembly is available (Figure 2b). At another locus on chromosome 65 of the sea lamprey, a further KRTAP-like gene, KRTAPL7, was identified (Figure 2c).
The KRTAP-like genes of lampreys comprise two exons, which both contain protein-coding sequences (Figure 2d). Important sequence elements, such as a canonical TATA box in the proximal promoter, a canonical splice donor site, and a canonical splice acceptor site are conserved in KRTAPL orthologs of different species of lampreys (Figure S1), including the pouched lamprey (Geotria australis) and the Australian lamprey (Mordacia mordax), which diverged from the lineage leading to the sea lamprey more than 90 million years ago [43]. KRTAPL3 deviates from the consensus gene organization of KRTAPLs because the first in-frame start codon is located in the second exon. KRTAPL5 and KRTAPL6 are canonical KRTAPLs but have been erroneously combined in a single gene prediction with three exons in the currently available genome annotation of the sea lamprey (Figure S2).
3.2. The Detection of KRTAP-like Proteins in the Horny Teeth of the Sea Lamprey
The analysis of the proteomics data of the horny teeth of the sea lamprey [31,40] (Table S1) revealed the presence of tryptic peptides corresponding to KRTAPL1, KRTAPL2, KRTAPL3, KRTAPL6, and KRTAPL7 (Table 1 and Table S2). Among the 107 proteins identified by proteomics, 5 corresponded to KRTAPLs (Table S1). The positions of the peptides within the KRTAPL proteins are indicated in Figure 3.
KRTAPLs of the sea lamprey are in the molecular mass range of 15–32 kilo-Dalton (Table 2), which is smaller than the molecular masses of keratin intermediate filament proteins [31] but within the range of the molecular masses of mammalian KRTAPs [30]. The isoelectric point of all lamprey KRTAPLs is slightly basic (Table 2).
3.3. KRTAP-like Proteins of the Lamprey Are Rich in Glycine, Tyrosine, Cysteine, and Histidine
All lamprey KRTAPLs have a glycine content in the range of 23.2–36.4%, which is similar to the glycine content of prototypical mammalian KRTAPs (Table 2). Four out of seven KRTAPLs have a tyrosine content above 9%, and four out of seven KRTAPLs have a cysteine content above 8%, with KRTAPL6 containing more than 21% cysteine residues (Table 2). Cysteine and tyrosine residues are considered critical for the binding of KRTAPs to keratin intermediate filaments and possibly other proteins [30,44,45]. Notably, lamprey KRTAPLs have a markedly higher histidine content than human KRTAPs of all three subclasses (Table 2). A sequence analysis showed that the biased amino acid composition of lamprey KRTAPLs is mainly caused by the presence of the aforementioned residues within short sequence repeats (Figure S3).
The sequence alignment of KRTAPL1 proteins of phylogenetically diverse species of lampreys showed a high degree of conservation over the entire length of the protein, with variation arising mainly from different lengths of repetitive sequences (Figure 4). No orthologs of lamprey KRTAPLs were found in whole-genome tBLASTn searches and the detailed analysis of syntenic chromosomal loci (Figure 2) in jawed fishes and tetrapods.
4. Discussion
The results of this study show a family of proteins that are components of horny teeth. These proteins are relatively small and have an amino acid composition biased towards glycine, tyrosine, and cysteine. In this regard, they are similar to mammalian KRTAPs, which are components of hair, nails, quills, and other hard cornified structures [30,46,47]. KRTAPLs and KRTAPs differ with regard to the gene structure (two exons in KRTAPLs and only one exon in KRTAPs) and the abundance of specific amino acids. For example, arginine is abundant in lamprey KRTAPL6 but not in human KRTAPs, and the histidine content is generally higher in KRTAPLs than in KRTAPs (Table 2). The differences in the gene structure and the species distributions of the genes suggest a model for the evolution of KRTAPLs and KRTAPs, which is depicted in Figure 5. According to this model, the evolution of the horny teeth of lampreys did not only involve the evolution of cysteine-rich keratins [31] but also the evolution of KRTAPLs. The latter are the products of convergent evolution relative to mammalian KRTAPs.
The present study supports and extends the concept that proteins with sequence similarities to mammalian KRTAPs exist in non-mammalian species [48]. A recent study explored the phylogenetic history of KRTAPs and proteins of similar sequence features, but no support for common ancestry was found [49]. The comparison of gene structures and the limitation of sequence similarities to stretches of low sequence complexity indicates that KRTAPs and lamprey KRTAPLs have not evolved from a common ancestral gene. Accordingly, we conclude that non-mammalian taxa including sauropsids (reptiles and birds) and lampreys do not have homologs, i.e., proteins of common evolutionary ancestry of KRTAPs. Given the apparent co-localization of KRTAPs and KRTAPLs with keratins in hard cornified epithelial structures, we propose that the similarities of KRTAPs and KRTAPLs have not arisen by chance but in the course of convergent evolution. The assumptions underlying this hypothesis need to be investigated further.
This study provides new insights into the protein composition of horny teeth. Using proteomics [40], KRTAPLs were detected in the teeth but not in the skin. This pattern fits with the predominant expression of mammalian KRTAPs at sites of hard cornification [30]. However, the absence of detection in the skin cannot be regarded as proof of absence. Therefore, it will be important to determine the tissue expression pattern of KRTAPLs in lampreys using alternative methods, such as RNA-seq, RT-PCR, and antibody-dependent methods, such as immunoblot analysis and immunohistochemistry. It will be interesting to study KRTAPL gene expression during the development, homeostasis, and regeneration of the horny teeth of lampreys.
The development and differentiation of the horny teeth of lampreys has not been fully characterized so far. Some studies addressed specific questions, such as the roles of sulfhydryl oxidase [50] and transglutaminases [40] in protein cross-linking. Cysteine-rich keratins and KRTAPLs of horny teeth are candidate markers of epithelial differentiation and likely functional equivalents of structural proteins in hard skin appendages of land-dwelling vertebrates [4,51].
Both KRTAPs and KRTAPLs are present in hard cornified epithelial structures and their amino acid compositions are similar. However, it remains to be experimentally tested whether KRTAPs and KRTAPLs have similar functions as matrix proteins between intermediate filaments, as depicted in Figure 1. Investigating the functions of these proteins is difficult because hard cornification cannot be fully mimicked in vitro [52]. Gene knockout studies are presumably complicated by the functional redundancy of genes of the same family, similar to the study of subgroups of keratins [53,54]. Despite these challenges being anticipated, the present study provides a basis for the design of future studies into the molecular structure of cornified teeth in jawless vertebrates.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Akat E. YenmişM. Pombal M.A. Molist P. Megías M. Arman S. VeselỳM. Anderson R. Ayaz D. Comparison of vertebrate skin structure at class level: A review Anat. Rec.20223053543360810.1002/ar.2490835225424 · doi ↗ · pubmed ↗
- 2Schempp C. Emde M. Wölfle U. Dermatology in the Darwin anniversary. Part 1: Evolution of the integument J. Dtsch. Dermatol. Ges.2009775075710.1111/j.1610-0387.2009.07193.x 19712227 · doi ↗ · pubmed ↗
- 3Dhouailly D. A new scenario for the evolutionary origin of hair, feather, and avian scales J. Anat.200921458760610.1111/j.1469-7580.2008.01041.x 19422430 PMC 2736124 · doi ↗ · pubmed ↗
- 4Holthaus K.B. Steinbinder J. Sachslehner A.P. Eckhart L. Skin appendage proteins of tetrapods: Building blocks of claws, feathers, hair and other cornified epithelial structures Animals 20251545710.3390/ani 1503045739943227 PMC 11816140 · doi ↗ · pubmed ↗
- 5Alibardi L. Keratinization and cornification are not equivalent processes but keratinization in fish and amphibians evolved into cornification in terrestrial vertebrates Exp. Dermatol.20223179479910.1111/exd.1452535007368 · doi ↗ · pubmed ↗
- 6Maddin H.C. Eckhart L. Jaeger K. Russell A.P. Ghannadan M. The anatomy and development of the claws of Xenopus laevis (Lissamphibia: Anura) reveal alternate pathways of structural evolution in the integument of tetrapods J. Anat.200921460761910.1111/j.1469-7580.2009.01052.x 19422431 PMC 2736125 · doi ↗ · pubmed ↗
- 7Alibardi L. Cornification of the beak of Rana dalmatina tadpoles suggests the presence of basic keratin-associated proteins Zool. Stud.2010495163
- 8Mittal A.K. Whitear M. Keratinization of fish skin with special reference to the catfish Bagarius bagarius Cell Tissue Res.197920221323010.1007/BF 00232236519704 · doi ↗ · pubmed ↗
