# Degradation Determinants Are Abundant in Human Noncanonical Proteins and Minor Annotated Isoforms

**Authors:** Claudio Casola, Adekola Owoyemi, Nikolaos Vakirlis

PMC · DOI: 10.1093/gbe/evag009 · Genome Biology and Evolution · 2026-01-19

## TL;DR

The study finds that noncanonical proteins are rich in features linked to degradation, suggesting many are unstable but some may still be biologically functional.

## Contribution

The study introduces C-terminal tail hydrophobicity as a proxy for degradation propensity in noncanonical proteins.

## Key findings

- Noncanonical proteins are enriched for degradation-related features compared to canonical proteins.
- C-terminal tail hydrophobicity is a reliable indicator of degradation propensity in noncanonical proteins.
- Noncanonical proteins with phenotypic effects upon knock-out show higher stability.

## Abstract

The comprehensive characterization of human proteins, a key objective in contemporary biology, has been revolutionized by the identification of thousands of potential novel proteins through ribosome profiling and proteomics. Determining the physiological activity of these noncanonical proteins has proven difficult, because they are encoded by different types of coding regions and tend to share no sequence similarity with canonical polypeptides. Evidence from immunopeptidomic assays combined with a better understanding of the quality control of protein synthesis suggest that many noncanonical proteins may possess low stability in the cellular environment. Here, we tested this hypothesis by analyzing the frequency of multiple sequence features eliciting either proteasomal degradation or autophagy across 91,003 canonical (annotated) protein isoforms and 11,499 noncanonical proteins. Overall, noncanonical proteins were enriched for degradation-related features compared to all canonical proteins. Notably, degradation determinants were also enriched in canonical protein isoforms starting with a non-methionine amino acid. Analyses of original and shuffled sequences showed evidence of selective pressure either against or toward the accumulation of specific degradation signatures only in major isoforms of canonical proteins. However, stability was significantly higher in noncanonical proteins with evidence of phenotypic effects upon knock-out in cell lines. Notably, we found that the C-terminal tail hydrophobicity represents a reliable proxy for degradation propensity with potential applications in identifying functional noncanonical proteins. These findings underscore the critical role of degradation processes in regulating the half-life of noncanonical proteins and demonstrate the power of degradation-associated signatures in discriminating noncanonical genes likely to encode biologically functional molecules.

## Full-text entities

- **Genes:** MRLN (myoregulin) [NCBI Gene 100507027] {aka LINC00948, Linc-RAM, M1, MLN, MUSER1}, BAG6 (BAG cochaperone 6) [NCBI Gene 7917] {aka BAG-6, BAT3, D6S52E, G3}, SLN (sarcolipin) [NCBI Gene 6588], HSPA8 (heat shock protein family A (Hsp70) member 8) [NCBI Gene 3312] {aka HEL-33, HEL-S-72p, HSC54, HSC70, HSC71, HSP71}, PLN (phospholamban) [NCBI Gene 5350] {aka CMD1P, CMH18, PLB}, SGTA (small glutamine rich tetratricopeptide repeat co-chaperone alpha) [NCBI Gene 6449] {aka SGT, SGT1, Vpu, alphaSGT, hSGT}, CP (ceruloplasmin) [NCBI Gene 1356] {aka AB073614, CP-2}, GABARAPL1 (GABA type A receptor associated protein like 1) [NCBI Gene 23710] {aka APG8-LIKE, APG8L, ATG8, ATG8B, ATG8L, GEC1}, LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}, MAP1LC3A (microtubule associated protein 1 light chain 3 alpha) [NCBI Gene 84557] {aka ATG8E, LC3, LC3A, MAP1ALC3, MAP1BLC3}, MTLN (mitoregulin) [NCBI Gene 205251] {aka LEMP, LINC00116, MOXI, MPM, NCRNA00116, SMIM37}
- **Diseases:** CP (MESH:D002972), mCDS (MESH:C537834), CMA (MESH:C564093), CTTH (OMIM:211750), Terminal Intrinsic Disorder (MESH:D020919), NCP (MESH:D011488), toxicity (MESH:D064420)
- **Chemicals:** C (MESH:D002244), acids (MESH:D000143), lipid (MESH:D008055), amino acids (MESH:D000596), CTTH (-)
- **Species:** Cercopithecidae (monkey, family) [taxon 9527], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** H9 — Homo sapiens (Human), Sezary syndrome, Cancer cell line (CVCL_1240), K562 — Homo sapiens (Human), Blast phase chronic myelogenous leukemia, BCR-ABL1 positive, Cancer cell line (CVCL_0004)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12863090/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12863090/full.md

## References

94 references — full list in the complete paper: https://tomesphere.com/paper/PMC12863090/full.md

---
Source: https://tomesphere.com/paper/PMC12863090