# Identification of Short Amino Acid Sequences That Correlate with Cytoplasmic Retention of Human Proteins

**Authors:** Jay C. Brown, Baomin Wang

PMC · DOI: 10.3390/cells15020133 · Cells · 2026-01-12

## TL;DR

The paper introduces a new method to find short amino acid sequences that help keep human proteins in the cytoplasm, identifying 15 such sequences.

## Contribution

A novel bioinformatic method was developed and applied to identify 15 candidate cytoplasmic retention sequences in human proteins.

## Key findings

- Cytoplasmic retention sequences are consistently located at the same site across protein isoforms.
- Fifteen candidate cytoplasmic retention peptides were identified, ten of which have unrelated amino acid sequences.
- The amino acid sequence of a cytoplasmic retention site influences the sub-region of the cytoplasm a protein occupies.

## Abstract

What are the main findings?
Presented here is a novel method to identify cytoplasmic retention amino acid sequences within the sequence of human proteins.The new method is used to identify fifteen candidate cytoplasmic retention sequences in the human genome.

Presented here is a novel method to identify cytoplasmic retention amino acid sequences within the sequence of human proteins.

The new method is used to identify fifteen candidate cytoplasmic retention sequences in the human genome.

What is the implication of the main finding?
Cytoplasmic retention sequences were found to be located at the same site in all isoforms of a given protein regardless of the sequence of the retention sequence itself.

Cytoplasmic retention sequences were found to be located at the same site in all isoforms of a given protein regardless of the sequence of the retention sequence itself.

One group of human proteins found in the cytoplasm, but not in the nucleus, is characterized by the presence of short (6–9 aa), specific amino acid sequences thought to be involved in retaining proteins in the cytoplasm (cytoplasmic retention sequences). While strong evidence supports the ability of some peptides to act in this way, the number of such supported cases is small. We have taken the view that the situation would be improved by enhancing the methods available to identify cytoplasmic retention (CR) sequences. Here, we describe an appropriate bioinformatic method to identify CR peptides using information about their location at the ends of cytoplasmic proteins. The method was then used to link seven different human cytoplasmic proteins with sequences suggested to have cytoplasmic retention activity. Further bioinformatic analysis was carried out with isoforms of the cytoplasmic proteins identified. Amino acid sequence information showed that while the proposed CR amino acid sequences can be the same or distinct in different protein isoforms, they are always located at the same site in the protein. For instance, while the proposed retention sequence of CCDC57 isoform X18 is MLARLVSNS, in isoform 7 it is SEPALNEL, yet the two sequences are each located between amino acids 5 and 13 in the CCDC57 sequence. The results support the view that the protein isoform is involved in determining the location of the CR sequence in a protein, while the amino acid sequence itself affects other variables such as the sub-region of the cytoplasm the protein needs to occupy. Overall, the study yielded identification of 15 candidate CR peptides in which 10 of the 15 have unrelated amino acid sequences.

## Linked entities

- **Proteins:** CCDC57 (coiled-coil domain containing 57)

## Full-text entities

- **Genes:** CCDC57 (coiled-coil domain containing 57) [NCBI Gene 284001]
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12839037/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12839037/full.md

## References

15 references — full list in the complete paper: https://tomesphere.com/paper/PMC12839037/full.md

---
Source: https://tomesphere.com/paper/PMC12839037