# Large Data Set Analysis Reveals Structural Origin of Peptide Collisional Cross Section Bimodal Behavior

**Authors:** Allyn M. Xu, Dániel Szöllősi, Helmut Grubmüller, Oded Regev

PMC · DOI: 10.1021/jasms.5c00325 · Journal of the American Society for Mass Spectrometry · 2025-12-21

## TL;DR

This study explains why some peptides have two distinct collisional cross-sectional area modes using machine learning and simulations.

## Contribution

The paper identifies basic site positioning as a key factor in determining peptide CCS modes.

## Key findings

- Peptides in the high CCS mode adopt extended, helical conformations.
- Low CCS mode peptides tend to have compact, globular structures.
- Protonation near the C-terminus supports helix formation in high CCS mode.

## Abstract

Recent advances in ion mobility spectrometry have enabled
the measurement
of rotationally averaged collisional cross-sectional area (CCS) for
millions of peptides as part of routine proteomic mass spectrometry
workflows. One of the most striking findings in recent large ion mobility
data sets is that CCS exhibits two distinct modes, most notably for
charge 3+ peptides, with peptides predominantly exhibiting
CCS in either the high or low mode. Here, using classical machine
learning approaches, we identify that basic site positioning is a
key sequence feature determining a peptide’s CCS mode. Molecular
dynamics simulations suggest that peptides in the high CCS mode tend
to adopt more extended conformations and form charge-stabilized helical
structures, whereas those in the low CCS mode adopt more compact,
globular conformations. Further supporting this structural hypothesis,
we provide evidence for preferential protonation near the C-terminus
and uncover multiple position-dependent sequence determinants that
all suggest the predominance of helix formation in the high CCS mode.
Together, these findings will enable better integration of CCS measurements
into protein identification and quantification pipelines, improving
the performance of ion mobility-based proteomics.

## Full-text entities

- **Genes:** CCS (copper chaperone for superoxide dismutase) [NCBI Gene 9973], GLS (glutaminase) [NCBI Gene 2744] {aka AAD20, CASGID, DEE71, EIEE71, GAC, GAM}
- **Chemicals:** tryptophan (MESH:D014364), polyalanine (MESH:C019529), amino acid (MESH:D000596), hydrocarbon (MESH:D006838), valine (MESH:D014633), Glycine (MESH:D005998), Ac-Ala n -LysH (-), acid (MESH:D000143), lysine (MESH:D008239), arginine (MESH:D001120), amine (MESH:D000588), Ala (MESH:D000409), proline (MESH:D011392), histidine (MESH:D006639)
- **Species:** C. elegans [taxon 328850], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Escherichia coli (E. coli, species) [taxon 562], Drosophila melanogaster (fruit fly, species) [taxon 7227], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12784393/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12784393/full.md

## References

41 references — full list in the complete paper: https://tomesphere.com/paper/PMC12784393/full.md

---
Source: https://tomesphere.com/paper/PMC12784393