# RAMZIS: a bioinformatic toolkit for rigorous assessment of the alterations to glycoprotein composition that occur during biological processes

**Authors:** William Edwin Hackett, Deborah Chang, Luis Carvalho, Joseph Zaia

PMC · DOI: 10.1093/bioadv/vbae012 · Bioinformatics Advances · 2024-01-25

## TL;DR

RAMZIS is a new tool that helps researchers better understand changes in glycoprotein composition during biological processes by analyzing proteomics data more rigorously.

## Contribution

RAMZIS introduces a novel R package using similarity metrics to assess glycoproteomics data quality and biological significance.

## Key findings

- RAMZIS uses permutation tests to generate contextual similarity for assessing glycopeptide abundance differences.
- The tool enables comparison of datasets that are stochastic, small, or sparse without relying on interpolation.
- RAMZIS is validated through theoretical cases and a proof-of-concept application.

## Abstract

Glycosylation elaborates the structures and functions of glycoproteins; glycoproteins are common post-translationally modified proteins and are heterogeneous and non-deterministically synthesized as an evolutionarily driven mechanism that elaborates the functions of glycosylated gene products. Glycoproteins, accounting for approximately half of all proteins, require specialized proteomics data analysis methods due to micro- and macro-heterogeneities as a given glycosite can be divided into several glycosylated forms, each of which must be quantified. Sampling of heterogeneous glycopeptides is limited by mass spectrometer speed and sensitivity, resulting in missing values. In conjunction with the low sample size inherent to glycoproteomics, a specialized toolset is needed to determine if observed changes in glycopeptide abundances are biologically significant or due to data quality limitations.

We developed an R package, Relative Assessment of m/z Identifications by Similarity (RAMZIS), that uses similarity metrics to guide researchers to a more rigorous interpretation of glycoproteomics data. RAMZIS uses a permutation test to generate contextual similarity, which assesses the quality of mass spectral data and outputs a graphical demonstration of the likelihood of finding biologically significant differences in glycosylation abundance datasets. Investigators can assess dataset quality, holistically differentiate glycosites, and identify which glycopeptides are responsible for glycosylation pattern change. RAMZIS is validated by theoretical cases and a proof-of-concept application. RAMZIS enables comparison between datasets too stochastic, small, or sparse for interpolation while acknowledging these issues in its assessment. Using this tool, researchers will be able to rigorously define the role of glycosylation and the changes that occur during biological processes.

https://github.com/WillHackett22/RAMZIS.

## Full-text entities

- **Genes:** TF (transferrin) [NCBI Gene 7018] {aka HEL-S-71p, PRO1557, PRO2086, TFQTL1}, HP (haptoglobin) [NCBI Gene 3240] {aka HP2ALPHA2, HPA1S}
- **Diseases:** Glioblastoma (MESH:D005909), Influenza (MESH:D007251), AGP (MESH:D006009)
- **Chemicals:** Glycopeptide (MESH:D006020), glycan (MESH:D011134), lactosamine (MESH:C032233), hexose (MESH:D006601), GlycReSoft (-), N (MESH:D009584), asparagines (MESH:D001216)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC10879752/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC10879752/full.md

## References

28 references — full list in the complete paper: https://tomesphere.com/paper/PMC10879752/full.md

---
Source: https://tomesphere.com/paper/PMC10879752